fix: support non-UTF-8 encodings in eval data loading #4100
CodeForgeNet wants to merge 1 commit into microsoft:main
Conversation
Fixes microsoft#3670

pd.read_json defaulted to UTF-8 only; files encoded with utf-8-sig (BOM) raised ValueError: Expected object or value.

- Added _detect_encoding() BOM detection in load_data.py, _evaluate.py, _utils.py
- Added fallback encoding chain: utf-8, utf-8-sig, latin-1, cp1252
- Improved error messages to show which encodings were attempted
- Added a test case and a utf-8-sig encoded test data file
@microsoft-github-policy-service agree
Hi, thank you for your interest in helping to improve the prompt flow experience and for your contribution. We've noticed that there hasn't been recent engagement on this pull request. If this is still an active work stream, please let us know by pushing some changes or leaving a comment.
Still active. Ready for review and merge whenever the team has bandwidth.
Hi, thank you for your contribution. Since there has not been recent engagement, we are going to close this out. Feel free to reopen if you'd like to continue working on these changes. Please be sure to remove the
Fixes #3670
The eval SDK only read JSONL files as UTF-8. If your data had a BOM (utf-8-sig),
which is common for multilingual content generated on Windows, loading failed
immediately with ValueError: Expected object or value. Not helpful.

The fix adds BOM detection before reading and a fallback chain
(utf-8 → utf-8-sig → latin-1 → cp1252) so the loader handles real-world
files without requiring users to re-encode their data.
Three files touched:

- promptflow/_utils/load_data.py: _pd_read_file() now detects encoding before calling pd.read_json() on .jsonl files
- evaluate/_evaluate.py: _validate_and_load_data() gets the same treatment
- evaluate/_utils.py: load_jsonl() updated with BOM detection + fallback

Added a utf-8-sig encoded test file with multilingual content and a unit test
that would have caught this from the start.
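A sketch of the kind of regression test described above, using a simplified stand-in for load_jsonl() (the real loader lives in evaluate/_utils.py and does more): write multilingual JSONL with a UTF-8 BOM, as Windows tools often do, and assert it round-trips.

```python
import json
import os
import tempfile

def load_jsonl(path):
    # Simplified stand-in for the fixed loader: utf-8-sig also decodes
    # plain UTF-8, so one open() call handles both BOM and non-BOM files.
    with open(path, "r", encoding="utf-8-sig") as f:
        return [json.loads(line) for line in f if line.strip()]

def test_load_jsonl_with_bom():
    rows = [{"question": "¿Qué hora es?"}, {"question": "何時ですか"}]
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "data.jsonl")
        # utf-8-sig on write prepends the BOM that triggered the original bug.
        with open(path, "w", encoding="utf-8-sig") as f:
            for row in rows:
                f.write(json.dumps(row, ensure_ascii=False) + "\n")
        assert load_jsonl(path) == rows
```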
Checklist