Skip to content

Skipped status and standardize output v2#5043

Closed
m7md7sien wants to merge 5 commits into
mainfrom
Skipped_Status_and_Standardize_Output_v2
Closed

Skipped status and standardize output v2#5043
m7md7sien wants to merge 5 commits into
mainfrom
Skipped_Status_and_Standardize_Output_v2

Conversation

@m7md7sien
Copy link
Copy Markdown
Contributor

No description provided.

Copilot AI and others added 5 commits May 13, 2026 22:27
…mpty files

Agent-Logs-Url: https://github.com/Azure/azureml-assets/sessions/50beeb9d-8306-4f00-ab60-7924ef98ecd4

Co-authored-by: ashaabansoliman <109526961+ashaabansoliman@users.noreply.github.com>
… output fields

- intent_resolution, relevance: rename 'explanation' -> 'reason', add 'status' field, add skipped handling
- response_completeness: add skipped handling to task section (already had json_object + reason/status)
- task_adherence: replace flagged/reasoning schema with score/reason/status, update all flag/unflag language to score 0/1
- task_completion: (already up to date)
- tool_call_accuracy: increase max_tokens from 3000 to 5000
- tool_call_success: rename explanation->reason, details->properties, success->score+status, add skipped handling
- tool_input_accuracy: rename chain_of_thought->reason, details->properties, result->score, add skipped handling
- tool_output_utilization: wrap faulty_details in properties object, rename label->score (pass/fail -> 1/0), add status field
- tool_selection: rename explanation->reason, details->properties, add status field and skipped handling

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: ashaabansoliman <109526961+ashaabansoliman@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 15, 2026

Test Results for assets-test

1 261 tests   886 ✅  59s ⏱️
   20 suites    0 💤
   20 files    375 ❌

For more details on these failures, see this check.

Results for commit dae266a.

♻️ This comment has been updated with latest results.

Copilot AI added a commit that referenced this pull request May 15, 2026
Replicate all changes from PR #5043 (#5043)
to standardize the output schema across 57 evaluator files.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: m7md7sien <16615690+m7md7sien@users.noreply.github.com>
@m7md7sien m7md7sien closed this May 17, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants