You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Upgrade to llama-stack 0.6.0 and ragas 0.4.x (#64)
* Upgrade to llama-stack 0.6.0 and ragas 0.4.x
- Bump provider version to 0.7.0 targeting llama-stack >=0.6.0
- Upgrade ragas from ==0.3.0 to >=0.4.0,<0.5.0
- Add 6 new metrics: AnswerAccuracy, ContextRelevance, FactualCorrectness,
NoiseSensitivity, ResponseGroundedness, context_entity_recall
- Implement is_finished() on LLM wrappers (now required by BaseRagasLLM)
- Fix test fixture metric name (semantic_similarity -> answer_similarity)
- Update COMPATIBILITY.md with release/0.6.x branch and version entries
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Post-review fixes for ragas 0.4.x upgrade
- Fix EvaluationResult import in kubeflow components (ragas.dataset_schema → ragas.evaluation)
- Remove stale commented-out is_finished code from inline wrappers
- Eliminate deprecation-triggering lazy imports in base._get_metrics by using METRIC_MAPPING
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Address Sourcery review: guard default metrics, fix is_finished fallback, add tests
- Guard _DEFAULT_METRICS against METRIC_MAPPING drift with .get() + warning
- Replace unconditional `return True` in is_finished with content-based check
- Add unit tests for _get_metrics (6 tests) and is_finished (8 tests)
- Add nv_accuracy (AnswerAccuracy) to benchmark scoring_functions and test_direct_evaluation
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Remove deprecation warning suppression for ragas.metrics imports
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
0 commit comments