Skip to content

Nightly Evaluations #20

Nightly Evaluations

Nightly Evaluations #20

Manually triggered February 25, 2026 17:38
Status Failure
Total duration 32s
Artifacts

evals-nightly.yml

on: workflow_dispatch
Matrix: evaluate
Fit to window
Zoom out
Zoom in

Annotations

14 errors and 5 warnings
Evaluate gemini-2.5-pro
Process completed with exit code 1.
Evaluate gemini-2.5-pro
Process completed with exit code 1.
Evaluate gemini-3-pro-preview
The strategy configuration was canceled because "evaluate.gemini-2_5-pro" failed
Evaluate gemini-3-pro-preview
Process completed with exit code 1.
Evaluate gemini-3-pro-preview
The operation was canceled.
Evaluate gemini-2.5-flash
The strategy configuration was canceled because "evaluate.gemini-2_5-pro" failed
Evaluate gemini-2.5-flash
Process completed with exit code 1.
Evaluate gemini-2.5-flash
The operation was canceled.
Evaluate gemini-2.5-flash-lite
The strategy configuration was canceled because "evaluate.gemini-2_5-pro" failed
Evaluate gemini-2.5-flash-lite
Process completed with exit code 1.
Evaluate gemini-2.5-flash-lite
The operation was canceled.
Evaluate gemini-3-flash-preview
The strategy configuration was canceled because "evaluate.gemini-2_5-pro" failed
Evaluate gemini-3-flash-preview
Process completed with exit code 1.
Evaluate gemini-3-flash-preview
The operation was canceled.
Evaluate gemini-2.5-pro
No files were found with the provided path: eval-results-gemini-2.5-pro.json. No artifacts will be uploaded.
Evaluate gemini-3-pro-preview
No files were found with the provided path: eval-results-gemini-3-pro-preview.json. No artifacts will be uploaded.
Evaluate gemini-2.5-flash
No files were found with the provided path: eval-results-gemini-2.5-flash.json. No artifacts will be uploaded.
Evaluate gemini-2.5-flash-lite
No files were found with the provided path: eval-results-gemini-2.5-flash-lite.json. No artifacts will be uploaded.
Evaluate gemini-3-flash-preview
No files were found with the provided path: eval-results-gemini-3-flash-preview.json. No artifacts will be uploaded.