Skip to content

Nightly Evaluations #26

Nightly Evaluations

Nightly Evaluations #26

Manually triggered February 25, 2026 19:26
Status Failure
Total duration 25m 54s
Artifacts 5

evals-nightly.yml

on: workflow_dispatch
Matrix: evaluate
Fit to window
Zoom out
Zoom in

Annotations

4 errors
Evaluate gemini-2.5-flash-lite
Process completed with exit code 1.
Evaluate gemini-2.5-flash
Process completed with exit code 1.
Evaluate gemini-3-flash-preview
Process completed with exit code 1.
Evaluate gemini-3-pro-preview
Process completed with exit code 1.

Artifacts

Produced during runtime
Name Size Digest
eval-results-gemini-2.5-flash Expired
2.29 KB
sha256:d91ce473a9039091cefea9d85b40ba2ec7c122bd1b06b2e771d75ff2af9f5939
eval-results-gemini-2.5-flash-lite Expired
2.75 KB
sha256:a66e3a2c84eb5a5417f9185031897c8f2a3a6933c2aaf9810f12b4c63b7c081b
eval-results-gemini-2.5-pro Expired
2.07 KB
sha256:e6060456cd1f78f6052d707513caf1f5e7bdf52818474648a647ed8559d68789
eval-results-gemini-3-flash-preview Expired
2.53 KB
sha256:9cc3cff0b42aec173f965ca770c8baeb0d510246aeb9b87136e0bceffd68d606
eval-results-gemini-3-pro-preview Expired
2.62 KB
sha256:4b890b17ff12e02dbbc8a1bb7c150eb6534abd394f6238662a15ee9f44a492a5