[executorch][nvidia][tensorrt][22/n] Add correctness tests #10619

Triggered via pull request March 5, 2026 18:09
Status Failure
Total duration 1h 4m 57s
Artifacts 17

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job (25m 12s)
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds (4s)

Annotations

3 errors and 2 warnings
test-models-cuda (sdpa) / linux-job
Process completed with exit code 1.
test-models-cuda (resnet18) / linux-job
Process completed with exit code 1.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, non-quantized) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255

Artifacts

Produced during runtime
Name  Size  Digest
Qwen-Qwen3-0.6B-cuda-non-quantized  1.1 GB  sha256:52bd3b8535b32547fabe5e860ebacb92d8ca31a4020902cb42a2181e698ac9f0
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed  559 MB  sha256:bbd91fd30c8af2cfffd446e682a1a580270eec4d24b9a6379c824262491c1d31
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only  1.1 GB  sha256:09cf376abf5ce3165c78317d24ec02979c9cdde61ff190d150782ab564a51d3d
google-gemma-3-4b-it-cuda-non-quantized  7.22 GB  sha256:d78b35b149f614d2ad61d4eb9f8081205547e784d0d314844bea015578079aa4
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed  3.36 GB  sha256:1722336107dc0c1074dca7b4a83f63854fc9a130a7831aca0bb9a00ca17a706d
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized  6.82 GB  sha256:30c247a838587b6cee25893944f5a90e69c32c6d0e4e4488c9fe556bbb90ce15
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed  2.8 GB  sha256:c71bd080823a7b288d97df5d245c7152166a8bf6879b7242cfd1119c98cbc2de
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only  6.14 GB  sha256:25f48745e797b867d3c8fb4679bca15b555568907614a7a38b1451f91faea32a
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed  15.5 GB  sha256:707f9304225d86cdc72a6de55b1f497186de925c2613bafad53ef3cd5239d1a2
nvidia-parakeet-tdt-cuda-non-quantized  952 MB  sha256:62b5d4b39d135111d474a9a8b48c364e7cda844048fe515ff67091482419fc90
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed  443 MB  sha256:3f551267f1996fc2ae627c7ab346cda2f826eb486e83cf2be030b040f7f0031b
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only  430 MB  sha256:beba874a6c4d2605090f254cb6c04d6522d0856fb45e3c77cdea7e9eca388f3e
openai-whisper-large-v3-turbo-cuda-non-quantized  1.18 GB  sha256:09cf3b3c0cd013a44a729bafd7b2cf5dc15d56236f3cc4a7ffc6e9a8bc58d5a5
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only  485 MB  sha256:639f022476ca7598fa03e4d9f257f6591075f73149de39f17bfc0f031d93de75
openai-whisper-small-cuda-non-quantized  361 MB  sha256:336139ff92a4edb9d24faf312bea6768491c5df7d0b3b5ff664bb263b606b66e
openai-whisper-small-cuda-quantized-int4-tile-packed  172 MB  sha256:ba668675601adfcf9bce29a3b8fa7b1f1646c4090f1a6d906b5526b900b2b4b6
openai-whisper-small-cuda-quantized-int4-weight-only  271 MB  sha256:d5ad196f7efe7b4a814556a3b1703ffe2bb5e274d9c30512811f4406d8881a69