Test CUDA Builds

[executorch][nvidia][tensorrt][22/n] Add correctness tests #10619

Sign in to view logs

Triggered via pull request March 5, 2026 18:09

shoumikhin

opened #17933

gh/shoumikhin/47/head

Status Failure

Total duration 1h 4m 57s

Artifacts 17

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Waiting for pending jobs

Matrix: test-model-cuda-e2e

Waiting for pending jobs

check-all-cuda-builds

Annotations

3 errors and 2 warnings

test-models-cuda (sdpa) / linux-job

Process completed with exit code 1.

test-models-cuda (resnet18) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job

No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.

export-model-cuda-artifact (openai, whisper-large-v3-turbo, non-quantized) / linux-job

Attempt 1 failed. Reason: Child_process exited with error code 255

Artifacts

Produced during runtime

Name	Size	Digest
Qwen-Qwen3-0.6B-cuda-non-quantized	1.1 GB	`sha256:52bd3b8535b32547fabe5e860ebacb92d8ca31a4020902cb42a2181e698ac9f0`
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed	559 MB	`sha256:bbd91fd30c8af2cfffd446e682a1a580270eec4d24b9a6379c824262491c1d31`
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only	1.1 GB	`sha256:09cf376abf5ce3165c78317d24ec02979c9cdde61ff190d150782ab564a51d3d`
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:d78b35b149f614d2ad61d4eb9f8081205547e784d0d314844bea015578079aa4`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:1722336107dc0c1074dca7b4a83f63854fc9a130a7831aca0bb9a00ca17a706d`
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized	6.82 GB	`sha256:30c247a838587b6cee25893944f5a90e69c32c6d0e4e4488c9fe556bbb90ce15`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:c71bd080823a7b288d97df5d245c7152166a8bf6879b7242cfd1119c98cbc2de`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:25f48745e797b867d3c8fb4679bca15b555568907614a7a38b1451f91faea32a`
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed	15.5 GB	`sha256:707f9304225d86cdc72a6de55b1f497186de925c2613bafad53ef3cd5239d1a2`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:62b5d4b39d135111d474a9a8b48c364e7cda844048fe515ff67091482419fc90`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:3f551267f1996fc2ae627c7ab346cda2f826eb486e83cf2be030b040f7f0031b`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:beba874a6c4d2605090f254cb6c04d6522d0856fb45e3c77cdea7e9eca388f3e`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:09cf3b3c0cd013a44a729bafd7b2cf5dc15d56236f3cc4a7ffc6e9a8bc58d5a5`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:639f022476ca7598fa03e4d9f257f6591075f73149de39f17bfc0f031d93de75`
openai-whisper-small-cuda-non-quantized	361 MB	`sha256:336139ff92a4edb9d24faf312bea6768491c5df7d0b3b5ff664bb263b606b66e`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:ba668675601adfcf9bce29a3b8fa7b1f1646c4090f1a6d906b5526b900b2b4b6`
openai-whisper-small-cuda-quantized-int4-weight-only	271 MB	`sha256:d5ad196f7efe7b4a814556a3b1703ffe2bb5e274d9c30512811f4406d8881a69`