Skip to content

[executorch][nvidia][tensorrt][10/n] Add blob serialization format with I/O metadata #10612

[executorch][nvidia][tensorrt][10/n] Add blob serialization format with I/O metadata

[executorch][nvidia][tensorrt][10/n] Add blob serialization format with I/O metadata #10612

Triggered via pull request March 5, 2026 18:02
Status Failure
Total duration 57m 44s
Artifacts 16

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
27m 36s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds
4s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Annotations

2 errors and 5 warnings
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
test-models-cuda (add_mul) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
export-model-cuda-artifact (openai, whisper-small, quantized-int4-weight-only) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255
test-models-cuda (mv3) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized
1.1 GB
sha256:2513f05954e512301ecfee32001b5198f0beee10a6f2c7ebd63bb5d212cb8e87
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
1.1 GB
sha256:8e8784e4e44248a47a5a92446b2578485617fdbc35f74e7e0fb491d3b0ad3563
google-gemma-3-4b-it-cuda-non-quantized
7.22 GB
sha256:bf7d663ceb4f657815efedb6f12805ad9fbddd2cd990a5b2a4553b40ed2026bc
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
3.36 GB
sha256:dc2c5481c51b329a06da319c4cdafec2b886c6ad99aa6043d73274d70242ced5
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
2.8 GB
sha256:53bf309b762360f4e9c9ac43d42fec4bdd6d33f280062ba761c5c89dbf2d4f9a
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
6.14 GB
sha256:e62b322ac7b1739beb748f62d3e8df4f834f0e6ffd7ee2027b6465738a6ea861
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
15.5 GB
sha256:090ab9294e072bee1c822ed2916f8cff77559ea54170236c83febe9bf86879b6
nvidia-parakeet-tdt-cuda-non-quantized
952 MB
sha256:cbfb09810b3ad33d0243d9416f1d9ee9f02591f7d9840d923df1d60907e91b1e
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
443 MB
sha256:b0b5539aaedd3fa62bbfa595fe5115744303f52c42ecae25155046c1afece599
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
430 MB
sha256:adf38aa8f00bd254b592fdafc692709e4885d0628bb0bfcdbb39625820161e8a
openai-whisper-large-v3-turbo-cuda-non-quantized
1.18 GB
sha256:3030702842a808ef80a11f053ad7a8cd7f6b427535bd088e35e533ff72a5cfa6
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
491 MB
sha256:eca839341b2e05194af70eab1fedcfb709cd5ecba249bea597c4cb5aa3b20085
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
485 MB
sha256:38dfb69596350fac4656db5b47bff5bc528a9f7dfca662ade079875624bc184c
openai-whisper-small-cuda-non-quantized
361 MB
sha256:b62b5d619f849d9e80eac8ef2404da7e5f40c7fc18b2bd6a806c40c6076333a9
openai-whisper-small-cuda-quantized-int4-tile-packed
172 MB
sha256:59b7dcd872551e5c27b7621ad5b77ef66be0c5016431248ddce28f26caf4a21f
openai-whisper-small-cuda-quantized-int4-weight-only
271 MB
sha256:48c563991be6318d22cee7cbb664bf14e2eb56795cd8d88b381afc74f91b1334