[executorch][nvidia][tensorrt][10/n] Add blob serialization format with I/O metadata #10612
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
27m 36s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds
4s
Annotations
2 errors and 5 warnings
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Process completed with exit code 1.
|
|
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Process completed with exit code 1.
|
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
|
|
test-models-cuda (add_mul) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255
|
|
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
|
|
export-model-cuda-artifact (openai, whisper-small, quantized-int4-weight-only) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255
|
|
test-models-cuda (mv3) / linux-job
Attempt 1 failed. Reason: Child_process exited with error code 255
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
Qwen-Qwen3-0.6B-cuda-non-quantized
|
1.1 GB |
sha256:2513f05954e512301ecfee32001b5198f0beee10a6f2c7ebd63bb5d212cb8e87
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
|
1.1 GB |
sha256:8e8784e4e44248a47a5a92446b2578485617fdbc35f74e7e0fb491d3b0ad3563
|
|
|
google-gemma-3-4b-it-cuda-non-quantized
|
7.22 GB |
sha256:bf7d663ceb4f657815efedb6f12805ad9fbddd2cd990a5b2a4553b40ed2026bc
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
|
3.36 GB |
sha256:dc2c5481c51b329a06da319c4cdafec2b886c6ad99aa6043d73274d70242ced5
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
|
2.8 GB |
sha256:53bf309b762360f4e9c9ac43d42fec4bdd6d33f280062ba761c5c89dbf2d4f9a
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
|
6.14 GB |
sha256:e62b322ac7b1739beb748f62d3e8df4f834f0e6ffd7ee2027b6465738a6ea861
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
|
15.5 GB |
sha256:090ab9294e072bee1c822ed2916f8cff77559ea54170236c83febe9bf86879b6
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
|
952 MB |
sha256:cbfb09810b3ad33d0243d9416f1d9ee9f02591f7d9840d923df1d60907e91b1e
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
|
443 MB |
sha256:b0b5539aaedd3fa62bbfa595fe5115744303f52c42ecae25155046c1afece599
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
|
430 MB |
sha256:adf38aa8f00bd254b592fdafc692709e4885d0628bb0bfcdbb39625820161e8a
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
|
1.18 GB |
sha256:3030702842a808ef80a11f053ad7a8cd7f6b427535bd088e35e533ff72a5cfa6
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
|
491 MB |
sha256:eca839341b2e05194af70eab1fedcfb709cd5ecba249bea597c4cb5aa3b20085
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
|
485 MB |
sha256:38dfb69596350fac4656db5b47bff5bc528a9f7dfca662ade079875624bc184c
|
|
|
openai-whisper-small-cuda-non-quantized
|
361 MB |
sha256:b62b5d619f849d9e80eac8ef2404da7e5f40c7fc18b2bd6a806c40c6076333a9
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
|
172 MB |
sha256:59b7dcd872551e5c27b7621ad5b77ef66be0c5016431248ddce28f26caf4a21f
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
|
271 MB |
sha256:48c563991be6318d22cee7cbb664bf14e2eb56795cd8d88b381afc74f91b1334
|
|