Test CUDA Builds

[executorch][nvidia][tensorrt][10/n] Add blob serialization format with I/O metadata #10612

Sign in to view logs

Triggered via pull request March 5, 2026 18:02

shoumikhin

opened #17921

gh/shoumikhin/35/head

Status Failure

Total duration 57m 44s

Artifacts 16

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Waiting for pending jobs

Matrix: test-model-cuda-e2e

Waiting for pending jobs

check-all-cuda-builds

Annotations

2 errors and 5 warnings

export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job

Process completed with exit code 1.

export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job

No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.

test-models-cuda (add_mul) / linux-job

Attempt 1 failed. Reason: Child_process exited with error code 255

export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job

No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.

export-model-cuda-artifact (openai, whisper-small, quantized-int4-weight-only) / linux-job

Attempt 1 failed. Reason: Child_process exited with error code 255

test-models-cuda (mv3) / linux-job

Attempt 1 failed. Reason: Child_process exited with error code 255

Artifacts

Produced during runtime

Name	Size	Digest
Qwen-Qwen3-0.6B-cuda-non-quantized	1.1 GB	`sha256:2513f05954e512301ecfee32001b5198f0beee10a6f2c7ebd63bb5d212cb8e87`
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only	1.1 GB	`sha256:8e8784e4e44248a47a5a92446b2578485617fdbc35f74e7e0fb491d3b0ad3563`
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:bf7d663ceb4f657815efedb6f12805ad9fbddd2cd990a5b2a4553b40ed2026bc`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:dc2c5481c51b329a06da319c4cdafec2b886c6ad99aa6043d73274d70242ced5`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:53bf309b762360f4e9c9ac43d42fec4bdd6d33f280062ba761c5c89dbf2d4f9a`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:e62b322ac7b1739beb748f62d3e8df4f834f0e6ffd7ee2027b6465738a6ea861`
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed	15.5 GB	`sha256:090ab9294e072bee1c822ed2916f8cff77559ea54170236c83febe9bf86879b6`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:cbfb09810b3ad33d0243d9416f1d9ee9f02591f7d9840d923df1d60907e91b1e`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:b0b5539aaedd3fa62bbfa595fe5115744303f52c42ecae25155046c1afece599`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:adf38aa8f00bd254b592fdafc692709e4885d0628bb0bfcdbb39625820161e8a`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:3030702842a808ef80a11f053ad7a8cd7f6b427535bd088e35e533ff72a5cfa6`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed	491 MB	`sha256:eca839341b2e05194af70eab1fedcfb709cd5ecba249bea597c4cb5aa3b20085`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:38dfb69596350fac4656db5b47bff5bc528a9f7dfca662ade079875624bc184c`
openai-whisper-small-cuda-non-quantized	361 MB	`sha256:b62b5d619f849d9e80eac8ef2404da7e5f40c7fc18b2bd6a806c40c6076333a9`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:59b7dcd872551e5c27b7621ad5b77ef66be0c5016431248ddce28f26caf4a21f`
openai-whisper-small-cuda-quantized-int4-weight-only	271 MB	`sha256:48c563991be6318d22cee7cbb664bf14e2eb56795cd8d88b381afc74f91b1334`