Test CUDA Builds

ExecuTorch MLX delegate #10284

Sign in to view logs

Triggered via pull request March 3, 2026 00:53

metascroy

synchronize #16718

mlx-delegate

Status Success

Total duration 2h 24m 33s

Artifacts 17

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Matrix: test-model-cuda-e2e

check-all-cuda-builds

Artifacts

Produced during runtime

Name	Size	Digest
Qwen-Qwen3-0.6B-cuda-non-quantized	1.1 GB	`sha256:de25e14a98f17476033b66738d6e1afe027b7c21022ceca77444a73c580d3e66`
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed	559 MB	`sha256:38e7b1435c947355228b46285e65e0d371ef823815d2c4f7371d4de9ff3ddb01`
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only	1.1 GB	`sha256:f4a810bec30fb240109a82203cdfedb019c093920d61dc5d664fafe9f4c598b1`
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:30f0d069e19fb173f0acb699a62defaa69524044fe2d6113e799da7db562e84d`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:9a93263b52e835c249ac1f3147f9495d406715262d7e36c887472b846b446833`
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized	6.82 GB	`sha256:275d27d119405f46532594112cf0ce4a734f7d1cf3eb6e1f0c65cf16fdca46ce`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:1c73794aa4d17a83e8e53cbfde2f49ad62be23d9d3946088008c94317261450a`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:80748d908266b355a16aa671438656b65d7be9dc5bef38244345aa1a27b88041`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:089b63a2e16b5315002c056eafd6b50d1e0418024dc566b8a47e104655c2da8a`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:598c04623e3dc05e6d5b765edc6358a4c4d165bc100ec2234ea8e108837be58c`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:20f3d80b793ec50e769f8c5cc654bdcdaa644423697897d97a8799715d004e0e`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:fb9329a116fc62458daaccc4cbe1c69b477bbb34ef1b30abe28979a7c0209ef7`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed	491 MB	`sha256:1b462ecf753097a2f19df43e31af965f9ff27dc521d9c2d314ed10f09999d822`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:b073ee47bd7dc52e80021d138f4f4631c428a65569c3b68a369de6370b32bd1b`
openai-whisper-small-cuda-non-quantized	361 MB	`sha256:b0a8afce30962fb6c07fac97cca7eb53ccb832050a4a80d61eb7ba4e0491b10c`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:667f6d07c80b436ca995623abdbce4267fb955b9ce26940daad5b8afa6e22b4f`
openai-whisper-small-cuda-quantized-int4-weight-only	270 MB	`sha256:574fd546bcb0fe28f0763b84aeb7afd72042f2afd3d2ca8cd71fc50d08021c74`