ExecuTorch MLX delegate #10284
Triggered via pull request: March 3, 2026 00:53
Status: Success
Total duration: 2h 24m 33s
Artifacts: 17
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 25m 2s
Matrix: test-models-cuda
Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| Qwen-Qwen3-0.6B-cuda-non-quantized | 1.1 GB | sha256:de25e14a98f17476033b66738d6e1afe027b7c21022ceca77444a73c580d3e66 |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed | 559 MB | sha256:38e7b1435c947355228b46285e65e0d371ef823815d2c4f7371d4de9ff3ddb01 |
| Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only | 1.1 GB | sha256:f4a810bec30fb240109a82203cdfedb019c093920d61dc5d664fafe9f4c598b1 |
| google-gemma-3-4b-it-cuda-non-quantized | 7.22 GB | sha256:30f0d069e19fb173f0acb699a62defaa69524044fe2d6113e799da7db562e84d |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | 3.36 GB | sha256:9a93263b52e835c249ac1f3147f9495d406715262d7e36c887472b846b446833 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | 6.82 GB | sha256:275d27d119405f46532594112cf0ce4a734f7d1cf3eb6e1f0c65cf16fdca46ce |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | 2.8 GB | sha256:1c73794aa4d17a83e8e53cbfde2f49ad62be23d9d3946088008c94317261450a |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | 6.14 GB | sha256:80748d908266b355a16aa671438656b65d7be9dc5bef38244345aa1a27b88041 |
| nvidia-parakeet-tdt-cuda-non-quantized | 952 MB | sha256:089b63a2e16b5315002c056eafd6b50d1e0418024dc566b8a47e104655c2da8a |
| nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | 443 MB | sha256:598c04623e3dc05e6d5b765edc6358a4c4d165bc100ec2234ea8e108837be58c |
| nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | 430 MB | sha256:20f3d80b793ec50e769f8c5cc654bdcdaa644423697897d97a8799715d004e0e |
| openai-whisper-large-v3-turbo-cuda-non-quantized | 1.18 GB | sha256:fb9329a116fc62458daaccc4cbe1c69b477bbb34ef1b30abe28979a7c0209ef7 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | 491 MB | sha256:1b462ecf753097a2f19df43e31af965f9ff27dc521d9c2d314ed10f09999d822 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | 485 MB | sha256:b073ee47bd7dc52e80021d138f4f4631c428a65569c3b68a369de6370b32bd1b |
| openai-whisper-small-cuda-non-quantized | 361 MB | sha256:b0a8afce30962fb6c07fac97cca7eb53ccb832050a4a80d61eb7ba4e0491b10c |
| openai-whisper-small-cuda-quantized-int4-tile-packed | 172 MB | sha256:667f6d07c80b436ca995623abdbce4267fb955b9ce26940daad5b8afa6e22b4f |
| openai-whisper-small-cuda-quantized-int4-weight-only | 270 MB | sha256:574fd546bcb0fe28f0763b84aeb7afd72042f2afd3d2ca8cd71fc50d08021c74 |
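The Digest column can be used to check a downloaded artifact for corruption. The following is a minimal sketch, not part of the workflow itself. It assumes the listed digest is the SHA-256 of the artifact archive exactly as downloaded (the zip file, not its extracted contents); the helper names `sha256_digest` and `matches_digest` are hypothetical.

```python
import hashlib


def sha256_digest(path, chunk_size=1 << 20):
    """Return the file's digest in the 'sha256:<hex>' form used in the table."""
    hasher = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so multi-gigabyte archives are not loaded into memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return "sha256:" + hasher.hexdigest()


def matches_digest(path, expected):
    """Compare a file's digest with an expected 'sha256:<hex>' string."""
    # Assumption: the expected digest covers the archive as downloaded.
    return sha256_digest(path) == expected.strip().lower()
```

For example, `matches_digest("openai-whisper-small-cuda-non-quantized.zip", "sha256:b0a8afce30962fb6c07fac97cca7eb53ccb832050a4a80d61eb7ba4e0491b10c")` would return `True` for an intact download, assuming the archive was saved under that (hypothetical) local file name.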