Skip to content

ExecuTorch MLX delegate #10284

ExecuTorch MLX delegate

ExecuTorch MLX delegate #10284

Triggered via pull request March 3, 2026 00:53
Status Success
Total duration 2h 24m 33s
Artifacts 17

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
25m 2s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized
1.1 GB
sha256:de25e14a98f17476033b66738d6e1afe027b7c21022ceca77444a73c580d3e66
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed
559 MB
sha256:38e7b1435c947355228b46285e65e0d371ef823815d2c4f7371d4de9ff3ddb01
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
1.1 GB
sha256:f4a810bec30fb240109a82203cdfedb019c093920d61dc5d664fafe9f4c598b1
google-gemma-3-4b-it-cuda-non-quantized
7.22 GB
sha256:30f0d069e19fb173f0acb699a62defaa69524044fe2d6113e799da7db562e84d
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
3.36 GB
sha256:9a93263b52e835c249ac1f3147f9495d406715262d7e36c887472b846b446833
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
6.82 GB
sha256:275d27d119405f46532594112cf0ce4a734f7d1cf3eb6e1f0c65cf16fdca46ce
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
2.8 GB
sha256:1c73794aa4d17a83e8e53cbfde2f49ad62be23d9d3946088008c94317261450a
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
6.14 GB
sha256:80748d908266b355a16aa671438656b65d7be9dc5bef38244345aa1a27b88041
nvidia-parakeet-tdt-cuda-non-quantized
952 MB
sha256:089b63a2e16b5315002c056eafd6b50d1e0418024dc566b8a47e104655c2da8a
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
443 MB
sha256:598c04623e3dc05e6d5b765edc6358a4c4d165bc100ec2234ea8e108837be58c
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
430 MB
sha256:20f3d80b793ec50e769f8c5cc654bdcdaa644423697897d97a8799715d004e0e
openai-whisper-large-v3-turbo-cuda-non-quantized
1.18 GB
sha256:fb9329a116fc62458daaccc4cbe1c69b477bbb34ef1b30abe28979a7c0209ef7
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
491 MB
sha256:1b462ecf753097a2f19df43e31af965f9ff27dc521d9c2d314ed10f09999d822
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
485 MB
sha256:b073ee47bd7dc52e80021d138f4f4631c428a65569c3b68a369de6370b32bd1b
openai-whisper-small-cuda-non-quantized
361 MB
sha256:b0a8afce30962fb6c07fac97cca7eb53ccb832050a4a80d61eb7ba4e0491b10c
openai-whisper-small-cuda-quantized-int4-tile-packed
172 MB
sha256:667f6d07c80b436ca995623abdbce4267fb955b9ce26940daad5b8afa6e22b4f
openai-whisper-small-cuda-quantized-int4-weight-only
270 MB
sha256:574fd546bcb0fe28f0763b84aeb7afd72042f2afd3d2ca8cd71fc50d08021c74