Skip to content

[executorch][nvidia][tensorrt][13/n] Add examples, C++ runner and CI workflow #10604

[executorch][nvidia][tensorrt][13/n] Add examples, C++ runner and CI workflow

[executorch][nvidia][tensorrt][13/n] Add examples, C++ runner and CI workflow #10604

Triggered via pull request March 5, 2026 17:59
Status Failure
Total duration 1h 31m 28s
Artifacts 18

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda  /  linux-job
27m 36s
unittest-cuda / linux-job
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
4s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Annotations

1 error
test-models-cuda (mv3) / linux-job
Process completed with exit code 1.

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized
1.1 GB
sha256:2ff2c1350eb78b3f45e02146488ad588a40f0909f552dc0993b14c228f0b3d73
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed
559 MB
sha256:2126772bec44e38183de00223e4aa25bc8d8320c705427dfc9ec73fbc5e7754b
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
1.1 GB
sha256:41af4025cda621aaefb74ee0bca18477696ebde5d2a62540dead827b883a179f
google-gemma-3-4b-it-cuda-non-quantized
7.22 GB
sha256:b02b80a830621c524502768b29fb6284c439f52080df9a2bbda559f85e7c818b
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
3.36 GB
sha256:b31d87bd678f01be108c611ab9ba0ae1b0b3e2b74a436d3aab8eddc920eb269a
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
6.82 GB
sha256:d67e92e02669453a8998556cc5d34e50e7ad9361c373842a5fbeebe9630821d1
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
2.8 GB
sha256:1b521b0f5254fa4096fb82d252c2ec3a0b265045e2e240e53964dafc261df3ae
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
6.14 GB
sha256:3e914d7124b95589a2064bcdad59fe3243095e8323f3a00edeaee0055928d9dd
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
15.5 GB
sha256:0619b6282464cfabf91f95f36188810da85a6f4a4b9d16d6bcd83be98d625c8a
nvidia-parakeet-tdt-cuda-non-quantized
952 MB
sha256:a2554447d9f64f09c97c44162dfddebc894b678cebba9143483d926c4f36fe11
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
443 MB
sha256:dfe7cf0034593173d8933e3f6abf562e9852f9fa0b961bcdc652822d2c04134f
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
430 MB
sha256:c4d2ad7635b29781d30f9d87f81858cb019cbeeb5736cac3cd894f9246373649
openai-whisper-large-v3-turbo-cuda-non-quantized
1.18 GB
sha256:e63c5856221f5a308f6f59a73039530d82793c052e86aa52136669f68bac275a
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
491 MB
sha256:f5817f7819b7f2d389889af29447be92a03957ce307761cb0506c090d03e0ef6
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
485 MB
sha256:8c3b46c3837a45d4b88d174b165c36ada775efb28c56d9eac700915dc473db9d
openai-whisper-small-cuda-non-quantized
362 MB
sha256:2899de1918afdbb2efc4935dcee2ae5134ca32f48f7e772634374b109921ac3f
openai-whisper-small-cuda-quantized-int4-tile-packed
172 MB
sha256:b259e940552cf788b6b09b4a8029367c44c23e91cccfadd9538d9bfaaa737ca4
openai-whisper-small-cuda-quantized-int4-weight-only
271 MB
sha256:3c37142c15d749f98b5d002d177c50856d306bf94483a0632b2056595710c44c