Test CUDA Builds

[executorch][nvidia][tensorrt][13/n] Add examples, C++ runner and CI workflow #10604

Sign in to view logs

Triggered via pull request March 5, 2026 17:59

shoumikhin

opened #17924

gh/shoumikhin/38/head

Status Failure

Total duration 1h 31m 28s

Artifacts 18

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Matrix: test-model-cuda-e2e

check-all-cuda-builds

Annotations

1 error

test-models-cuda (mv3) / linux-job

Process completed with exit code 1.

Artifacts

Produced during runtime

Name	Size	Digest
Qwen-Qwen3-0.6B-cuda-non-quantized	1.1 GB	`sha256:2ff2c1350eb78b3f45e02146488ad588a40f0909f552dc0993b14c228f0b3d73`
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed	559 MB	`sha256:2126772bec44e38183de00223e4aa25bc8d8320c705427dfc9ec73fbc5e7754b`
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only	1.1 GB	`sha256:41af4025cda621aaefb74ee0bca18477696ebde5d2a62540dead827b883a179f`
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:b02b80a830621c524502768b29fb6284c439f52080df9a2bbda559f85e7c818b`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:b31d87bd678f01be108c611ab9ba0ae1b0b3e2b74a436d3aab8eddc920eb269a`
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized	6.82 GB	`sha256:d67e92e02669453a8998556cc5d34e50e7ad9361c373842a5fbeebe9630821d1`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:1b521b0f5254fa4096fb82d252c2ec3a0b265045e2e240e53964dafc261df3ae`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:3e914d7124b95589a2064bcdad59fe3243095e8323f3a00edeaee0055928d9dd`
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed	15.5 GB	`sha256:0619b6282464cfabf91f95f36188810da85a6f4a4b9d16d6bcd83be98d625c8a`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:a2554447d9f64f09c97c44162dfddebc894b678cebba9143483d926c4f36fe11`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:dfe7cf0034593173d8933e3f6abf562e9852f9fa0b961bcdc652822d2c04134f`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:c4d2ad7635b29781d30f9d87f81858cb019cbeeb5736cac3cd894f9246373649`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:e63c5856221f5a308f6f59a73039530d82793c052e86aa52136669f68bac275a`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed	491 MB	`sha256:f5817f7819b7f2d389889af29447be92a03957ce307761cb0506c090d03e0ef6`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:8c3b46c3837a45d4b88d174b165c36ada775efb28c56d9eac700915dc473db9d`
openai-whisper-small-cuda-non-quantized	362 MB	`sha256:2899de1918afdbb2efc4935dcee2ae5134ca32f48f7e772634374b109921ac3f`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:b259e940552cf788b6b09b4a8029367c44c23e91cccfadd9538d9bfaaa737ca4`
openai-whisper-small-cuda-quantized-int4-weight-only	271 MB	`sha256:3c37142c15d749f98b5d002d177c50856d306bf94483a0632b2056595710c44c`