[executorch][nvidia][tensorrt][4/n] Add operator support base class #10603

Triggered via: pull request, March 5, 2026 17:59
Status: Success
Total duration: 1h 32m 35s
Artifacts: 18

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 27m 36s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds: 3s

Artifacts

Produced during runtime
Name | Size | Digest
Qwen-Qwen3-0.6B-cuda-non-quantized | 1.1 GB | sha256:545bc5cc34cb8de812e44ae7f2f18a5605f18f3890a239fe72bf47e6e76f8be1
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed | 559 MB | sha256:c77b3d2a57c9f700ec0d75b42435b496a4b35e0b4e2c2c67731555a7a040552e
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only | 1.1 GB | sha256:70dfc8f7c2913c323c77e4213d5984f25ea550f3e011edf2d58e21a21cfecfef
google-gemma-3-4b-it-cuda-non-quantized | 7.22 GB | sha256:351228d482057b3e488c66b45427fe48ab99360b1962bf9bc007b2d4eab66119
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | 3.36 GB | sha256:0dc4b60cf72a687343f4ffc5225a0e5cd5ea67d5c1c9f5cda9a96e7c261c09df
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | 6.82 GB | sha256:8e1d0e3585aa5757b74c527613d1f4ea571f4b8d80c5f81e3db23aae6e73a852
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | 2.8 GB | sha256:6427bd1171aabc8ce125810accbc89dc387d4bf6fcba9cb0c9bd85133dcd3696
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | 6.14 GB | sha256:714c4d9248bed35651e8d77551c2cbd4b6d0c5e3925dc775bfda7b50f27a696d
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed | 15.5 GB | sha256:1892f21c7941a29ab24ca17d3c984720597756bbbe7721c9b82b925b01c58081
nvidia-parakeet-tdt-cuda-non-quantized | 952 MB | sha256:13e80bb2a8c93b264147b2301164dfefc734d860baef0e66efef07d33aa44760
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | 443 MB | sha256:0ff38d4ecb9d7ae75ed1a4214ed3899cb1bf3d180a4f7cfa88aebedc8619ce8c
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | 430 MB | sha256:ff772d1588a9fa6181aa9dfbbfe656b8deeac279f864ff604717ef7582a80a65
openai-whisper-large-v3-turbo-cuda-non-quantized | 1.18 GB | sha256:1470bb2ccece98ee0ab38f24d56fde66ea3d5b45a5be8b05290f37a822cdde6c
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | 491 MB | sha256:23a0811e56ce14355eff1ce9b2ce163792026a718117a57f16cef04cdbb625b7
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | 485 MB | sha256:7aea30069286daac270f8260a3ae43660993c48d6af377c609adb8b563146487
openai-whisper-small-cuda-non-quantized | 361 MB | sha256:f1d062b5c420258c82e7b62f172d6a0ae234e45f13b51d3c1608ee30cc502af1
openai-whisper-small-cuda-quantized-int4-tile-packed | 172 MB | sha256:3869956aafc0d440ae1f84421e2dad1e847b8d1c26a57d953b698cdc27745184
openai-whisper-small-cuda-quantized-int4-weight-only | 271 MB | sha256:418abc1ab5fefe0226fd0f91ca8daa0ee7fa87c04c9184ce06ae9c7589ea4baf
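
Each artifact above is listed with a sha256 digest, so a downloaded copy can be checked against it. The following is a minimal sketch, not part of the workflow itself: it assumes the digest is computed over the downloaded artifact file as a whole (typically a zip archive), and the helper names `sha256_of_file` and `matches_digest`, as well as the local file name in the usage note, are hypothetical.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return 'sha256:<hex>' for the file at `path`.

    The file is read in chunks so multi-gigabyte artifacts
    do not have to fit in memory.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns b"" (end of file).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return "sha256:" + digest.hexdigest()


def matches_digest(path, expected):
    """Compare a file's digest with an expected 'sha256:<hex>' string.

    Assumption: `expected` is copied from the Digest column of the listing.
    """
    return sha256_of_file(path) == expected.strip().lower()
```

For example, `matches_digest("openai-whisper-small-cuda-non-quantized.zip", "sha256:f1d062b5c420258c82e7b62f172d6a0ae234e45f13b51d3c1608ee30cc502af1")` would return `True` only if the local file (hypothetical name) hashes to the listed value.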