[executorch][nvidia][tensorrt][17/n] Add embedding, expand, upsample converters #10610
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
27m 48s
Matrix: test-models-cuda
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds
4s
Annotations
5 errors and 1 warning
|
test-executorch-cuda-build-13.0 / linux-job
Process completed with exit code 1.
|
|
test-models-cuda (mv3) / linux-job
Process completed with exit code 1.
|
|
test-models-cuda (linear) / linux-job
Process completed with exit code 1.
|
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Process completed with exit code 1.
|
|
check-all-cuda-builds
Process completed with exit code 1.
|
|
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
Qwen-Qwen3-0.6B-cuda-non-quantized
|
1.1 GB |
sha256:78f7139e88bc325c50b31973cc33d2fba22f43fe17de186d09607d76c1fb6e0f
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
|
1.1 GB |
sha256:876ecc8aed3be4ca13f497c9da880361c21fd641fc7dc68ba73b4f733b8828a7
|
|
|
google-gemma-3-4b-it-cuda-non-quantized
|
7.22 GB |
sha256:ea77ccf367996dd1f6881e9ef3f51fadb5660db04ef20ea7616118a0621db7a6
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
|
3.36 GB |
sha256:9e259ddc6bdb838d08a96789e10b040630a5110490c1bd2ad1aeef4e8266207a
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
|
6.82 GB |
sha256:a4e9a6da6a073fb3eeb3056a486790255f43123e855b5db1e58e2144e2b59af1
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
|
2.8 GB |
sha256:68852713897d7d9daf403895d579a88960ebdca56155b11b0f65bd5386dd5f16
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
|
6.14 GB |
sha256:0074df99d0adcd0fd578762e0e14161ecab85bf77de4167bb469d61002af5f54
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
|
15.5 GB |
sha256:62dea8c113513f8bbef2105ab8ab1456bc06af03fc03da9775662cd60b4c2b68
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
|
952 MB |
sha256:307e3999db8e527126050f5d1f5fd0947a298f58259d8c414720d661f836f5e3
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
|
443 MB |
sha256:d380d3a08228f56f788a6a1b634f0d97f804c6616b51e682c7f4a4665f7d3053
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
|
430 MB |
sha256:3e3900a46400d20726c4fe1328bf2ee895a93efd63ffb9808911b334397d65b8
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
|
1.18 GB |
sha256:f9b33dc954f212d74d9f6f05bf8fde1fd76fb2293c736e4a20cd06384f36df03
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
|
491 MB |
sha256:56cb55a8dfe1e34b504e177c69c65f5ff3f02a5010fdce3b8b076bc4d73666da
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
|
485 MB |
sha256:fa2684b2323c0990ceefc4109a4790a2877f0f0f7164ab65772ffb7cec1161ce
|
|
|
openai-whisper-small-cuda-non-quantized
|
361 MB |
sha256:fd67c28c2a0ce22ce3d9d02d40357e0ffa906e56786bdcf9cdcf98bcb18d1573
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
|
172 MB |
sha256:9cddc0caec84a31183ce20f8e32b754d99aef0824b0f8899e669f5dd3ed90808
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
|
270 MB |
sha256:c383a3ed75ac6619865b7ea93718400c7b5db0f837bed65b086e7612e8296035
|
|