[executorch][nvidia][tensorrt][20/n] Add bmm converter #10613
Triggered via pull request
March 5, 2026 18:03
Status
Failure
Total duration
1h 17m 38s
Artifacts
18
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
27m 26s
Matrix: test-models-cuda
Annotations
42 errors
|
test-executorch-cuda-build-13.0 / linux-job
Process completed with exit code 1.
|
|
test-executorch-cuda-build-12.6 / linux-job
Process completed with exit code 1.
|
|
test-executorch-cuda-build-12.8 / linux-job
Process completed with exit code 1.
|
|
check-all-cuda-builds
Process completed with exit code 1.
|
|
test-cuda-pybind (qwen3-0.6b, Qwen-Qwen3-0.6B-cuda-non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-cuda-pybind (qwen3-0.6b, Qwen-Qwen3-0.6B-cuda-non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID 9B8E:2B34B6:269800A:A48D960:69A9D651 and timestamp 2026-03-05 19:15:29 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-cuda-pybind (gemma3-4b, google-gemma-3-4b-it-cuda-non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-cuda-pybind (gemma3-4b, google-gemma-3-4b-it-cuda-non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID CE7A:30AE15:28BE887:ADE0BD8:69A9D68C and timestamp 2026-03-05 19:16:28 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-small, quantized-int4-weight-only) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-small, quantized-int4-weight-only) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID E252:6AC24:1EF89D:84AF6F:69A9D694 and timestamp 2026-03-05 19:16:36 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID DB8E:115F6B:29F6236:B3FC380:69A9D696 and timestamp 2026-03-05 19:16:38 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID AD46:310F7B:1EEBEF:844C68:69A9D6A5 and timestamp 2026-03-05 19:16:53 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-small, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-small, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID 8DEA:3457DC:389C60:F14AA6:69A9D6B0 and timestamp 2026-03-05 19:17:04 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-cuda-pybind (qwen3-0.6b, --quantize, Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-cuda-pybind (qwen3-0.6b, --quantize, Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID A188:95D1A:36B228:E7EE78:69A9D6B6 and timestamp 2026-03-05 19:17:10 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID BBDC:175F1B:3618D5:E54099:69A9D6BC and timestamp 2026-03-05 19:17:16 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID B60A:9D305:34D062:E180E7:69A9D6C4 and timestamp 2026-03-05 19:17:24 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID D0AA:3C83F4:248727:9AAF4F:69A9D6D2 and timestamp 2026-03-05 19:17:38 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID C44E:16BC1A:242CA0:990320:69A9D6E7 and timestamp 2026-03-05 19:17:59 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID 90B0:95D1A:38109E:EDB3A9:69A9D6EE and timestamp 2026-03-05 19:18:06 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID 9A8A:2F6BAE:24F4D6:9CAF8D:69A9D6FD and timestamp 2026-03-05 19:18:21 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID D348:3C83F4:264039:A20791:69A9D712 and timestamp 2026-03-05 19:18:42 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-cuda-pybind (gemma3-4b, --quantize, google-gemma-3-4b-it-cuda-quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-cuda-pybind (gemma3-4b, --quantize, google-gemma-3-4b-it-cuda-quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID 8F76:1FBA0:24F7C5:9C4E65:69A9D719 and timestamp 2026-03-05 19:18:49 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID B82A:2E8184:28CC90F:AD8E595:69A9D71C and timestamp 2026-03-05 19:18:52 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID BB2C:332F6:2B8A6E7:BBA9E1C:69A9D722 and timestamp 2026-03-05 19:18:58 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID BAE0:2A974E:385520:EEED9B:69A9D732 and timestamp 2026-03-05 19:19:14 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-weight-only) / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/ec2-user/actions-runner/_work/executorch/executorch/pytorch/executorch'. No such file or directory
|
|
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-weight-only) / linux-job
API rate limit exceeded for installation. If you reach out to GitHub Support for help, please include the request ID CF9C:265503:27E440F:AA76445:69A9D769 and timestamp 2026-03-05 19:20:09 UTC. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service) - https://docs.github.com/rest/overview/rate-limits-for-the-rest-api
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
Qwen-Qwen3-0.6B-cuda-non-quantized
|
1.1 GB |
sha256:0a5c93773a2c1b7d44c3e530bdf302921cc007153eb061599eb3cb29ec228a25
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed
|
559 MB |
sha256:b1c587c12228668736c1bc8ff4221f480ac46c54a608fde3828a577ff3e0c9b4
|
|
|
Qwen-Qwen3-0.6B-cuda-quantized-int4-weight-only
|
1.1 GB |
sha256:83621f284dce3f9646ee85b0b7958e18c4c7fc36ea9197a235b60e1252ce6d8d
|
|
|
google-gemma-3-4b-it-cuda-non-quantized
|
7.22 GB |
sha256:420245dacbc10396d05fea5b1659a08c986fc0a9f4c220c9ffd220e6bc818bfd
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
|
3.36 GB |
sha256:b926ea5f44046dc96e3fd8c1e129a8f2f3b0273d59794874bc65fcec529a40d0
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
|
6.82 GB |
sha256:fe537dd2bced15e72445839e0a92ace8af49f4bb81248d3e19d8f6c844120ba5
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
|
2.8 GB |
sha256:951726923e20cb632520248b63793d6c7650d18056d42e62b45163d1e677d097
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
|
6.14 GB |
sha256:eb84592d91ceb24c845835c54ed245e72cc619c200d2691142b85abc3afeae87
|
|
|
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
|
15.5 GB |
sha256:65f8bf62e8130959fdba28678ec03a6a18837a84d5eec8bf11c3853a474c4bcc
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
|
952 MB |
sha256:98336640f0863c4e338a136dcf8b377f3676f1510bf89b6cd454f8ee9dfc7158
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
|
443 MB |
sha256:4178cd21c74c8d48f7d4ce3b568b209bf40fecea338f45e75648e82cb0f0a372
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
|
430 MB |
sha256:c9a9ee7871ea25b48d7fdba976c62b50c5605bed229f6b7bb761c286589a0de3
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
|
1.18 GB |
sha256:c628e41478939074f93cfa084a9868c3af24a188e9237d380f056b5991aaf4ba
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
|
491 MB |
sha256:39dfacb6bda0cb029dce0ad6450ad41b563f7b5c3db1ea8876b98de646e96596
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
|
485 MB |
sha256:fb5be156348a61adc49ff2cc8bcf98ff1dd09c72983943bfa56e19b19bb3d66d
|
|
|
openai-whisper-small-cuda-non-quantized
|
361 MB |
sha256:2c748ce0dc0708ee07e29dcebb8d1259e1c05524ff09027b93785d9f65dc6451
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
|
172 MB |
sha256:5062b00e83ef3861d6cffaffde83306e6f163c9ffbdda951f283039c6fcc5d61
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
|
271 MB |
sha256:9d80fba32c29dd5bf6494a726446e970aaf0899e7beb02e160cba0d7017c0f1d
|
|