Skip to content

[Executorch] Use temp allocator for allocating scratch memory #2525

[Executorch] Use temp allocator for allocating scratch memory

[Executorch] Use temp allocator for allocating scratch memory #2525

Triggered via pull request November 6, 2025 20:51
Status Success
Total duration 1h 14m 15s
Artifacts 8

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
Matrix: test-models-cuda
Matrix: benchmark-model-cuda
Matrix: test-model-cuda-e2e
check-all-cuda-builds
2s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:55be257556453347216e2896131945db7b81d4274fc39d9a8e25133b4cdf174f
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
4.03 GB
sha256:b3908d392e64c6ee954301f4e9c9d7671de95246347200803601ec43fc555a69
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:405634366032c5b7558dbc2c801af4cbe4c9ec7af47a19037ea7cf56f39fd74f
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.89 GB
sha256:52ceb41385b846e9254d74f88d73e80fa24567f29e7431f343203b45d1bfc245
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:5c2edd23cd22eca80b5c37e4050bab956f2817d1f23529c1820fb19ae16211e0
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:25daad9e9cf7069b9ba12707da5ad6a5775b7b8904967fff39f701f2d579f50d
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:cf760d0af395df7924f8f578fd3ed35905199c43def358cfe9b0209365ed5657
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:d19c68edecc89ea3ec3b86f4a7ba8e02744d904d73ecb74f2d3f226aa3424c66