Skip to content

[Cria][Lllama runner] Use caching temp allocator #4002

[Cria][Lllama runner] Use caching temp allocator

[Cria][Lllama runner] Use caching temp allocator #4002

Triggered via pull request December 4, 2025 16:58
Status Success
Total duration 1h 26m 23s
Artifacts 11

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
Matrix: test-models-cuda
Matrix: test-model-cuda-e2e
check-all-cuda-builds
4s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
google-gemma-3-4b-it-cuda-non-quantized Expired
7.22 GB
sha256:07af7a49c543db85537659548bc0c45e409200293fc8809db5aae6bf8cbc3547
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed Expired
4.03 GB
sha256:7ded12417034efbf60a6fe2ed3c5459614c95a1c6d59b5eeee7174bdb949ac96
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized Expired
6.82 GB
sha256:fdb5341bce2980cd57dadb0e37098959b70196780281fa419d44a4fceae53347
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed Expired
2.89 GB
sha256:e4612a0e6c26d0427b0188fca3ff48abbad3ca3df6e7925c901f0b1e0f35c228
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only Expired
6.14 GB
sha256:f1ea3f9c654fd6667f67b59ecc306b920d9cb29a74029559efb4504f0c4639b7
openai-whisper-large-v3-turbo-cuda-non-quantized Expired
1.17 GB
sha256:314d37cfec7086cf9c1c6fab7effe0746225732fa4eb89d93f39518d0bc87efc
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed Expired
490 MB
sha256:f90de75b021b9ade4b4eb04016af58b7485b951383d2d0590cda679cd4fb912b
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only Expired
484 MB
sha256:e72dc684e42b3055426b4d528d85844b07c48e9a88f10c21a061eee8ac429bdb
openai-whisper-small-cuda-non-quantized Expired
361 MB
sha256:bea396674afa263b758bdf506b11e97f1911e933ece4aee4115944c334f25739
openai-whisper-small-cuda-quantized-int4-tile-packed Expired
172 MB
sha256:7942bbf4ea7e9852b17b58f9be3bbe50be4360883c803a7d36c18138a7b2176a
openai-whisper-small-cuda-quantized-int4-weight-only Expired
270 MB
sha256:ded941c49fa38a9ca43e61c962cdc0a3dafaf53ec961681c98e8453771ec5973