[Cria][Lllama runner] Use caching temp allocator #2657
metal.yml
on: pull_request
Matrix: export-model-metal-artifact
test-executorch-metal-build
/
macos-job
5m 9s
Matrix: test-model-metal-e2e
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-metal-non-quantized
Expired
|
6.82 GB |
sha256:69d95020b3092d8bf19fe2f2ddd359ac223450ca3858647197c2c5ab56e56b4a
|
|
|
openai-whisper-large-v3-turbo-metal-non-quantized
Expired
|
1.18 GB |
sha256:4b2d31a331301ab70ecd2623619376343184fe520c39ab17af750dc1ce854d4c
|
|
|
openai-whisper-small-metal-non-quantized
Expired
|
361 MB |
sha256:695b51b658083c6a09d28c384c5ceffe1448cbc90033feca4dbe1feca13bc94e
|
|