Reduce allocation overhead in quantized sdpa #2683
metal.yml
on: pull_request
Matrix: export-model-metal-artifact
test-executorch-metal-build
/
macos-job
5m 9s
Matrix: test-model-metal-e2e
Waiting for pending jobs
Annotations
1 error
|
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-metal-non-quantized
Expired
|
6.82 GB |
sha256:0db05c146ace5dbc453cd2d270d549d1dd0484d7fa939da6366814c1214f2853
|
|
|
openai-whisper-large-v3-turbo-metal-non-quantized
Expired
|
1.18 GB |
sha256:c515dfa851dde7c5f564120b03743e952557f783779de65c77ebbfd31726ce8a
|
|
|
openai-whisper-small-metal-non-quantized
Expired
|
361 MB |
sha256:77dd5f10907b6ac3dcb902b3f48567362f3cda9f6bbca935afe4ac7a4be96d36
|
|