Skip to content

Reduce allocation overhead in quantized sdpa #2683

Reduce allocation overhead in quantized sdpa

Reduce allocation overhead in quantized sdpa #2683

Triggered via pull request December 4, 2025 21:04
Status Failure
Total duration 12m 33s
Artifacts 3

metal.yml

on: pull_request
Matrix: export-model-metal-artifact
test-executorch-metal-build  /  macos-job
5m 9s
test-executorch-metal-build / macos-job
Matrix: test-model-metal-e2e
Waiting for pending jobs
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
mistralai-Voxtral-Mini-3B-2507-metal-non-quantized Expired
6.82 GB
sha256:0db05c146ace5dbc453cd2d270d549d1dd0484d7fa939da6366814c1214f2853
openai-whisper-large-v3-turbo-metal-non-quantized Expired
1.18 GB
sha256:c515dfa851dde7c5f564120b03743e952557f783779de65c77ebbfd31726ce8a
openai-whisper-small-metal-non-quantized Expired
361 MB
sha256:77dd5f10907b6ac3dcb902b3f48567362f3cda9f6bbca935afe4ac7a4be96d36