Add non-flash SDPA path gated by ET_USE_UNFUSED_SDPA #18717
kimishpatel wants to merge 1 commit into gh/kimishpatel/237/base from
Conversation
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18717

Note: Links to docs will display an error until the docs builds have been completed.

❌ 126 New Failures, 1 Unrelated Failure as of commit 867ba3e with merge base fb1618e. One additional job failed, but this was likely due to flakiness present on trunk.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
Submitted by accident, not meant to land immediately.
Stack from ghstack (oldest at bottom):
Benchmarks show ET's flash attention is 10-12% slower than non-tiled
GEMM-based SDPA for decode (SeqLen=1) due to within-head tiling overhead
(multiple small GEMMs plus online softmax rescaling). This adds an
alternative non-flash code path that computes the full Q@K^T, a standard
softmax, then scores@V using two GEMM calls, gated by
`#ifdef ET_USE_UNFUSED_SDPA` so it can be tested without disrupting the
existing flash path.
The new `cpu_sdpa` function reuses the existing SeqDim, stride extraction,
`cpublas::gemm`, and `parallel_for` infrastructure. Float-only (no quantized
input support). Threading granularity is one (batch, head) per work unit.
Differential Revision: [D99677685](https://our.internmc.facebook.com/intern/diff/D99677685/)