Support separate K/V seq dim in custom_sdpa op #18714
kimishpatel wants to merge 1 commit into gh/kimishpatel/234/base from
Conversation
Previously `custom_sdpa` used a single `is_seq_at_dim_2` flag for all tensors. This meant the v_only transpose path required a runtime transpose copy for K (converting from [B, H, S, D] to [B, S, H, D]), which caused a 2.3x decode slowdown (15.35 vs 35.63 tok/s).

Now the C++ op accepts separate `is_seq_dim_2`, `is_k_seq_dim_2`, and `is_v_seq_dim_2` flags so Q, K, and V can each have independent layouts. The Python layer passes K and V in their native cache layout without any transpose, and the flash attention kernel handles the mixed strides directly.

Changes:
- op_sdpa_impl.h: `cpu_flash_attention` takes `q_seq_dim`, `k_seq_dim`, `v_seq_dim` instead of a single `seq_dim`
- op_sdpa.cpp/h: `custom_sdpa_out` takes 3 bool params
- op_sdpa_aot.cpp: updated schema strings and wrappers
- sdpa.py: `SDPACustom` uses `is_k_seq_at_dim_2` / `is_v_seq_at_dim_2`; Q is always at dim 2, no input transposes
- custom_kv_cache.py: `update()` returns the native cache layout; added an `is_seq_at_dim_2` compat property
- export_llama_lib.py: passes separate K/V flags

Differential Revision: [D99677678](https://our.internmc.facebook.com/intern/diff/D99677678/)

[ghstack-poisoned]
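The key idea — letting each of Q/K/V carry its own sequence dimension so the kernel reads through mixed strides instead of copying K into a canonical layout — can be sketched outside ExecuTorch. This is a minimal NumPy illustration, not the actual `custom_sdpa` implementation: the `sdpa` helper and its `k_seq_dim`/`v_seq_dim` parameters are hypothetical names chosen to mirror the PR's flags, and `np.swapaxes` stands in for the stride swap the C++ kernel performs.

```python
import numpy as np

def sdpa(q, k, v, k_seq_dim=2, v_seq_dim=2):
    """Toy scaled dot-product attention accepting K/V with the sequence
    dimension at either dim 1 ([B, S, H, D]) or dim 2 ([B, H, S, D]).
    Q is assumed to be [B, H, S, D] (seq at dim 2), as in the PR.

    Instead of copying K/V into a canonical layout, we take transposed
    *views* (np.swapaxes returns a view, analogous to the stride swap
    in the C++ kernel), so no data movement occurs.
    """
    if k_seq_dim == 1:  # [B, S, H, D] -> viewed as [B, H, S, D]
        k = np.swapaxes(k, 1, 2)
    if v_seq_dim == 1:
        v = np.swapaxes(v, 1, 2)
    scale = 1.0 / np.sqrt(q.shape[-1])
    scores = q @ np.swapaxes(k, -1, -2) * scale    # [B, H, S_q, S_k]
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    return probs @ v                               # [B, H, S_q, D]

rng = np.random.default_rng(0)
B, H, S, D = 1, 2, 4, 8
q = rng.standard_normal((B, H, S, D))
k_bhsd = rng.standard_normal((B, H, S, D))
v_bhsd = rng.standard_normal((B, H, S, D))
# The same data stored in a [B, S, H, D] cache layout:
k_bshd = np.ascontiguousarray(np.swapaxes(k_bhsd, 1, 2))
v_bshd = np.ascontiguousarray(np.swapaxes(v_bhsd, 1, 2))

out_ref = sdpa(q, k_bhsd, v_bhsd, k_seq_dim=2, v_seq_dim=2)
out_mix = sdpa(q, k_bshd, v_bshd, k_seq_dim=1, v_seq_dim=1)
assert np.allclose(out_ref, out_mix)
# The swapped view aliases the cache buffer -- no transpose copy was made:
assert np.swapaxes(k_bshd, 1, 2).base is k_bshd
```

The point of the sketch is the final assertion: reading K through a swapped-stride view produces identical results to the canonical layout while sharing the cache's memory, which is why dropping the runtime transpose copy recovers the decode throughput cited above.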
See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18714.

❌ 126 New Failures, 2 Cancelled Jobs as of commit 45468e1 with merge base fb1618e. Note: links to docs will display an error until the docs builds have completed. Cancelled jobs may be retried.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
submitted by accident, not meant to land immediately