Commit 068b686
Fix MHA CrossAttention regression: use num_heads_ for non-GQA paths
The internal present KV buffer shape must use num_heads_ for MHA
(where kv_num_heads_ is 0) and kv_num_heads_ only for GQA. Using
kv_num_heads_ unconditionally caused zero-sized buffers for MHA
CrossAttention tests.
Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>
1 parent 08bbc41 commit 068b686
1 file changed
Lines changed: 3 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| 425 | 425 | | |
| 426 | 426 | | |
| 427 | 427 | | |
| | 428 | + | |
| 428 | 429 | | |
| 429 | | - | |
| | 430 | + | |
| 430 | 431 | | |
| 431 | 432 | | |
| 432 | 433 | | |
| 433 | 434 | | |
| 434 | 435 | | |
| 435 | | - | |
| | 436 | + | |
| 436 | 437 | | |
| 437 | 438 | | |
| 438 | 439 | | |
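
The head-count selection described in the commit message can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from the commit: the helper names `PresentKVHeadCount` and `PresentKVShape` are hypothetical, and the `[batch, heads, seq_len, head_size]` layout is assumed. Only the rule itself (use `num_heads_` when `kv_num_heads_` is 0, otherwise `kv_num_heads_`) comes from the message.

```cpp
#include <array>
#include <cstdint>

// Hypothetical helper: choose the head count used to size the present KV buffer.
// Per the commit message, kv_num_heads == 0 indicates MHA, so fall back to num_heads;
// a non-zero kv_num_heads indicates GQA and is used directly.
inline int64_t PresentKVHeadCount(int64_t num_heads, int64_t kv_num_heads) {
  return kv_num_heads > 0 ? kv_num_heads : num_heads;
}

// Hypothetical shape builder. The [batch, heads, seq_len, head_size] layout is an
// assumption made for illustration only.
inline std::array<int64_t, 4> PresentKVShape(int64_t batch, int64_t num_heads,
                                             int64_t kv_num_heads, int64_t seq_len,
                                             int64_t head_size) {
  return {batch, PresentKVHeadCount(num_heads, kv_num_heads), seq_len, head_size};
}
```

With this rule, an MHA call such as `PresentKVShape(1, 8, 0, 16, 64)` gets 8 heads rather than 0, so the buffer is no longer zero-sized; using `kv_num_heads` unconditionally would make the second dimension 0 in that case.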