fix: build_attn V-padded reshape uses Q-head count, not KV-head count (#78) #116

Merged
TheTom merged 1 commit into feature/turboquant-kv-cache from fix/issue-78-gqa-reshape on May 1, 2026

fix(llama-graph): n_head_v reshape uses Q-head count, not KV-head cou…

b6f8e7f
labeler: succeeded May 1, 2026 in 12s