[Models] Update SWA RoPE theta for MLA/GQA attention#8077
Merged
Jiang-Jia-Jun merged 3 commits intoJun 26, 2026
background
wait
wait-all
cancel
parallel
Loading