Split-KV decode, refactor prefill instantiation, and add flash_attn CI benchmarking #145

Merged
airMeng merged 37 commits into main from split_kv_decode on Apr 8, 2026

Commits

Commits on Mar 18, 2026

Commits on Mar 23, 2026

Commits on Mar 24, 2026

Commits on Mar 31, 2026

Commits on Apr 2, 2026

Commits on Apr 3, 2026

Commits on Apr 7, 2026

Commits on Apr 8, 2026