perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap #4605

Open
windreamer wants to merge 6 commits into InternLM:main from windreamer:feat/guided-decoding-optimization