Pull requests: GeeeekExplorer/nano-vllm

Pull requests list

docs: use hf download in README
#194 opened Apr 2, 2026 by sablecode
fix: correct postprocess return type annotation
#192 opened Mar 30, 2026 by Desirer
Fix CUDA graph block_tables shape mismatch
#191 opened Mar 24, 2026 by ilrewrite
Feature/support llama3
#188 opened Mar 21, 2026 by wudong5
fix: update download command for model weights in README
#185 opened Mar 12, 2026 by SYaoJun
docs: add Chinese README and language links
#183 opened Mar 8, 2026 by LJS1124
add a Dockerfile for nano-vllm
#178 opened Mar 3, 2026 by pacoxu
[Doc]Add Repository Architecture Overview Document
#177 opened Feb 26, 2026 by CalvinXKY
Update embed_head.py
#174 opened Feb 21, 2026 by TianduoWang
enable 'slots=True' for dataclasses
#172 opened Feb 9, 2026 by IceCreamMilkyTea
fix: modify input when input is fp32
#171 opened Feb 8, 2026 by philhuan
fix(rms_norm): add copy for residual
#169 opened Jan 28, 2026 by tpoisonooo
test
#160 opened Jan 15, 2026 by volcano98
remove hard code for block_size
#148 opened Dec 29, 2025 by guodongxiaren
bug for tensor parallelism # issue 144
#145 opened Dec 17, 2025 by LiaoMengqi
With detailed Chinese comments for easy learning
#138 opened Nov 29, 2025 by lioZ129
[ADD] Add TTFT, TPOT metrics in tqdm bars.
#133 opened Nov 16, 2025 by mumupika
Add Qwen3-VL multimodal support
#132 opened Nov 11, 2025 by 86MaxCao