Pull requests: jd-opensource/xllm

Pull requests list

bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258 opened Apr 10, 2026 by DongheJin Collaborator
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256 opened Apr 10, 2026 by yiming-l21 Collaborator
feat: add mlu mooncake pd push support.
#1246 opened Apr 10, 2026 by phantomlei3 Collaborator
perf: Qwen Image Optimize.
#1242 opened Apr 9, 2026 by shan-chen-feng Collaborator
feat: support in-batch prefix cache.
#1240 opened Apr 9, 2026 by Clement-Wang26 Collaborator
bugfix: optimize multi-modal preprocess accuracy.
#1235 opened Apr 9, 2026 by wly-115 Collaborator
feat: add configurable decode ACL-graph fallback threshold.
#1233 opened Apr 8, 2026 by DongheJin Collaborator
feat: support tensor parallel for Flux model on npu device.
#1231 opened Apr 8, 2026 by z-jun03 Collaborator
perf: Qwen image optimize.
#1230 opened Apr 8, 2026 by shan-chen-feng Collaborator
feat: enable rec fast sampler for llm beam search.
#1224 opened Apr 8, 2026 by RobbieLeung Collaborator
feat: improve cuda shared memory tensor handling.
#1222 opened Apr 8, 2026 by RobbieLeung Collaborator
bugfix: support per-fork disagg pd port for fork master.
#1220 opened Apr 8, 2026 by Clement-Wang26 Collaborator
feat: support Qwen3.5-VL model on npu device[6/N].
#1212 opened Apr 7, 2026 by yingxudeng Collaborator
feat: add more model manual loader.
#1210 opened Apr 7, 2026 by Clement-Wang26 Collaborator
feat: add beam top for beam search last step for rec.
#1209 opened Apr 7, 2026 by ChrisGao001 Collaborator
bugfix: update Glm5-W8A8 & Draft Model on npu device.
#1193 opened Apr 6, 2026 by sanlio36 Collaborator