Pull requests: jd-opensource/xllm
bugfix: init ssm_cache by config ssm_cache_type and unify compute precision to fp32.
#1259
opened Apr 10, 2026 by
JC-ut0
Contributor
bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258
opened Apr 10, 2026 by
DongheJin
Collaborator
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256
opened Apr 10, 2026 by
yiming-l21
Collaborator
feat: support Qwen down_proj fallback for compressed-tensors ignored modules.
#1254
opened Apr 10, 2026 by
yingxudeng
Collaborator
bugfix: remove spurious backslash breaking output redirection in launch scripts.
#1248
opened Apr 10, 2026 by
kuishou68
feat: add mlu mooncake pd push support.
#1246
opened Apr 10, 2026 by
phantomlei3
Collaborator
refactor: simplify xllm server startup routing and lifecycle helpers.
#1243
opened Apr 9, 2026 by
liutongxuan
Collaborator
feat: support in-batch prefix cache.
#1240
opened Apr 9, 2026 by
Clement-Wang26
Collaborator
bugfix: optimize multi-modal preprocess accuracy.
#1235
opened Apr 9, 2026 by
wly-115
Collaborator
feat: add configurable decode ACL-graph fallback threshold.
#1233
opened Apr 8, 2026 by
DongheJin
Collaborator
feat: support tensor parallel for Flux model on npu device.
#1231
opened Apr 8, 2026 by
z-jun03
Collaborator
feat: expose startup runtime flags through c and python apis.
#1229
opened Apr 8, 2026 by
RobbieLeung
Collaborator
Draft
feat: enable rec fast sampler for llm beam search.
#1224
opened Apr 8, 2026 by
RobbieLeung
Collaborator
feat: improve cuda shared memory tensor handling.
#1222
opened Apr 8, 2026 by
RobbieLeung
Collaborator
bugfix: support per-fork disagg pd port for fork master.
#1220
opened Apr 8, 2026 by
Clement-Wang26
Collaborator
bugfix: fix failures when EP/DP and ACL Graph are enabled simultaneously.
#1218
opened Apr 8, 2026 by
DongheJin
Collaborator
feat: support Qwen3.5-VL model on npu device[6/N].
#1212
opened Apr 7, 2026 by
yingxudeng
Collaborator
feat: add more model manual loader.
#1210
opened Apr 7, 2026 by
Clement-Wang26
Collaborator
feat: add beam top for beam search last step for rec.
#1209
opened Apr 7, 2026 by
ChrisGao001
Collaborator
bugfix: fix the problem of incorrect kv cache data format when enabli…
#1203
opened Apr 7, 2026 by
longhui-z
Contributor
bugfix: update Glm5-W8A8 & Draft Model on npu device.
#1193
opened Apr 6, 2026 by
sanlio36
Collaborator