-
Notifications
You must be signed in to change notification settings - Fork 52
Pull requests: ROCm/ATOM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[fix](gpt-oss): fix quark quantized model in moe bias
#787
opened May 14, 2026 by
PerryZhang01
Contributor
Loading…
Add DSR1-MXFP4 recipe for MI355X (Team Jons contest submission, 2840/3000)
#786
opened May 14, 2026 by
j0ons
Loading…
ci: branch-aware docker release + fix benchmark model selection
#785
opened May 14, 2026 by
ZhangLirong-amd
Contributor
Loading…
1 task
ci(benchmark): upgrade Kimi K2.5 to K2.6
#781
opened May 14, 2026 by
carlushuang
Contributor
Loading…
1 of 2 tasks
[codex] DeepSeek FP4 MTP decode safeguards and MLA hooks
#779
opened May 13, 2026 by
josusanmartin
•
Draft
feat(server): add Anthropic Messages API endpoint (/v1/messages)
#778
opened May 13, 2026 by
carlushuang
Contributor
Loading…
4 of 5 tasks
(ci)[SGLang-ATOM]: Add Qwen3.5 cases for ci, nightly and benchmark
#777
opened May 13, 2026 by
zhuyuhua-v
Collaborator
Loading…
Qwen3Next MTP for vLLM plugin mode
#772
opened May 13, 2026 by
ganyi1996ppo
Contributor
Loading…
1 task
Add mooncake dockerfile build
#771
opened May 13, 2026 by
ZhangLirong-amd
Contributor
Loading…
1 task
[Perf][vLLM-ATOM] Optimize Sparse MLA in vLLM-ATOM
#765
opened May 12, 2026 by
kliuae
Contributor
Loading…
1 task
[MoE] adapt to triton_kernels matmul_ogs -> matmul rename
#763
opened May 12, 2026 by
Liang-jianhao97
Loading…
1 task done
[feat][Attention Refactor] Reconstruct the Attention arch
#750
opened May 11, 2026 by
zejunchen-zejun
Collaborator
•
Draft
Add Mistral-3-8B + Qwen3-8B-FP8 + native triton attention backend for gfx1201 (RDNA4 / RX 9070 XT)
#749
opened May 11, 2026 by
carlushuang
Contributor
Loading…
[feat][breaking] Enable prefix caching by default
#741
opened May 11, 2026 by
functionstackx
Contributor
Loading…
3 of 6 tasks
perf: optimize GDN decode with SGLang fused recurrent kernel
#727
opened May 9, 2026 by
zovonoir
Contributor
Loading…
1 of 2 tasks
[Feat] Support GLM-4.7 MTP in vLLM-ATOM plugin
#722
opened May 8, 2026 by
kliuae
Contributor
Loading…
1 task
docs: deploy compressor page with docs workflow
#715
opened May 7, 2026 by
gyohuangxin
Member
Loading…
perf: fused Triton kernels for Qwen3.5 RMSNorm and MRoPE
#708
opened May 7, 2026 by
zovonoir
Contributor
Loading…
1 of 2 tasks
[ci] add Qwen3.5 Dense/MoE models accuracy validation and benchmark tests for atom-plugined sglang
#700
opened May 6, 2026 by
wanzhenchn
Contributor
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.