Skip to content

Pull requests: SemiAnalysisAI/InferenceX

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[AMD][MI355X] update model for gpt-oss full-sweep-enabled
#1670 opened Jun 5, 2026 by chunfangamd Collaborator Loading…
Update DSv4 B200 TRT image to 2dd03e6 (non-MTP + MTP) full-sweep-enabled
#1664 opened Jun 4, 2026 by Oseltamivir Collaborator Loading…
3 tasks
[WIP] Initial work to add llm-d-vllm framework with H200
#1660 opened Jun 4, 2026 by ezrasilvera Collaborator Loading…
Throwaway: conc-64 gsm8k eval for DEP8+MTP3 dispatch token bug non-canary-full-sweep-enabled Run the full sweep without the canary gate (full search space, no trim)
#1659 opened Jun 3, 2026 by Oseltamivir Collaborator Loading…
[WIP] Update Dsv4 B300 configs full-sweep-enabled
#1656 opened Jun 3, 2026 by wzhao18 Collaborator Loading…
Update B200 Dsv4 configs full-sweep-enabled
#1655 opened Jun 3, 2026 by wzhao18 Collaborator Loading…
[DNM][AMD] agentx-v0.4
#1654 opened Jun 3, 2026 by seungrokj Collaborator Loading…
[NV] Add GitHub Action to collect SPEED-Bench AL matrix
#1650 opened Jun 2, 2026 by qiching Loading…
3 tasks done
fix(power): classify zero-decode-GPU multinode runs as aggregated
#1646 opened Jun 2, 2026 by arygupt Collaborator Loading…
[WIP] agentX v0.4
#1640 opened Jun 2, 2026 by cquil11 Collaborator Draft
feat(power): vendor-agnostic GPU power/telemetry aggregation core
#1635 opened Jun 1, 2026 by arygupt Collaborator Loading…
2 of 3 tasks
Update new fixed-AR-MTP CI workflow for kimik2.5_int4, kimik2.5_fp4, …
#1633 opened Jun 1, 2026 by haic0 Collaborator Loading…
ProTip! Follow long discussions with comments:>50.