Skip to content

Commit 86ba33f

Browse files
Merge remote-tracking branch 'inferencex/nv/jasonli/minimaxm2.5-fp4-gb300-only' into stack-pr1648-on-1641-20260603
# Conflicts: # .github/configs/nvidia-master.yaml # perf-changelog.yaml # runners/launch_gb200-nv.sh
2 parents f601cfe + 66b7e04 commit 86ba33f

80 files changed

Lines changed: 6258 additions & 34 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.github/configs/amd-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1197,7 +1197,7 @@ dsr1-fp8-mi355x-atom:
11971197
- { tp: 8, conc-start: 4, conc-end: 128 }
11981198

11991199
dsr1-fp8-mi355x-atom-mtp:
1200-
image: rocm/atom:rocm7.2.3_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom20260511
1200+
image: rocm/atom:rocm7.2.4_ubuntu24.04_py3.12_pytorch_release_2.10.0_atom0.1.3
12011201
model: deepseek-ai/DeepSeek-R1-0528
12021202
model-prefix: dsr1
12031203
runner: mi355x
@@ -1209,7 +1209,7 @@ dsr1-fp8-mi355x-atom-mtp:
12091209
- isl: 1024
12101210
osl: 1024
12111211
search-space:
1212-
- { tp: 8, conc-start: 4, conc-end: 256, spec-decoding: mtp }
1212+
- { tp: 8, conc-start: 4, conc-end: 512, spec-decoding: mtp }
12131213
- isl: 8192
12141214
osl: 1024
12151215
search-space:

0 commit comments

Comments
 (0)