Commit 2315338

Update qwen3.5-fp8-b300-sglang (+mtp) SGLang image to v0.5.12-cu130
1 parent 80c944e commit 2315338

2 files changed

Lines changed: 9 additions & 2 deletions

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
@@ -2397,7 +2397,7 @@ qwen3.5-fp8-b200-sglang-mtp:
 
 
 qwen3.5-fp8-b300-sglang-mtp:
-  image: lmsysorg/sglang:v0.5.11-cu130
+  image: lmsysorg/sglang:v0.5.12-cu130
   model: Qwen/Qwen3.5-397B-A17B-FP8
   model-prefix: qwen3.5
   runner: b300
@@ -2416,7 +2416,7 @@ qwen3.5-fp8-b300-sglang-mtp:
     - { tp: 4, ep: 1, conc-start: 4, conc-end: 256, spec-decoding: mtp }
 
 qwen3.5-fp8-b300-sglang:
-  image: lmsysorg/sglang:v0.5.10.post1-cu130
+  image: lmsysorg/sglang:v0.5.12-cu130
   model: Qwen/Qwen3.5-397B-A17B-FP8
   model-prefix: qwen3.5
   runner: b300

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
@@ -3022,3 +3022,10 @@
   description:
     - "Update SGLang image from nightly-dev-cu13-20260518-c67b2870 to nightly-dev-cu13-20260519-dbac4647"
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1492
+
+- config-keys:
+    - qwen3.5-fp8-b300-sglang
+    - qwen3.5-fp8-b300-sglang-mtp
+  description:
+    - "Update SGLang image from v0.5.10.post1-cu130 / v0.5.11-cu130 (30d old) to v0.5.12-cu130"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1451
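
Reviewer note: both configs move from different starting tags (v0.5.10.post1-cu130 and v0.5.11-cu130) to the same v0.5.12-cu130 image. A minimal sketch of how one might check that each change is a forward bump on the same CUDA suffix is given here. The helpers parse_tag and is_upgrade are hypothetical and not part of this repository, and the tag shape (vX.Y.Z, optional .postN, then -cuNNN) is an assumption drawn only from the three release tags in this commit. Nightly tags such as nightly-dev-cu13-20260519-dbac4647 are deliberately rejected.

```python
import re

# Hypothetical helper, not part of the repository. Assumes release tags look
# like "v0.5.10.post1-cu130": dotted release numbers, optional ".postN",
# then a "-cuNNN" CUDA suffix.
TAG_RE = re.compile(r"^v(\d+(?:\.\d+)*)(?:\.post(\d+))?-(cu\d+)$")


def parse_tag(tag: str):
    """Return (release tuple, post number, CUDA suffix) for an image tag."""
    match = TAG_RE.match(tag)
    if match is None:
        raise ValueError(f"unrecognized tag: {tag!r}")
    release = tuple(int(part) for part in match.group(1).split("."))
    post = int(match.group(2)) if match.group(2) else 0
    return release, post, match.group(3)


def is_upgrade(old: str, new: str) -> bool:
    """True if `new` is a strictly newer release on the same CUDA suffix."""
    old_release, old_post, old_cuda = parse_tag(old)
    new_release, new_post, new_cuda = parse_tag(new)
    same_cuda = old_cuda == new_cuda
    return same_cuda and (new_release, new_post) > (old_release, old_post)


if __name__ == "__main__":
    for old in ("v0.5.10.post1-cu130", "v0.5.11-cu130"):
        print(old, "->", "v0.5.12-cu130", is_upgrade(old, "v0.5.12-cu130"))
```

Comparing the release tuple first and the post number second means a .postN build sorts after its base release but before the next patch release, which is the ordering this sketch assumes.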
