Skip to content

Commit d131e22

Browse files
Update qwen3.5-fp8-b300-sglang (+mtp) SGLang image to v0.5.12-cu130
1 parent c07bf5d commit d131e22

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2396,7 +2396,7 @@ qwen3.5-fp8-b200-sglang-mtp:
23962396

23972397

23982398
qwen3.5-fp8-b300-sglang-mtp:
2399-
image: lmsysorg/sglang:v0.5.11-cu130
2399+
image: lmsysorg/sglang:v0.5.12-cu130
24002400
model: Qwen/Qwen3.5-397B-A17B-FP8
24012401
model-prefix: qwen3.5
24022402
runner: b300
@@ -2415,7 +2415,7 @@ qwen3.5-fp8-b300-sglang-mtp:
24152415
- { tp: 4, ep: 1, conc-start: 4, conc-end: 256, spec-decoding: mtp }
24162416

24172417
qwen3.5-fp8-b300-sglang:
2418-
image: lmsysorg/sglang:v0.5.10.post1-cu130
2418+
image: lmsysorg/sglang:v0.5.12-cu130
24192419
model: Qwen/Qwen3.5-397B-A17B-FP8
24202420
model-prefix: qwen3.5
24212421
runner: b300

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2629,3 +2629,10 @@
26292629
description:
26302630
- "Update vLLM ROCm image from v0.18.0 to v0.21.0"
26312631
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1404
2632+
2633+
- config-keys:
2634+
- qwen3.5-fp8-b300-sglang
2635+
- qwen3.5-fp8-b300-sglang-mtp
2636+
description:
2637+
- "Update SGLang image from v0.5.10.post1-cu130 / v0.5.11-cu130 (30d old) to v0.5.12-cu130"
2638+
pr-link: PLACEHOLDER

0 commit comments

Comments
 (0)