Skip to content

Commit 1492b0e

Browse files
[Klaud Cold] Update qwen3.5-fp8-b200-sglang (+mtp) SGLang image to v0.5.12-cu130 (#1473)
* Update qwen3.5-fp8-b200-sglang (+mtp) SGLang image to v0.5.12-cu130 Update SGLang image from nightly-dev-20260422-de962f32 (18d/12d old) to v0.5.12-cu130 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: fill pr-link for #1473 --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 35558d2 commit 1492b0e

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2107,7 +2107,7 @@ qwen3.5-bf16-b200-sglang-mtp:
21072107
# - { tp: 8, ep: 1, offloading: none, conc-list: [1, 2, 4, 8, 16, 32] }
21082108

21092109
qwen3.5-fp8-b200-sglang:
2110-
image: lmsysorg/sglang:nightly-dev-20260422-de962f32
2110+
image: lmsysorg/sglang:v0.5.12-cu130
21112111
model: Qwen/Qwen3.5-397B-A17B-FP8
21122112
model-prefix: qwen3.5
21132113
runner: b200
@@ -2375,7 +2375,7 @@ glm5-fp4-b300-sglang-mtp:
23752375
- { tp: 4, ep: 1, conc-start: 4, conc-end: 256, spec-decoding: mtp }
23762376

23772377
qwen3.5-fp8-b200-sglang-mtp:
2378-
image: lmsysorg/sglang:nightly-dev-20260422-de962f32
2378+
image: lmsysorg/sglang:v0.5.12-cu130
23792379
model: Qwen/Qwen3.5-397B-A17B-FP8
23802380
model-prefix: qwen3.5
23812381
runner: b200

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2783,3 +2783,10 @@
27832783
description:
27842784
- "Update SGLang image from nightly-dev-20260422-de962f32 (17d/13d old) to v0.5.12-cu130"
27852785
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1474
2786+
2787+
- config-keys:
2788+
- qwen3.5-fp8-b200-sglang
2789+
- qwen3.5-fp8-b200-sglang-mtp
2790+
description:
2791+
- "Update SGLang image from nightly-dev-20260422-de962f32 (18d/12d old) to v0.5.12-cu130"
2792+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1473

0 commit comments

Comments
 (0)