Skip to content

Commit 3542221

Browse files
[Klaud Cold] Update qwen3.5-fp8-h200-sglang SGLang image to v0.5.12-cu130 (#1458)
* Update qwen3.5-fp8-h200-sglang SGLang image to v0.5.12-cu130 * chore: fill pr-link for #1458
1 parent bdccf00 commit 3542221

2 files changed

Lines changed: 7 additions & 1 deletion

File tree

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3083,7 +3083,7 @@ dsv4-fp4-b300-vllm-mtp:
30833083
- { tp: 4, ep: 4, dp-attn: true, conc-start: 256, conc-end: 512, spec-decoding: mtp }
30843084

30853085
qwen3.5-fp8-h200-sglang:
3086-
image: lmsysorg/sglang:v0.5.9-cu129-amd64
3086+
image: lmsysorg/sglang:v0.5.12-cu130
30873087
model: Qwen/Qwen3.5-397B-A17B-FP8
30883088
model-prefix: qwen3.5
30893089
runner: h200

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2647,3 +2647,9 @@
26472647
description:
26482648
- "Update SGLang image from custom glm5-hopper tag (59d old) to v0.5.12-cu130"
26492649
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1459
2650+
2651+
- config-keys:
2652+
- qwen3.5-fp8-h200-sglang
2653+
description:
2654+
- "Update SGLang image from v0.5.9-cu129-amd64 (74d old) to v0.5.12-cu130"
2655+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1458

0 commit comments

Comments
 (0)