Skip to content

Commit f8ba0a8

Browse files
Update qwen3.5-fp4-b300-sglang (+mtp) SGLang image to v0.5.12-cu130
Update SGLang image from v0.5.11-cu130 (5d old) to v0.5.12-cu130 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 891a72c commit f8ba0a8

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2434,7 +2434,7 @@ qwen3.5-fp8-b300-sglang:
24342434
- { tp: 4, ep: 1, conc-start: 4, conc-end: 256 }
24352435

24362436
qwen3.5-fp4-b300-sglang:
2437-
image: lmsysorg/sglang:v0.5.11-cu130
2437+
image: lmsysorg/sglang:v0.5.12-cu130
24382438
model: nvidia/Qwen3.5-397B-A17B-NVFP4
24392439
model-prefix: qwen3.5
24402440
runner: b300
@@ -2455,7 +2455,7 @@ qwen3.5-fp4-b300-sglang:
24552455
- { tp: 2, ep: 2, conc-start: 4, conc-end: 128 }
24562456

24572457
qwen3.5-fp4-b300-sglang-mtp:
2458-
image: lmsysorg/sglang:v0.5.11-cu130
2458+
image: lmsysorg/sglang:v0.5.12-cu130
24592459
model: nvidia/Qwen3.5-397B-A17B-NVFP4
24602460
model-prefix: qwen3.5
24612461
runner: b300

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2653,3 +2653,10 @@
26532653
description:
26542654
- "Update SGLang image from v0.5.9-cu129-amd64 (74d old) to v0.5.12-cu130"
26552655
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1458
2656+
2657+
- config-keys:
2658+
- qwen3.5-fp4-b300-sglang
2659+
- qwen3.5-fp4-b300-sglang-mtp
2660+
description:
2661+
- "Update SGLang image from v0.5.11-cu130 (5d old) to v0.5.12-cu130"
2662+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX

0 commit comments

Comments
 (0)