Skip to content

Commit f6a1048

Browse files
Update qwen3.5-fp4-b300-sglang (+mtp) SGLang image to v0.5.12-cu130
Update SGLang image from v0.5.11-cu130 (5d old) to v0.5.12-cu130 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 80c944e commit f6a1048

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2435,7 +2435,7 @@ qwen3.5-fp8-b300-sglang:
24352435
- { tp: 4, ep: 1, conc-start: 4, conc-end: 256 }
24362436

24372437
qwen3.5-fp4-b300-sglang:
2438-
image: lmsysorg/sglang:v0.5.11-cu130
2438+
image: lmsysorg/sglang:v0.5.12-cu130
24392439
model: nvidia/Qwen3.5-397B-A17B-NVFP4
24402440
model-prefix: qwen3.5
24412441
runner: b300
@@ -2456,7 +2456,7 @@ qwen3.5-fp4-b300-sglang:
24562456
- { tp: 2, ep: 2, conc-start: 4, conc-end: 128 }
24572457

24582458
qwen3.5-fp4-b300-sglang-mtp:
2459-
image: lmsysorg/sglang:v0.5.11-cu130
2459+
image: lmsysorg/sglang:v0.5.12-cu130
24602460
model: nvidia/Qwen3.5-397B-A17B-NVFP4
24612461
model-prefix: qwen3.5
24622462
runner: b300

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3022,3 +3022,10 @@
30223022
description:
30233023
- "Update SGLang image from nightly-dev-cu13-20260518-c67b2870 to nightly-dev-cu13-20260519-dbac4647"
30243024
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1492
3025+
3026+
- config-keys:
3027+
- qwen3.5-fp4-b300-sglang
3028+
- qwen3.5-fp4-b300-sglang-mtp
3029+
description:
3030+
- "Update SGLang image from v0.5.11-cu130 (5d old) to v0.5.12-cu130"
3031+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1475

0 commit comments

Comments
 (0)