Skip to content

Commit 506258a

Browse files
Klaud-Coldgithub-actions[bot]functionstackx
authored
Update dsr1-fp8-b300-sglang and dsr1-fp8-b300-sglang-mtp SGLang image to v0.5.12-cu130 (#1419)
Ref #1154 Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com> Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent 9fe6bfd commit 506258a

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1936,7 +1936,7 @@ dsr1-fp8-b200-sglang:
19361936
# does not have a B300-specific recipe, so this config reuses the existing DSR1 FP8
19371937
# B200 SGLang recipe as-is until B300-specific tuning is available.
19381938
dsr1-fp8-b300-sglang:
1939-
image: lmsysorg/sglang:v0.5.11-cu130
1939+
image: lmsysorg/sglang:v0.5.12-cu130
19401940
model: deepseek-ai/DeepSeek-R1-0528
19411941
model-prefix: dsr1
19421942
runner: b300
@@ -2568,7 +2568,7 @@ dsr1-fp8-b200-sglang-mtp:
25682568
# B200 SGLang MTP recipe as-is until B300-specific tuning is available. Image bumped
25692569
# to v0.5.10.post1-cu130 to match the standard B300 SGLang image used by other B300 configs.
25702570
dsr1-fp8-b300-sglang-mtp:
2571-
image: lmsysorg/sglang:v0.5.10.post1-cu130
2571+
image: lmsysorg/sglang:v0.5.12-cu130
25722572
model: deepseek-ai/DeepSeek-R1-0528
25732573
model-prefix: dsr1
25742574
runner: b300

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2560,3 +2560,10 @@
25602560
description:
25612561
- "Update vLLM image from v0.20.2 to v0.21.0"
25622562
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1401
2563+
2564+
- config-keys:
2565+
- dsr1-fp8-b300-sglang
2566+
- dsr1-fp8-b300-sglang-mtp
2567+
description:
2568+
- "Update SGLang image from v0.5.11-cu130 to v0.5.12-cu130"
2569+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1419

0 commit comments

Comments
 (0)