Skip to content

Commit d27433d

Browse files
Copilotfunctionstackxcquil11Ankur-singh
authored
Update NVIDIA DeepSeek sglang Docker image from v0.5.5 to v0.5.6 (#276)
* Initial plan * Update NVIDIA DeepSeek sglang from v0.5.5 to v0.5.6 Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com> * make changes to perf changelog * fix perf-change file fix perf-changelog * fix delete line error * remove resource cleanup * update perf-change logs --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com> Co-authored-by: Cameron Quilici <cjquilici@gmail.com> Co-authored-by: Ankur Singh <ankusingh@nvidia.com>
1 parent dcfeeb7 commit d27433d

2 files changed

Lines changed: 11 additions & 3 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
dsr1-fp4-b200-sglang:
2-
image: lmsysorg/sglang:v0.5.5-cu129-amd64
2+
image: lmsysorg/sglang:v0.5.6-cu129-amd64
33
model: nvidia/DeepSeek-R1-0528-FP4-V2
44
model-prefix: dsr1
55
runner: b200
@@ -75,7 +75,7 @@ dsr1-fp4-b200-trt:
7575
- { tp: 8, ep: 8, dp-attn: true, conc-start: 64, conc-end: 256 }
7676

7777
dsr1-fp8-b200-sglang:
78-
image: lmsysorg/sglang:v0.5.5-cu129-amd64
78+
image: lmsysorg/sglang:v0.5.6-cu129-amd64
7979
model: deepseek-ai/DeepSeek-R1-0528
8080
model-prefix: dsr1
8181
runner: b200
@@ -124,7 +124,7 @@ dsr1-fp8-b200-trt:
124124
- { tp: 8, ep: 8, conc-start: 4, conc-end: 64 }
125125

126126
dsr1-fp8-h200-sglang:
127-
image: lmsysorg/sglang:v0.5.5-cu129-amd64
127+
image: lmsysorg/sglang:v0.5.6-cu129-amd64
128128
model: deepseek-ai/DeepSeek-R1-0528
129129
model-prefix: dsr1
130130
runner: h200

perf-changelog.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -116,3 +116,11 @@
116116
description:
117117
- "Updating MI355x Deepseek-R1 FP4 SGLang Image to upstream v0.5.6.post2"
118118
pr-link: https://github.com/InferenceMAX/InferenceMAX/pull/369
119+
120+
- config-keys:
121+
- dsr1-fp4-b200-sglang
122+
- dsr1-fp8-b200-sglang
123+
- dsr1-fp8-h200-sglang
124+
description:
125+
- "Update NVIDIA DeepSeek sglang Docker image from v0.5.5 to v0.5.6"
126+
pr-link: https://github.com/InferenceMAX/InferenceMAX/pull/276

0 commit comments

Comments
 (0)