Skip to content

Commit 57b5dbd

Browse files
Klaud-Coldgithub-actions[bot]claude-fix-botfunctionstackx
authored
Update dsr1-fp8-mi325x-sglang SGLang image to v0.5.12-rocm700-mi30x (#1428)
Ref #1154 Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com> Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com> Co-authored-by: claude-fix-bot <claude-fix-bot@local> Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent 30add15 commit 57b5dbd

3 files changed

Lines changed: 10 additions & 3 deletions

File tree

.github/configs/amd-master.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -85,7 +85,7 @@ dsr1-fp8-mi300x-sglang:
8585
- { tp: 8, conc-start: 4, conc-end: 64 }
8686

8787
dsr1-fp8-mi325x-sglang:
88-
image: lmsysorg/sglang:v0.5.9-rocm700-mi30x
88+
image: lmsysorg/sglang:v0.5.12-rocm700-mi30x
8989
model: deepseek-ai/DeepSeek-R1-0528
9090
model-prefix: dsr1
9191
runner: mi325x

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2653,3 +2653,10 @@
26532653
description:
26542654
- "Update SGLang image from v0.5.9-cu129-amd64 (74d old) to v0.5.12-cu130"
26552655
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1458
2656+
2657+
- config-keys:
2658+
- dsr1-fp8-mi325x-sglang
2659+
description:
2660+
- "Update SGLang image from v0.5.9-rocm700-mi30x to v0.5.12-rocm700-mi30x"
2661+
- "Workaround LlamaTokenizer.all_special_tokens_extended removal in newer transformers: prefer backend_request_func.get_tokenizer over vLLM's"
2662+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1428

utils/bench_serving/benchmark_serving.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -46,9 +46,9 @@
4646
from transformers import PreTrainedTokenizerBase
4747

4848
try:
49-
from vllm.transformers_utils.tokenizer import get_tokenizer
50-
except ImportError:
5149
from backend_request_func import get_tokenizer
50+
except ImportError:
51+
from vllm.transformers_utils.tokenizer import get_tokenizer
5252

5353
try:
5454
from vllm.utils import FlexibleArgumentParser

0 commit comments

Comments
 (0)