Skip to content

Commit c8c6b3c

Browse files
committed
Switch B200 DSv4 TRT image to 2dd03e6
1 parent 700e5ec commit c8c6b3c

2 files changed

Lines changed: 3 additions & 3 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1801,7 +1801,7 @@ dsv4-fp4-b200-vllm-agentic:
18011801
- { tp: 8, ep: 8, dp-attn: true, offloading: cpu, conc-list: [64, 128, 256] }
18021802

18031803
dsv4-fp4-b200-trt:
1804-
image: nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc15.post1
1804+
image: ghcr.io#semianalysisai/trtllm-deepseek-v4:feat-deepseek_v4-2dd03e6
18051805
model: deepseek-ai/DeepSeek-V4-Pro
18061806
model-prefix: dsv4
18071807
runner: b200-dsv4
@@ -1822,7 +1822,7 @@ dsv4-fp4-b200-trt:
18221822
- { tp: 8, ep: 8, dp-attn: true, conc-start: 32, conc-end: 256 }
18231823

18241824
dsv4-fp4-b200-trt-mtp:
1825-
image: nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc15.post1
1825+
image: ghcr.io#semianalysisai/trtllm-deepseek-v4:feat-deepseek_v4-2dd03e6
18261826
model: deepseek-ai/DeepSeek-V4-Pro
18271827
model-prefix: dsv4
18281828
runner: b200-dsv4

perf-changelog.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3463,5 +3463,5 @@
34633463
- dsv4-fp4-b200-trt
34643464
- dsv4-fp4-b200-trt-mtp
34653465
description:
3466-
- "Use official TRT-LLM release image (nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc15.post1) for B200 DeepSeek-V4-Pro TRT configs, replacing the custom ghcr.io feat/deepseek_v4 build (9aa3715)."
3466+
- "Update B200 DeepSeek-V4-Pro TRT image to ghcr.io/semianalysisai/trtllm-deepseek-v4:feat-deepseek_v4-2dd03e6 (non-MTP and MTP), replacing the older 9aa3715 build."
34673467
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1664

0 commit comments

Comments
 (0)