Skip to content

Commit 8c5128a

Browse files
[Klaud Cold] Update dsr1-fp8-h200-trt (+mtp) TRT-LLM image to v1.3.0rc14 (#1487)
* Update dsr1-fp8-h200-trt (+mtp) TRT-LLM image to v1.3.0rc14 Update TensorRT-LLM image from v1.1.0rc2.post2 (154d/124d old) to v1.3.0rc14 (latest pre-release) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: fill pr-link for #1487 --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent e43ae2f commit 8c5128a

2 files changed

Lines changed: 9 additions & 2 deletions

File tree

.github/configs/nvidia-master.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3160,7 +3160,7 @@ glm5-fp8-h200-sglang-mtp:
31603160
- { tp: 8, conc-start: 4, conc-end: 64, spec-decoding: mtp }
31613161

31623162
dsr1-fp8-h200-trt:
3163-
image: nvcr.io#nvidia/tensorrt-llm/release:1.1.0rc2.post2
3163+
image: nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14
31643164
model: deepseek-ai/DeepSeek-R1-0528
31653165
model-prefix: dsr1
31663166
runner: h200
@@ -3183,7 +3183,7 @@ dsr1-fp8-h200-trt:
31833183
- { tp: 8, ep: 8, dp-attn: true, conc-start: 64, conc-end: 64 }
31843184

31853185
dsr1-fp8-h200-trt-mtp:
3186-
image: nvcr.io#nvidia/tensorrt-llm/release:1.1.0rc2.post2
3186+
image: nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14
31873187
model: deepseek-ai/DeepSeek-R1-0528
31883188
model-prefix: dsr1
31893189
runner: h200

perf-changelog.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2762,3 +2762,10 @@
27622762
description:
27632763
- "Update TensorRT-LLM image (off: v1.2.0rc6.post2 109d / mtp: v1.2.0rc6.post3 102d) to v1.3.0rc14 (latest pre-release)"
27642764
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1488
2765+
2766+
- config-keys:
2767+
- dsr1-fp8-h200-trt
2768+
- dsr1-fp8-h200-trt-mtp
2769+
description:
2770+
- "Update TensorRT-LLM image from v1.1.0rc2.post2 (154d/124d old) to v1.3.0rc14 (latest pre-release)"
2771+
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1487

0 commit comments

Comments
 (0)