File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -4655,7 +4655,7 @@ dsr1-fp8-h100-dynamo-sglang:
46554655 dp-attn : true
46564656
46574657gptoss-fp4-h200-trt :
4658- image : nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc11
4658+ image : nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14
46594659 model : openai/gpt-oss-120b
46604660 model-prefix : gptoss
46614661 runner : h200
Original file line number Diff line number Diff line change 28152815 - " Update vLLM image from v0.19.0-cu130 (26d old) to v0.21.0"
28162816 pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1453
28172817
2818+ - config-keys :
2819+ - gptoss-fp4-h200-trt
2820+ description :
2821+ - " Update TensorRT-LLM image from v1.3.0rc11 (34d old) to v1.3.0rc14 (latest pre-release)"
2822+ pr-link : https://github.com/SemiAnalysisAI/InferenceX/pull/1491
2823+
28182824- config-keys :
28192825 - qwen3.5-fp4-b300-sglang
28202826 - qwen3.5-fp4-b300-sglang-mtp
You can’t perform that action at this time.
0 commit comments