We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent d570a11 commit 44aec22Copy full SHA for 44aec22
1 file changed
perf-changelog.yaml
@@ -80,4 +80,9 @@
80
description: |
81
- Update vLLM image for NVIDIA configs from vLLM 0.11.0 to vLLM 0.11.2
82
- Adds kv-cache-dtype: fp8 to benchmarks/gptoss_fp4_b200_docker.sh
83
- PR: https://github.com/InferenceMAX/InferenceMAX/pull/273
+ PR: https://github.com/InferenceMAX/InferenceMAX/pull/273
84
+- config-keys:
85
+ - dsr1-fp4-mi355x-sglang
86
+ description: |
87
+ - Updating MI355x Deepseek-R1 FP4 SGLang Image to upstream v0.5.6.post1
88
+ PR: https://github.com/InferenceMAX/InferenceMAX/pull/330
0 commit comments