1 parent 33760bb commit f6a9d0b
1 file changed
perf-changelog.yaml
@@ -80,4 +80,4 @@
description: |
- Update vLLM image for NVIDIA configs from vLLM 0.11.0 to vLLM 0.11.2
- Adds kv-cache-dtype: fp8 to benchmarks/gptoss_fp4_b200_docker.sh
- PR: https://github.com/InferenceMAX/InferenceMAX/pull/273
+ PR: https://github.com/InferenceMAX/InferenceMAX/pull/273