make changes to perf changelog

cquil11 · cquil11 · commit ca8f30faa707 · 2025-12-16T23:58:26.000Z
diff --git a/perf-changelog.yaml b/perf-changelog.yaml
@@ -81,3 +81,12 @@
     - Update vLLM image for NVIDIA configs from vLLM 0.11.0 to vLLM 0.11.2
     - Adds kv-cache-dtype: fp8 to benchmarks/gptoss_fp4_b200_docker.sh
     PR: https://github.com/InferenceMAX/InferenceMAX/pull/273
+- config-keys:
+    - gptoss-fp4-b200-vllm
+    - gptoss-fp4-h100-vllm
+    - gptoss-fp4-h200-vllm
+  description: |
+    - Update vLLM image for NVIDIA configs from vLLM 0.11.2 to vLLM 0.12.0
+    - Adds VLLM_MXFP4_USE_MARLIN=1 to benchmarks/gptoss_fp4_h100_docker.sh and benchmarks/gptoss_fp4_h200_slurm.sh
+    - Adds VLLM_USE_FLASHINFER_MOE_MXFP4_MXFP8=1 to benchmarks/gptoss_fp4_h100_slurm.sh
+    PR: https://github.com/InferenceMAX/InferenceMAX/pull/327