Skip to content

chore: update vllm to 11.0 and make changes from PR 102#159

Merged
functionstackx merged 1 commit into
mainfrom
update-vllm-11
Nov 4, 2025
Merged

chore: update vllm to 11.0 and make changes from PR 102#159
functionstackx merged 1 commit into
mainfrom
update-vllm-11

Conversation

@cquil11
Copy link
Copy Markdown
Collaborator

@cquil11 cquil11 commented Nov 4, 2025

New PR with changes present in https://github.com/InferenceMAX/InferenceMAX/pull/102
Now post-refactor

@cquil11 cquil11 requested a review from a team as a code owner November 4, 2025 04:33
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR updates vLLM from version 0.10.2 to 0.11.0 and incorporates configuration changes that were previously made in PR 102. The changes include adding compilation configuration for CUDA graph mode and fixing command-line argument syntax.

  • Updates vLLM Docker image versions from v0.10.2 to v0.11.0
  • Adds CUDA graph compilation configuration with "PIECEWISE" mode
  • Fixes command-line argument format for the config parameter

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

File Description
.github/configs/nvidia-master.yaml Updates vLLM Docker image versions to v0.11.0
benchmarks/gptoss_fp4_h100_slurm.sh Adds compilation config and fixes --config argument format
benchmarks/gptoss_fp4_h100_docker.sh Adds compilation config and fixes --config argument format
benchmarks/gptoss_fp4_h200_slurm.sh Adds compilation config for CUDA graph mode

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@functionstackx
Copy link
Copy Markdown
Collaborator

@cquil11 can u screenshot the test command and send links to the h100, h200, b200 validation

excited for p00 of https://github.com/InferenceMAX/InferenceMAX/issues/120

@functionstackx
Copy link
Copy Markdown
Collaborator

@cquil11 overall lgtm from reading the code but need validation links

@cquil11
Copy link
Copy Markdown
Collaborator Author

cquil11 commented Nov 4, 2025

Copy link
Copy Markdown
Collaborator

@functionstackx functionstackx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lgtm

@functionstackx functionstackx merged commit 7103d66 into main Nov 4, 2025
17 checks passed
@functionstackx functionstackx deleted the update-vllm-11 branch November 4, 2025 15:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants