feat(vllm-tensorizer): Bump vLLM to v0.20.2 on CUDA 13.2 / Ubuntu 24.04#160
Open
JustinPerlman wants to merge 12 commits into
Open
feat(vllm-tensorizer): Bump vLLM to v0.20.2 on CUDA 13.2 / Ubuntu 24.04#160JustinPerlman wants to merge 12 commits into
JustinPerlman wants to merge 12 commits into
Conversation
…-containers into jperlman/vllm0.20.2
|
@JustinPerlman Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/25751418629 |
Contributor
Author
Fair enough lol |
Contributor
|
Pure 13.2, no matrix with 12.9? 🫣I would really like having both options…if it’s a giant pain on vllm side it’s fine, but I think you then need to validate this actually works on b40/rtxp6000 with latest supported/installed drivers cw ships |
Contributor
|
I am still not aware of a cuda + driver combo that has decent support and works as expected, but haven’t followed too closely lately |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Ubuntu 24.04 compatibility fixes
python3-pipfrom apt inbuilder-baseand addrm -f /usr/lib/python3.*/EXTERNALLY-MANAGEDbefore pip bootstrap — on Ubuntu 24.04, apt-installed pip has no RECORD file and blocks pip self-upgradepython3-jwtin the finalbasestage before pip installs — same root cause: Debian-managed PyJWT has no RECORD file and blocks vLLM's dependency resolutioncuda-pythonversion spec from~=${CUDA_VERSION}to~=${CUDA_VERSION%.*}— patch-level CUDA versions (e.g.13.2.1) don't match available cuda-python releases; strip to major.minorwheelpackage inlmcache-builderand restore it tobuilder-basepip installRelevant information: vllm-project/vllm@6c964bd