Skip to content

Update vLLM version to v0.9.0.1#1921

Merged
chensuyue merged 10 commits into
mainfrom
update_vLLM
Jun 9, 2025
Merged

Update vLLM version to v0.9.0.1#1921
chensuyue merged 10 commits into
mainfrom
update_vLLM

Conversation

@CICD-at-OPEA

@CICD-at-OPEA CICD-at-OPEA commented May 8, 2025

Copy link
Copy Markdown
Collaborator

Update vLLM version to v0.9.0.1

Signed-off-by: CICD-at-OPEA <CICD@opea.dev>
@github-actions

github-actions Bot commented May 8, 2025

Copy link
Copy Markdown

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

@chensuyue

Copy link
Copy Markdown
Collaborator

#1933

@eero-t

eero-t commented May 12, 2025

Copy link
Copy Markdown
Contributor

Regarding CI fail, I don't think vLLM used for embeddings yet, in which case this would be unrelated to vLLM update:

+ echo '[ mega-codegen ] HTTP status is not 200. Received status was 500'
+ docker logs codegen-xeon-backend-server
...
   File "/usr/local/lib/python3.11/site-packages/aiohttp/client_reqrep.py", line 1281, in json
    raise ContentTypeError(
aiohttp.client_exceptions.ContentTypeError: 500, message='Attempt to decode JSON with unexpected mimetype: text/plain; charset=utf-8', url='http://10.244.47.206:6000/v1/embeddings'
+ exit 1

@chensuyue

Copy link
Copy Markdown
Collaborator

Regarding CI fail, I don't think vLLM used for embeddings yet, in which case this would be unrelated to vLLM update:

+ echo '[ mega-codegen ] HTTP status is not 200. Received status was 500'
+ docker logs codegen-xeon-backend-server
...
   File "/usr/local/lib/python3.11/site-packages/aiohttp/client_reqrep.py", line 1281, in json
    raise ContentTypeError(
aiohttp.client_exceptions.ContentTypeError: 500, message='Attempt to decode JSON with unexpected mimetype: text/plain; charset=utf-8', url='http://10.244.47.206:6000/v1/embeddings'
+ exit 1

Yes, I paste the issue before, #1933

yinghu5 and others added 2 commits May 13, 2025 16:00
@yinghu5 yinghu5 requested review from Copilot and yinghu5 May 15, 2025 07:03

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot wasn't able to review any files in this pull request.

Files not reviewed (15)
  • .github/env/_build_image.sh: Language not supported
  • AudioQnA/tests/test_compose_multilang_on_xeon.sh: Language not supported
  • AudioQnA/tests/test_compose_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_faqgen_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_faqgen_tgi_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_mariadb_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_milvus_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_pinecone_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_qdrant_on_xeon.sh: Language not supported
  • ChatQnA/tests/test_compose_without_rerank_on_xeon.sh: Language not supported
  • CodeGen/tests/test_compose_on_xeon.sh: Language not supported
  • CodeTrans/tests/test_compose_on_xeon.sh: Language not supported
  • DocSum/tests/test_compose_on_xeon.sh: Language not supported
  • WorkflowExecAgent/tests/2_start_vllm_service.sh: Language not supported

@yinghu5 yinghu5 left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

CICD-at-OPEA and others added 2 commits May 15, 2025 22:41
@chensuyue chensuyue changed the title Update vLLM version to v0.8.5 Update vLLM version to v0.9.0 May 16, 2025
@chensuyue

Copy link
Copy Markdown
Collaborator

Looks like in v0.9.0 release torch version doesn't match intel_extension_for_pytorch version, not sure if there is the main issue.
https://github.com/vllm-project/vllm/blob/dc1440cf9f8f6233a3c464e1a01daa12207f8680/requirements/cpu.txt#L6
https://github.com/vllm-project/vllm/blob/dc1440cf9f8f6233a3c464e1a01daa12207f8680/docker/Dockerfile.cpu#L55

ERROR! Intel® Extension for PyTorch* needs to work with PyTorch 2.6.*, but PyTorch 2.7.0+cpu is found. Please switch to the matching version and run again.

@chensuyue

Copy link
Copy Markdown
Collaborator

We may need to wait for a new release or use v0.8.5.

CICD-at-OPEA and others added 2 commits May 30, 2025 22:41
@chensuyue chensuyue requested a review from yao531441 as a code owner June 5, 2025 02:39
Signed-off-by: CICD-at-OPEA <CICD@opea.dev>
@github-actions github-actions Bot requested a review from ftian1 as a code owner June 5, 2025 22:41
@chensuyue

Copy link
Copy Markdown
Collaborator

Looks like in v0.9.0 release torch version doesn't match intel_extension_for_pytorch version, not sure if there is the main issue. https://github.com/vllm-project/vllm/blob/dc1440cf9f8f6233a3c464e1a01daa12207f8680/requirements/cpu.txt#L6 https://github.com/vllm-project/vllm/blob/dc1440cf9f8f6233a3c464e1a01daa12207f8680/docker/Dockerfile.cpu#L55

ERROR! Intel® Extension for PyTorch* needs to work with PyTorch 2.6.*, but PyTorch 2.7.0+cpu is found. Please switch to the matching version and run again.

The latest release v0.9.0.1 solve this issue, vllm-project/vllm@7ab056c

@chensuyue chensuyue changed the title Update vLLM version to v0.9.0 Update vLLM version to v0.9.0.1 Jun 9, 2025
@chensuyue

Copy link
Copy Markdown
Collaborator

AudioQnA issue trace in #2048

@chensuyue chensuyue merged commit 1b22200 into main Jun 9, 2025
64 of 66 checks passed
@chensuyue chensuyue deleted the update_vLLM branch June 9, 2025 06:47
edlee123 pushed a commit to edlee123/GenAIExamples that referenced this pull request Jun 24, 2025
Signed-off-by: CICD-at-OPEA <CICD@opea.dev>
Signed-off-by: Ed Lee <16417837+edlee123@users.noreply.github.com>
alexsin368 pushed a commit to alexsin368/GenAIExamples that referenced this pull request Aug 13, 2025
Signed-off-by: CICD-at-OPEA <CICD@opea.dev>
Signed-off-by: alexsin368 <alex.sin@intel.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
Signed-off-by: CICD-at-OPEA <CICD@opea.dev>
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants