Skip to content

xinference2.7.0使用vllm运行qwen-3.5模型报错 #4869

@shuo-oss

Description

@shuo-oss

System Info / 系統信息

cuda版本13.0,vllm版本0.17.1

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

v2.7.0

The command used to start Xinference / 用以启动 xinference 的命令

docker run -e XINFERENCE_MODEL_SRC=modelscope -p 9998:9997 --gpus all xprobe/xinference:v<your_version> xinference-local -H 0.0.0.0 --log-level debug

Reproduction / 复现过程

Image Image Image

Expected behavior / 期待表现

成功运行模型

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions