Commit c541848

better readme + 3.14
1 parent 23bd80e commit c541848

2 files changed

Lines changed: 9 additions & 2 deletions

.github/workflows/vllm.yml

Lines changed: 1 addition & 2 deletions
@@ -32,8 +32,7 @@ env:
   VLLM_MODEL: "Qwen/Qwen3-0.6B"
   # we only test on Ubuntu to keep vLLM server running simple
   TEST_MATRIX_OS: '["ubuntu-latest"]'
-  # numba not compatible with Python 3.14
-  TEST_MATRIX_PYTHON: '["3.10", "3.13"]'
+  TEST_MATRIX_PYTHON: '["3.10", "3.14"]'
 
 jobs:
   compute-test-matrix:
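
The two `TEST_MATRIX_*` values are JSON arrays stored as strings. The diff does not show how the `compute-test-matrix` job consumes them, so the following is only a minimal sketch of the idea, assuming the job expands them into one entry per OS and Python version. The function name `expand_matrix` and the `os` / `python-version` keys are hypothetical and do not come from the workflow.

```python
import json
from itertools import product

# Values copied from the workflow's env block after this commit.
TEST_MATRIX_OS = '["ubuntu-latest"]'
TEST_MATRIX_PYTHON = '["3.10", "3.14"]'


def expand_matrix(os_json: str, python_json: str) -> list[dict[str, str]]:
    """Return one dict per (os, python-version) combination.

    Hypothetical helper: the key names are chosen for illustration only.
    """
    os_list = json.loads(os_json)
    python_list = json.loads(python_json)
    return [
        {"os": os_name, "python-version": version}
        for os_name, version in product(os_list, python_list)
    ]


matrix = expand_matrix(TEST_MATRIX_OS, TEST_MATRIX_PYTHON)
for entry in matrix:
    print(entry)
```

Note that the versions are quoted inside the JSON array. An unquoted `3.10` would be parsed as the number `3.1`, which is why the workflow writes `"3.10"` as a string.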

integrations/vllm/README.md

Lines changed: 8 additions & 0 deletions
@@ -10,3 +10,11 @@
 ## Contributing
 
 Refer to the general [Contribution Guidelines](https://github.com/deepset-ai/haystack-core-integrations/blob/main/CONTRIBUTING.md).
+
+To run integration tests locally, you need to have a running vLLM server. Refer to the [workflow file](https://github.com/deepset-ai/haystack-core-integrations/blob/main/.github/workflows/vllm.yml) for more details.
+
+For example, on macOs, you can install [vLLM-metal](https://github.com/vllm-project/vllm-metal) and run the server with:
+
+```bash
+source ~/.venv-vllm-metal/bin/activate && vllm serve Qwen/Qwen3-0.6B --reasoning-parser qwen3 --max-model-len 1024 --enforce-eager --dtype bfloat16 --enable-auto-tool-choice --tool-call-parser hermes
+```
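
Because the integration tests need a running server, it can help to wait until the server answers before starting them. The sketch below is not part of this commit. It assumes the server listens on vLLM's default port 8000 and that its OpenAI-compatible `/v1/models` endpoint returns HTTP 200 once the model is loaded. The helper names `server_is_up` and `wait_until_ready` are hypothetical.

```python
import time
import urllib.error
import urllib.request
from typing import Callable

# Assumption: default vLLM port; adjust if `vllm serve` was started with --port.
BASE_URL = "http://localhost:8000"


def server_is_up(base_url: str = BASE_URL) -> bool:
    """Return True if GET {base_url}/v1/models answers with HTTP 200."""
    try:
        with urllib.request.urlopen(f"{base_url}/v1/models", timeout=2) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status: not ready yet.
        return False


def wait_until_ready(
    probe: Callable[[], bool] = server_is_up,
    timeout_s: float = 120.0,
    interval_s: float = 2.0,
) -> bool:
    """Call `probe` repeatedly until it returns True or `timeout_s` elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        if probe():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)
```

A typical use would be to call `wait_until_ready()` in a small script and exit with a non-zero status if it returns `False`, then launch the tests. The `probe` parameter is injectable so the polling loop can be exercised without a real server.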
