Commit 9a1bc8b

include vllm qwen runtimes in kustomize
1 parent 29839e7 commit 9a1bc8b

1 file changed

Lines changed: 15 additions & 16 deletions

config/runtimes/kustomization.yaml

@@ -32,21 +32,6 @@ resources:
  - srt/mistral-7b-instruct-rt.yaml
  - srt/mixtral-8x7b-instruct-pd-rt.yaml
  - srt/mixtral-8x7b-instruct-rt.yaml
- - srt/Qwen/qwen-text-tp1-rt.yaml
- - srt/Qwen/qwen-text-tp2-rt.yaml
- - srt/Qwen/qwen-text-tp4-rt.yaml
- - srt/Qwen/qwen-text-tp8-rt.yaml
- - srt/Qwen/qwen-vl-tp1-rt.yaml
- - srt/Qwen/qwen-vl-tp2-rt.yaml
- - srt/Qwen/qwen-vl-tp4-rt.yaml
- - srt/Qwen/qwen-vl-tp8-rt.yaml
- - srt/Qwen/qwen-text-tp1-fp8-rt.yaml
- - srt/Qwen/qwen-text-tp2-fp8-rt.yaml
- - srt/Qwen/qwen-text-tp4-fp8-rt.yaml
- - srt/Qwen/qwen-vl-tp1-fp8-rt.yaml
- - srt/Qwen/qwen-vl-tp2-fp8-rt.yaml
- - srt/Qwen/qwen-vl-tp4-fp8-rt.yaml
- - srt/Qwen/qwen-vl-tp8-fp8-rt.yaml
  # vLLM runtimes
  - vllm/e5-mistral-7b-instruct-rt.yaml
  - vllm/llama-3-1-405b-instruct-fp8-rt.yaml
@@ -64,4 +49,18 @@ resources:
  - vllm/llama-4-maverick-17b-128e-instruct-fp8-rt.yaml
  - vllm/llama-4-scout-17b-16e-instruct-rt.yaml
  - vllm/mistral-7b-instruct-rt.yaml
- - vllm/mixtral-8x7b-instruct-rt.yaml
+ - vllm/mixtral-8x7b-instruct-rt.yaml
+ - vllm/Qwen/qwen-tp1-rt.yaml
+ - vllm/Qwen/qwen-tp2-rt.yaml
+ - vllm/Qwen/qwen-tp4-rt.yaml
+ - vllm/Qwen/qwen-tp8-rt.yaml
+ - vllm/Qwen/qwen-tp1-fp8-rt.yaml
+ - vllm/Qwen/qwen-tp2-fp8-rt.yaml
+ - vllm/Qwen/qwen-tp4-fp8-rt.yaml
+ - vllm/Qwen/qwen-35-tp1-fp8-rt.yaml
+ - vllm/Qwen/qwen-35-tp2-fp8-rt.yaml
+ - vllm/Qwen/qwen-35-tp8-fp8-rt.yaml
+ - vllm/Qwen/qwen-35-tp1-rt.yaml
+ - vllm/Qwen/qwen-35-tp2-rt.yaml
+ - vllm/Qwen/qwen-35-tp4-rt.yaml
+ - vllm/Qwen/qwen-35-tp8-rt.yaml
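
Since this change swaps fifteen srt/Qwen entries for a new set of vllm/Qwen paths, a reviewer may want to confirm that every listed resource actually exists on disk. The following is a minimal sketch of such a check, not part of the commit: the helper names `listed_resources` and `missing_resources` are hypothetical, and the parser is a deliberately simple line-based one that assumes a flat `resources:` list of relative file paths (it does not handle remote URLs, directories with their own kustomization, or inline YAML flow lists).

```python
from pathlib import Path


def listed_resources(kustomization_text: str) -> list[str]:
    """Return the entries of the top-level `resources:` list.

    Hypothetical helper: a line-based parse that assumes one `- path` entry
    per line and skips blank lines and comments.
    """
    entries = []
    in_resources = False
    for raw in kustomization_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        # An unindented line that is not a list item starts a new top-level key.
        if not raw[0].isspace() and not line.startswith("-"):
            in_resources = line.startswith("resources:")
            continue
        if in_resources and line.startswith("- "):
            entries.append(line[2:].strip())
    return entries


def missing_resources(kustomization_path: Path) -> list[str]:
    """Return listed entries that do not resolve to an existing path.

    Entries are resolved relative to the directory holding the kustomization.
    """
    base = kustomization_path.parent
    text = kustomization_path.read_text(encoding="utf-8")
    return [entry for entry in listed_resources(text) if not (base / entry).exists()]
```

Calling `missing_resources(Path("config/runtimes/kustomization.yaml"))` from the repository root would return any entries without a matching file. Path matching follows the host filesystem, so a mismatch in the case of the `Qwen` directory name could pass on a case-insensitive filesystem and still fail elsewhere.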
