Commit 3de9afd

Merge pull request #1 from sauagarwa/main
Adjusted resource parameters of Model
2 parents 5b473a6 + 0f6c40d commit 3de9afd

1 file changed

Lines changed: 8 additions & 6 deletions

overrides/values-cpu.yaml

@@ -8,18 +8,20 @@ global:
 enabled: true
 resources:
   limits:
-    cpu: "4"
-    memory: 32Gi
+    cpu: "6"
+    memory: 48Gi
   requests:
-    cpu: "1"
-    memory: 16Gi
+    cpu: "2"
+    memory: 24Gi
 args:
   - --enable-auto-tool-choice
   - --chat-template
   - /chat-templates/tool_chat_template_llama3.2_json.jinja
   - --tool-call-parser
   - llama3_json
+  - --dtype
+  - auto
   - --max-model-len
-  - "8192"
+  - "16384"
   - --max-num-seqs
-  - "2"
+  - "1"
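
For reference, this is how the affected block would read after the merge, reconstructed only from the context and added lines of the hunk above. The indentation is an assumption, because the page extract lost leading whitespace. The parent keys above `enabled` fall outside the hunk and are omitted.

```yaml
# Post-change state of the hunk in overrides/values-cpu.yaml.
# Assumption: nesting depth is relative only; the real file nests this
# block under parent keys that the diff does not show.
enabled: true
resources:
  limits:
    cpu: "6"          # was "4"
    memory: 48Gi      # was 32Gi
  requests:
    cpu: "2"          # was "1"
    memory: 24Gi      # was 16Gi
args:
  - --enable-auto-tool-choice
  - --chat-template
  - /chat-templates/tool_chat_template_llama3.2_json.jinja
  - --tool-call-parser
  - llama3_json
  - --dtype           # added in this commit
  - auto
  - --max-model-len
  - "16384"           # was "8192"
  - --max-num-seqs
  - "1"               # was "2"
```

The flag names match vLLM server options, though the page does not name the serving runtime, so that reading is an assumption. Under it, the commit doubles the maximum context length while halving the number of concurrent sequences, so the product of the two (2 × 8192 = 1 × 16384) stays the same.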
