Commit 7003793

Bump images for llama2 and update compute (#436)
image and compute
1 parent 1813eaf commit 7003793

2 files changed

Lines changed: 4 additions & 6 deletions

examples/inference-deployments/llama2/llama2_13b.yaml

Lines changed: 3 additions & 4 deletions
@@ -5,9 +5,8 @@ command: |- # Note this command is a workaround until we build vllm into the inf
   pip uninstall torch -y
   pip install torch==2.0.1
 compute:
-  gpus: 1
-  instance: oci.vm.gpu.a10.1
-image: mosaicml/inference:0.1.37
-cluster: r7z15
+  gpus: 2
+  instance: oci.vm.gpu.a10.2
+image: mosaicml/inference:0.1.40
 default_model:
   model_type: llama2-13b

examples/inference-deployments/llama2/llama2_7b_chat.yaml

Lines changed: 1 addition & 2 deletions
@@ -7,7 +7,6 @@ command: |- # Note this command is a workaround until we build vllm into the inf
 compute:
   gpus: 1
   instance: oci.vm.gpu.a10.1
-image: mosaicml/inference:0.1.37
-cluster: r7z15
+image: mosaicml/inference:0.1.40
 default_model:
   model_type: llama2-7b-chat
