Commit 72b1a58

fix: update inference processor from 'inf2' to 'neuronx'
The current version generates the following image tag for 0.4.1:
    0.10.2-inf2-py310-sdk2.26.0-ubuntu22.04
But it should be:
    0.10.2-neuronx-py310-sdk2.26.0-ubuntu22.04
1 parent 33bf993 commit 72b1a58
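
To make the effect of the processor name concrete, here is a minimal sketch of how the two tags in the commit message differ. It assumes the tag is simply the hyphen-joined sequence of prefix, processor, Python version, SDK version, and container version, which matches both tags quoted above. The `build_tag` helper is hypothetical and is not the SageMaker SDK's own tag-assembly code.

```python
def build_tag(tag_prefix, processor, py_version, sdk_version, container_version):
    """Join the tag components with hyphens, in the order seen in the commit message.

    Hypothetical helper for illustration only; the real SDK logic may differ.
    """
    return "-".join([tag_prefix, processor, py_version, sdk_version, container_version])


# Values taken from the commit message and the changed config file.
# "py310" appears only in the quoted tags, not in the visible part of the diff.
fields = dict(
    tag_prefix="0.10.2",
    py_version="py310",
    sdk_version="sdk2.26.0",
    container_version="ubuntu22.04",
)

old_tag = build_tag(processor="inf2", **fields)
new_tag = build_tag(processor="neuronx", **fields)

print(old_tag)
print(new_tag)
```

Only the processor segment changes between the two results, which is why a single string in the config is enough to produce a tag that does not exist in the registry.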

1 file changed

Lines changed: 3 additions & 3 deletions

sagemaker-core/src/sagemaker/core/image_uri_config/huggingface-vllm-neuronx.json

@@ -1,7 +1,7 @@
 {
   "inference": {
     "processors": [
-      "inf2"
+      "neuronx"
     ],
     "version_aliases": {
       "0.4": "0.4.1"
@@ -27,12 +27,12 @@
         "tag_prefix": "0.10.2",
         "repository": "huggingface-vllm-inference-neuronx",
         "container_version": {
-          "inf2": "ubuntu22.04"
+          "neuronx": "ubuntu22.04"
         },
         "sdk_versions": [
           "sdk2.26.0"
         ]
       }
     }
   }
-}
+}
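
The diff renames the processor in two places: the `processors` list and the key of `container_version`. Presumably the two need to agree, since the commit changes them together. A small check along the following lines could catch a half-applied rename. This is a sketch under that assumption: `missing_container_versions` is a hypothetical helper, not part of the SageMaker SDK, and it takes the two values directly rather than guessing at the parts of the file the diff does not show.

```python
def missing_container_versions(processors, container_version):
    """Return the processors that have no matching key in container_version.

    Hypothetical consistency check; an empty list means every processor is covered.
    """
    return [p for p in processors if p not in container_version]


# State after this commit: both places use "neuronx".
after_fix = missing_container_versions(["neuronx"], {"neuronx": "ubuntu22.04"})

# A half-applied rename: the list is updated but the mapping still says "inf2".
half_applied = missing_container_versions(["neuronx"], {"inf2": "ubuntu22.04"})

print(after_fix)
print(half_applied)
```

Running such a check over every file in `image_uri_config` would be one way to spot similar mismatches, though whether the SDK treats a mismatch as an error is not something the commit states.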