We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 60645e6 commit e6d1c86Copy full SHA for e6d1c86
1 file changed
config/samples/_v1alpha1_openmodel.yaml
@@ -14,10 +14,11 @@ spec:
14
modelHub:
15
# name: Huggingface
16
modelID: "facebook/opt-125m"
17
- inferenceFlavors:
18
- - name: a100
19
- requests:
20
- nvidia.com/gpu: 1
+ inferenceConfig:
+ flavors:
+ - name: a100
+ limits:
21
+ nvidia.com/gpu: 1
22
# nodeSelector:
23
# cloud-provider.com/accelerator: nvidia-a100
24
# params:
0 commit comments