-1. Make sure your `MODELDIR` exists on the node where your workload is schedueled so you can cache the downloaded model for next time use. Otherwise, set `global.modelUseHostPath` to 'null' if you don't want to cache the model.
+1. Make sure your `MODELDIR` exists on the node where your workload is scheduled so you can cache the downloaded model for next time use. Otherwise, set `global.modelUseHostPath` to 'null' if you don't want to cache the model.
+
+2. Please set `http_proxy`, `https_proxy` and `no_proxy` values while installing chart, if you are behind a proxy.

 ## Verify

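Reviewer sketch for the proxy step added above: one way to turn the proxy environment variables into Helm `--set` arguments. The helper name `proxy_set_flags` is hypothetical, and the `global.http_proxy`, `global.https_proxy` and `global.no_proxy` value keys are an assumption that should be checked against the chart's `values.yaml`. Commas are escaped because Helm's `--set` treats an unescaped comma as a separator between values.

```bash
# Hypothetical helper: print "--set" and "global.<name>=<value>" on separate
# lines for every proxy variable that is set and non-empty.
# ASSUMPTION: the chart reads these settings from global.<name>.
proxy_set_flags() {
  for name in http_proxy https_proxy no_proxy; do
    # Look up the variable whose name is stored in $name.
    eval "val=\${$name:-}"
    [ -n "$val" ] || continue
    # Escape commas so Helm does not split the value (relevant for no_proxy).
    val=$(printf '%s' "$val" | sed 's/,/\\,/g')
    printf '%s\n%s\n' "--set" "global.$name=$val"
  done
}

# Example use in bash (release and chart names are placeholders):
#   mapfile -t proxy_flags < <(proxy_set_flags)
#   helm install <release> <chart> "${proxy_flags[@]}"
```

Emitting one argument per line keeps values intact when they are read into an array, so nothing depends on shell word splitting.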
````diff
@@ -46,8 +66,9 @@ Run the command `kubectl port-forward svc/chatqna 8888:8888` to expose the servi

 Open another terminal and run the following command to verify the service if working:

-```console
+```bash
 curl http://localhost:8888/v1/chatqna \
+    -X POST \
     -H "Content-Type: application/json" \
     -d '{"messages": "What is the revenue of Nike in 2023?"}'
 ```
````
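Reviewer sketch for the verify step: the same request wrapped in a small script, so a failing service produces a non-zero exit code instead of only an error body. The function names `build_payload` and `verify_chatqna` are hypothetical. The URL and question are the ones from the README, and the port-forward is assumed to be running already.

```bash
# Hypothetical helper: build the JSON body for the ChatQnA request.
# Only backslashes and double quotes are escaped; newlines and other
# control characters in the question are not handled.
build_payload() {
  escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
  printf '{"messages": "%s"}' "$escaped"
}

# Hypothetical helper: POST the question and fail on HTTP errors.
# -f makes curl return non-zero on HTTP status >= 400,
# -sS hides the progress meter but still prints errors.
verify_chatqna() {
  url=${1:-http://localhost:8888/v1/chatqna}
  question=${2:-"What is the revenue of Nike in 2023?"}
  curl -fsS "$url" \
    -X POST \
    -H "Content-Type: application/json" \
    -d "$(build_payload "$question")"
}
```

Calling `verify_chatqna` with no arguments sends the README's request. A second argument replaces the question.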
@@ -71,7 +92,6 @@ docker save -o ui.tar opea/chatqna-conversation-ui:latest
 sudo ctr -n k8s.io image import ui.tar

 # install UI using helm chart. Replace image tag if required
@@ -88,4 +108,5 @@ Access `http://localhost:5174` to play with the ChatQnA workload through UI.
 | image.repository | string |`"opea/chatqna"`||
 | service.port | string |`"8888"`||
 | tgi.LLM_MODEL_ID | string |`"Intel/neural-chat-7b-v3-3"`| Models id from https://huggingface.co/, or predownloaded model directory |
-| global.horizontalPodAutoscaler.enabled | bop; | false | HPA autoscaling for the TGI and TEI service deployments based on metrics they provide. See HPA section in ../README.md before enabling! |
+| vllm-openvino.LLM_MODEL_ID | string |`"Intel/neural-chat-7b-v3-3"`| Models id from https://huggingface.co/, or predownloaded model directory |
+| global.horizontalPodAutoscaler.enabled | bool | false | HPA autoscaling for the TGI and TEI service deployments based on metrics they provide. See HPA section in ../README.md before enabling! |