-cd GenAIExamples/DocSum/docker_compose/intel/cpu/xeon/
+cd GenAIExamples/DocSum/docker_compose
+source set_env.sh
+cd intel/cpu/xeon/
 ```
 
-Checkout a released version, such as v1.2:
+NOTE: by default vLLM does "warmup" at start, to optimize its performance for the specified model and the underlying platform, which can take long time. For development (and e.g. autoscaling) it can be skipped with `export VLLM_SKIP_WARMUP=true`.
 
-```
-git checkout v1.2
+Checkout a released version, such as v1.3:
+
+```bash
+git checkout v1.3
 ```
 
 ### Generate a HuggingFace Access Token
 
 Some HuggingFace resources, such as some models, are only accessible if you have an access token. If you do not already have a HuggingFace access token, you can create one by first creating an account by following the steps provided at [HuggingFace](https://huggingface.co/) and then generating a [user access token](https://huggingface.co/docs/transformers.js/en/guides/private#step-1-generating-a-user-access-token).
 
-### Configure the Deployment Environment
-
-To set up environment variables for deploying DocSum services, source the _set_env.sh_ script in this directory:
-
-```
-source ./set_env.sh
-```
-
-The _set_env.sh_ script will prompt for required and optional environment variables used to configure the DocSum services. If a value is not entered, the script will use a default value for the same. It will also generate a _.env_ file defining the desired configuration. Consult the section on [DocSum Service configuration](#docsum-service-configuration) for information on how service specific configuration parameters affect deployments.
-
 ### Deploy the Services Using Docker Compose
 
 To deploy the DocSum services, execute the `docker compose up` command with the appropriate arguments. For a default deployment, execute:
@@ -78,13 +72,13 @@ Please refer to the table below to build different microservices from source:
 
 After running docker compose, check if all the containers launched via docker compose have started:
 
-```
+```bash
 docker ps -a
 ```
 
 For the default deployment, the following 5 containers should have started:
 
-```
+```bash
 CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
 748f577b3c78 opea/whisper:latest "python whisper_s…" 5 minutes ago Up About a minute 0.0.0.0:7066->7066/tcp, :::7066->7066/tcp docsum-xeon-whisper-server
 4eq8b7034fd9 opea/docsum-gradio-ui:latest "docker-entrypoint.s…" 5 minutes ago Up About a minute 0.0.0.0:5173->5173/tcp, :::5173->5173/tcp docsum-xeon-ui-server
@@ -109,7 +103,7 @@ curl -X POST http://${host_ip}:8888/v1/docsum \
 
 To stop the containers associated with the deployment, execute the following command:
-cd GenAIExamples/DocSum/docker_compose/intel/hpu/gaudi/
+cd GenAIExamples/DocSum/docker_compose
+source set_env.sh
+cd intel/hpu/gaudi/
 ```
 
-Checkout a released version, such as v1.2:
+NOTE: by default vLLM does "warmup" at start, to optimize its performance for the specified model and the underlying platform, which can take long time. For development (and e.g. autoscaling) it can be skipped with `export VLLM_SKIP_WARMUP=true`.
 
-```
-git checkout v1.2
+Checkout a released version, such as v1.3:
+
+```bash
+git checkout v1.3
 ```
 
 ### Generate a HuggingFace Access Token
 
 Some HuggingFace resources, such as some models, are only accessible if you have an access token. If you do not already have a HuggingFace access token, you can create one by first creating an account by following the steps provided at [HuggingFace](https://huggingface.co/) and then generating a [user access token](https://huggingface.co/docs/transformers.js/en/guides/private#step-1-generating-a-user-access-token).
 
-### Configure the Deployment Environment
-
-To set up environment variables for deploying DocSum services, source the _set_env.sh_ script in this directory:
-
-```
-source ./set_env.sh
-```
-
-The _set_env.sh_ script will prompt for required and optional environment variables used to configure the DocSum services. If a value is not entered, the script will use a default value for the same. It will also generate a _.env_ file defining the desired configuration. Consult the section on [DocSum Service configuration](#docsum-service-configuration) for information on how service specific configuration parameters affect deployments.
-
 ### Deploy the Services Using Docker Compose
 
 To deploy the DocSum services, execute the `docker compose up` command with the appropriate arguments. For a default deployment, execute:
@@ -80,13 +74,13 @@ Please refer to the table below to build different microservices from source:
 
 After running docker compose, check if all the containers launched via docker compose have started:
 
-```
+```bash
 docker ps -a
 ```
 
 For the default deployment, the following 5 containers should have started:
 
-```
+```bash
 CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
 748f577b3c78 opea/whisper:latest "python whisper_s…" 5 minutes ago Up About a minute 0.0.0.0:7066->7066/tcp, :::7066->7066/tcp docsum-gaudi-whisper-server
 4eq8b7034fd9 opea/docsum-gradio-ui:latest "docker-entrypoint.s…" 5 minutes ago Up About a minute 0.0.0.0:5173->5173/tcp, :::5173->5173/tcp docsum-gaudi-ui-server
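
Review note (not part of the diff): the README asks the reader to compare `docker ps -a` output against the expected container list by eye. A minimal sketch of how that comparison could be scripted is below. It assumes the container names shown in this hunk; the function name `missing_containers` and the variable `EXPECTED` are hypothetical, and only the two names visible here are listed because the remaining three rows are cut off by the diff context.

```bash
# Expected container names. Only the two rows visible in this hunk are listed;
# the other three must be filled in from the full README (assumption).
EXPECTED="docsum-gaudi-whisper-server docsum-gaudi-ui-server"

# missing_containers NAME...
# Reads running container names from stdin (one per line) and prints every
# expected NAME that is absent. Returns 1 if any name is missing, else 0.
missing_containers() {
  running="$(cat)"
  rc=0
  for name in "$@"; do
    # -F: literal match, -x: whole line must match, -q: no output
    if ! printf '%s\n' "$running" | grep -Fxq -- "$name"; then
      printf '%s\n' "$name"
      rc=1
    fi
  done
  return "$rc"
}

# Usage against a live Docker daemon (lists running containers only):
#   docker ps --format '{{.Names}}' | missing_containers $EXPECTED
```

Feeding the names through stdin keeps the comparison independent of Docker itself, so the same function could be reused for the Xeon names (`docsum-xeon-...`) by changing `EXPECTED`.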
@@ -111,7 +105,7 @@ curl -X POST http://${host_ip}:8888/v1/docsum \
 
 To stop the containers associated with the deployment, execute the following command: