You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
LCORE-1872: Launch Llama Stack container via Makefile orchestration
Enables LCORE to automatically launch Llama Stack as a containerized service
through Makefile orchestration. Running `make run` now handles all infrastructure
setup:
i) building the container image
ii) stopping any existing instance
iii) launching a fresh llama-stack container
iv) waiting for started container's health check
v) and finally starting the lightspeed-stack service.
Signed-off-by: Anik Bhattacharjee <anbhatta@redhat.com>
Copy file name to clipboardExpand all lines: README.md
+12-49Lines changed: 12 additions & 49 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -178,21 +178,21 @@ To quickly get hands on LCS, we can run it using the default configurations prov
178
178
```bash
179
179
export OPENAI_API_KEY=sk-xxxxx
180
180
```
181
-
3. start Llama stack server
181
+
3. start LCS server
182
182
```bash
183
-
uv run llama stack run local-run.yaml
184
-
```
185
-
4. [Optional] If you're new to Llama stack, run through a quick tutorial to learn the basics of what the server is used for, by running the interactive tutorial script
186
-
```bash
187
-
./scripts/llama_stack_tutorial.sh
188
-
```
189
-
5. check the LCS settings in [lightspeed-stack.yaml](lightspeed-stack.yaml). `llama_stack.url` should be `url: http://localhost:8321`
190
-
6. start LCS server
191
-
```
192
183
make run
193
-
```
194
-
7. access LCS web UI at [http://localhost:8080/](http://localhost:8080/)
184
+
```
185
+
186
+
4. access LCS web UI at [http://localhost:8080/](http://localhost:8080/)
187
+
188
+
## Container Runtime Requirements
189
+
190
+
The service requires either Podman or Docker to launch the Llama Stack container:
195
191
192
+
- **Podman** (recommended for RHEL/Fedora): `sudo dnf install podman`
193
+
- **Docker**: Install from [docker.com](https://docs.docker.com/get-docker/)
194
+
195
+
The Makefile will auto-detect which runtime is available.
196
196
197
197
# Configuration
198
198
@@ -289,43 +289,6 @@ These settings will be used when no provider or model are specified in REST API
289
289
290
290
For a comprehensive list of supported providers, take a look [here](docs/providers.md).
291
291
292
-
## Integration with Llama Stack
293
-
294
-
The Llama Stack can be run as a standalone server and accessed via its the REST
295
-
API. However, instead of direct communication via the REST API (and JSON
296
-
format), there is an even better alternative. It is based on the so-called
297
-
Llama Stack Client. It is a library available for Python, Swift, Node.js or
298
-
Kotlin, which "wraps" the REST API stack in a suitable way, which is easier for
299
-
many applications.
300
-
301
-
302
-

303
-
304
-
305
-
306
-
## Llama Stack as separate server
307
-
308
-
If Llama Stack runs as a separate server, the Lightspeed service needs to be configured to be able to access it. For example, if server runs on localhost:8321, the service configuration stored in file `lightspeed-stack.yaml` should look like:
309
-
310
-
```yaml
311
-
name: foo bar baz
312
-
service:
313
-
host: localhost
314
-
port: 8080
315
-
auth_enabled: false
316
-
workers: 1
317
-
color_log: true
318
-
access_log: true
319
-
llama_stack:
320
-
use_as_library_client: false
321
-
url: http://localhost:8321
322
-
user_data_collection:
323
-
feedback_enabled: true
324
-
feedback_storage: "/tmp/data/feedback"
325
-
transcripts_enabled: true
326
-
transcripts_storage: "/tmp/data/transcripts"
327
-
```
328
-
329
292
### MCP Server and Tool Configuration
330
293
331
294
**Note**: The `run.yaml` configuration is currently an implementation detail. In the future, all configuration will be available directly from the lightspeed-core config.
0 commit comments