You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
LCORE-1037: remove all llama-stack references from BYOK and RAG guides
Replace remaining Llama Stack mentions with generic terms — users
should not need to know about the underlying Llama Stack layer.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: docs/byok_guide.md
+52-39Lines changed: 52 additions & 39 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -148,10 +148,8 @@ class CustomMetadataProcessor(MetadataProcessor):
148
148
```
149
149
150
150
**Important Notes:**
151
-
- The vector database must be compatible with Llama Stack
152
151
- Supported formats:
153
-
- Llama-Stack Faiss Vector-IO
154
-
- Llama-Stack SQLite-vec Vector-IO
152
+
- Faiss Vector-IO
155
153
- The same embedding model must be used for both creation and querying
156
154
157
155
### Step 3: Configure Embedding Model
@@ -178,15 +176,15 @@ The embedding model is specified per knowledge source in the `byok_rag` section
178
176
179
177
### Step 4: Configure BYOK Knowledge Sources
180
178
181
-
Declare your knowledge sources in the `byok_rag` section of your `lightspeed-stack.yaml`. The Lightspeed Stack service automatically generates the required Llama Stack configuration at startup.
179
+
Declare your knowledge sources in the `byok_rag` section of your `lightspeed-stack.yaml`. The required configuration is automatically generated at startup when using `make run`, `make run-stack`, `docker-compose`, or library mode.
182
180
183
181
```yaml
184
182
byok_rag:
185
183
- rag_id: my-docs # Unique identifier for this knowledge source
186
184
rag_type: inline::faiss # Vector store type (default: inline::faiss)
187
185
embedding_model: sentence-transformers/all-mpnet-base-v2 # Embedding model (default)
188
186
embedding_dimension: 768# Must match your embedding model's output
189
-
vector_db_id: your-index-id# Llama Stack vector store ID (from index generation)
187
+
vector_db_id: vs_8c94967b-81cc-4028-a294-9cfac6fd9ae2# Generated by rag-content during index creation
190
188
db_path: /path/to/vector_db/faiss_store.db # Path to the vector database file
191
189
score_multiplier: 1.0# Weight for Inline RAG result ranking (default: 1.0)
192
190
```
@@ -199,7 +197,7 @@ byok_rag:
199
197
| `rag_type` | No | `inline::faiss` | Vector store provider type |
200
198
| `embedding_model` | No | `sentence-transformers/all-mpnet-base-v2` | Embedding model identifier or path |
score_multiplier: 1.2 # Boost results from this store
227
225
```
228
226
229
-
**⚠️ Important**: The `vector_db_id` value must exactly match the ID you provided when creating the vector database using the rag-content tool. This identifier links your configuration to the specific vector database index you created.
227
+
**⚠️ Important**: The `vector_db_id` value must exactly match the ID generated by the rag-content tool during index creation (e.g. `vs_8c94967b-81cc-4028-a294-9cfac6fd9ae2`). This identifier links your configuration to the specific vector database index.
> Your LLM inference provider (e.g., OpenAI, vLLM) must also be configured.
344
+
> Your LLM inference provider (e.g., OpenAI, vLLM) must also be configured in your `run.yaml`.
344
345
> For OpenAI, set the `OPENAI_API_KEY` environment variable.
345
346
346
347
### Example 2: Multiple Knowledge Sources with pgvector
347
348
348
-
A configuration combining a local FAISS store with a remote pgvector store:
349
+
A configuration combining a local FAISS store (via `byok_rag`) with a remote pgvector store (configured directly in the Llama Stack configuration file):
350
+
351
+
> [!NOTE]
352
+
> pgvector is not yet supported via `byok_rag` in `lightspeed-stack.yaml` (see [LCORE-2437](https://redhat.atlassian.net/browse/LCORE-2437)).
353
+
> The pgvector provider must be configured directly in the Llama Stack configuration file.
354
+
355
+
**`lightspeed-stack.yaml`** — FAISS store and RAG strategy:
0 commit comments