@@ -282,6 +282,14 @@ providers:
282282 content_field: chunk
283283 embedding_dimension: 384
284284 embedding_model: ${env.EMBEDDING_MODEL_DIR}
285+ chunk_window_config:
286+ chunk_parent_id_field: "parent_id"
287+ chunk_content_field: "chunk_field"
288+ chunk_index_field: "chunk_index"
289+ chunk_token_count_field: "num_tokens"
290+ parent_total_chunks_field: "total_chunks"
291+ parent_total_tokens_field: "total_tokens"
292+ chunk_filter_query: "is_chunk:true"
285293 persistence:
286294 namespace: portal-rag
287295 backend: kv_default
@@ -294,6 +302,19 @@ registered_resources:
294302 embedding_dimension: 384
295303```
296304
305+ Note: If the vector database (`portal-rag`) is not in the persistent data store of the vector_io provider
306+ (e.g. after deleting the Llama Stack cache), you will need to register it under `registered_resources`:
307+
308+
309+ ```yaml
310+ vector_stores:
311+ - embedding_dimension: 384
312+ embedding_model: sentence-transformers/${env.EMBEDDING_MODEL_DIR}
313+ provider_id: solr-vector
314+ vector_store_id: portal-rag
315+ ```
316+
317+
297318**2. Configure Lightspeed Stack (`lightspeed-stack.yaml`):**
298319
299320```yaml
@@ -324,6 +345,14 @@ Note: Solr does not currently work with RAG tools. You will need to specify "no_
324345 - **Offline mode**: Uses `parent_id` with Mimir base URL
325346 - **Online mode**: Uses `reference_url` from document metadata
326347
348+ **Query Filtering:**
349+
350+ To filter the Solr context, edit the `chunk_filter_query` field of the
351+ Solr **vector_io** provider in `run.yaml`. Filters should follow the `key:value` format,
352+ e.g. `"product:*openshift*"`.
353+
354+ Note: This static filter is a temporary workaround.
355+
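As a minimal sketch of the filter change described above, the fragment below shows only the `chunk_filter_query` key; the surrounding provider keys and indentation are omitted and should match your existing `run.yaml`. It assumes the `product` field exists in your Solr schema, which the configuration shown here does not confirm.

```yaml
# Default value from the provider configuration: match only chunk documents.
# chunk_filter_query: "is_chunk:true"

# Example static filter in key:value form, restricting results to documents
# whose (assumed) `product` field contains "openshift".
chunk_filter_query: "product:*openshift*"
```

Because the filter is static, it applies to every query sent to this provider until `run.yaml` is edited again, and a configuration change of this kind typically requires restarting the stack to take effect.
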
327356**Prerequisites:**
328357
329358- Solr must be running and accessible at the configured URL