You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| score_multiplier | number | Multiplier applied to relevance scores from this vector storewhen querying multiple sources. Values > 1 boost results; values < 1 reduce them. Default: 1.0. |
138
+
| score_multiplier | number | Multiplier applied to relevance scores from this vector store. Used to weight results when querying multiple knowledge sources. Values > 1 boost this store's results; values < 1 reduce them. |
| deployment_environment | string | Deployment environment name (e.g., 'development', 'staging', 'production'). Used in telemetry events. |
190
-
| rag | | RAG strategy configuration (OKP and BYOK). Controls pre-query (Inline RAG) and tool-based (Tool RAG) retrieval. |
190
+
| rag | | Configuration for all RAG strategies (inline and tool-based). |
191
+
| okp | | OKP provider settings. Only used when 'okp' is listed in rag.inline or rag.tool. |
191
192
192
193
193
194
## ConversationHistoryConfiguration
@@ -395,6 +396,21 @@ Useful resources:
395
396
| timeout | integer | Timeout in seconds for requests to the MCP server. If not specified, the default timeout from Llama Stack will be used. Note: This field is reserved for future use when Llama Stack adds timeout support. |
Controls provider-specific behaviour for the OKP vector store.
405
+
Only relevant when ``"okp"`` is listed in ``rag.inline`` or ``rag.tool``.
406
+
407
+
408
+
| Field | Type | Description |
409
+
|-------|------|-------------|
410
+
| offline | boolean | When True, use parent_id for OKP chunk source URLs. When False, use reference_url for chunk source URLs. |
411
+
| chunk_filter_query | string | OKP filter query applied to every OKP search request. Defaults to 'is_chunk:true' to restrict results to chunk documents. To add extra constraints, extend the expression using boolean syntax, e.g. 'is_chunk:true AND product:*openshift*'. |
412
+
413
+
398
414
## PostgreSQLDatabaseConfiguration
399
415
400
416
@@ -501,6 +517,28 @@ Red Hat Identity authentication configuration.
501
517
| required_entitlements | array | List of all required entitlements. |
502
518
503
519
520
+
## RagConfiguration
521
+
522
+
523
+
RAG strategy configuration.
524
+
525
+
Controls which RAG sources are used for inline and tool-based retrieval.
526
+
527
+
Each strategy lists RAG IDs to include. The special ID ``"okp"`` defined in constants,
528
+
activates the OKP provider; all other IDs refer to entries in ``byok_rag``.
529
+
530
+
Backward compatibility:
531
+
- ``inline`` defaults to ``[]`` (no inline RAG).
532
+
- ``tool`` defaults to ``None`` which means all registered vector stores
533
+
are used (identical to the previous ``tool.byok.enabled = True`` default).
534
+
535
+
536
+
| Field | Type | Description |
537
+
|-------|------|-------------|
538
+
| inline | array | RAG IDs whose sources are injected as context before the LLM call. Use 'okp' to enable OKP inline RAG. Empty by default (no inline RAG). |
539
+
| tool | array | RAG IDs made available to the LLM as a file_search tool. Use 'okp' to include the OKP vector store. When omitted, all registered BYOK vector stores are used (backward compatibility). |
540
+
541
+
504
542
## SQLiteDatabaseConfiguration
505
543
506
544
@@ -537,62 +575,6 @@ the service can handle requests concurrently.
537
575
| cors | | Cross-Origin Resource Sharing configuration for cross-domain requests |
538
576
539
577
540
-
## RagConfiguration
541
-
542
-
543
-
Top-level RAG strategy configuration. Controls two complementary retrieval modes:
544
-
545
-
- **Inline RAG**: context is fetched from the listed sources and injected before the
546
-
LLM request.
547
-
- **Tool RAG**: the LLM can call the `file_search` tool during generation to retrieve
548
-
context on demand from the listed vector stores. Supports both BYOK and OKP.
549
-
550
-
Each strategy is configured as a list of RAG IDs referencing entries in `byok_rag`.
551
-
The special ID `okp` activates the OKP provider (no `byok_rag` entry needed).
552
-
553
-
**Backward compatibility**: omitting `tool` uses all registered BYOK vector stores
554
-
(equivalent to the old `tool.byok.enabled = True`). Omitting `inline` means no
555
-
context is injected before the LLM request.
556
-
557
-
Example:
558
-
559
-
```yaml
560
-
rag:
561
-
inline:
562
-
- my-docs # inject context from my-docs before the LLM request
563
-
tool:
564
-
- okp # LLM can search OKP as a tool
565
-
- my-docs # LLM can also search my-docs as a tool
566
-
567
-
okp:
568
-
offline: true # use parent_id for OKP URL construction
569
-
```
570
-
571
-
572
-
| Field | Type | Description |
573
-
|-------|------|-------------|
574
-
| inline | list[string] | RAG IDs whose content is injected before the LLM request. Use `okp` for OKP. Empty by default (no inline RAG). |
575
-
| tool | list[string] or null | RAG IDs exposed as a `file_search` tool the LLM can invoke. Use `okp` to include OKP. When omitted, all registered BYOK vector stores are used (backward compatibility). |
576
-
577
-
578
-
## OkpConfiguration
579
-
580
-
OKP (Offline Knowledge Portal) provider settings. Only used when `okp` is listed in `rag.inline` or `rag.tool`.
581
-
582
-
Example:
583
-
584
-
```yaml
585
-
okp:
586
-
offline: true # use parent_id for OKP URL construction
587
-
chunk_filter_query: "is_chunk:true"
588
-
```
589
-
590
-
| Field | Type | Description |
591
-
|-------|------|-------------|
592
-
| offline | boolean | When `true` (default), use `parent_id` for OKP chunk source URLs. When `false`, use `reference_url`. |
593
-
| chunk_filter_query | string | OKP filter query (`fq`) applied to every OKP search request. Defaults to `"is_chunk:true"`. Extend with `AND` for extra constraints. |
0 commit comments