docs/configuration.md: 12 additions & 8 deletions
@@ -8,13 +8,17 @@ The system configuration is driven by YAML file. The default config file is [con
 | max_threads | `50` | Maximum number of threads, set to null for Python default. 50 is OK on a typical laptop. Check your Judge-LLM service for max requests per minute |
 | fail_on_invalid_data | `true` | If `false` don't fail on invalid conversations (like missing `context` field for some metrics) |
 | skip_on_failure | `false` | If `true`, skip remaining turns and conversation metrics when a turn evaluation fails (FAIL or ERROR). Can be overridden per conversation in the input data yaml file. |
+| cache_enabled | `true` | Global caching toggle for embeddings, agent API, and LLM judge queries. (_Component-level cache settings are deprecated._) |
+| cache_base_dir | `".caches"` | Base directory for all evaluation caches (embeddings, agent, LLM judge). Component-specific subdirectories are appended automatically (`/llm` for LLM-as-a-judge and `/agent` for agent API calls). |

 ### Example
 ```yaml
 core:
   max_threads: 50
   fail_on_invalid_data: true
-  skip_on_failure: false # Set to true to stop evaluation on first failure
+  skip_on_failure: false # Set to true to stop evaluation on first failure
+  cache_enabled: true # Global cache toggle (affects all components)
+  cache_base_dir: ".caches" # Base cache directory
 ```

 ## LLM Pool
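A minimal sketch of how the new `core` cache settings could be resolved into per-component directories, based only on the table rows in this hunk. The helper name `resolve_cache_dirs` is hypothetical and is not taken from the project. Only the `llm` and `agent` subdirectory names come from the `cache_base_dir` description. The diff does not name the embeddings subdirectory, so the sketch leaves it out.

```python
from pathlib import Path
from typing import Optional

# Subdirectories named in the cache_base_dir description. The embeddings
# subdirectory is not named in the diff, so it is omitted (assumption).
COMPONENT_SUBDIRS = {"llm": "llm", "agent": "agent"}


def resolve_cache_dirs(core: dict) -> dict[str, Optional[Path]]:
    """Hypothetical helper: map each component to its cache directory,
    or to None when the global cache toggle is off."""
    enabled = core.get("cache_enabled", True)           # documented default: true
    base = Path(core.get("cache_base_dir", ".caches"))  # documented default: ".caches"
    return {
        name: (base / sub if enabled else None)
        for name, sub in COMPONENT_SUBDIRS.items()
    }


dirs = resolve_cache_dirs({"cache_enabled": True, "cache_base_dir": ".caches"})
print(dirs["llm"].as_posix())    # .caches/llm
print(dirs["agent"].as_posix())  # .caches/agent
```

Deriving every directory from one base path means a single `core` entry is enough to relocate or disable all caches.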
@@ -25,7 +29,7 @@ Define a centralized pool of LLM configurations for the Judge Panel feature.
-| cache_enabled | `true` | Is LLM cache enabled? |
+| cache_dir | `".caches/llm_cache"` | Directory with cached LLM responses (_deprecated - use `core.cache_base_dir`_) |
+| cache_enabled | `true` | Is LLM cache enabled? (_deprecated - use `core.cache_enabled`_) |

 Dynamic LLM parameters are only supported through `llm_pool` config. To use dynamic parameters, migrate to `llm_pool`.

@@ -128,8 +132,8 @@ Some Ragas metrics use embeddings to compute similarity between generated answer
 | provider | `"openai"` | Supported providers: `"openai"`, `"gemini"` or `"huggingface"`. `"huggingface"` downloads the model to the local machine and runs inference locally (requires optional dependencies). |
 | model | `"text-embedding-3-small"` | Model name for the provider |
 | provider_kwargs | `{}` | Optional arguments for the model |
-| cache_dir | `".caches/embedding_cache"` | Directory with cached embeddings |
-| cache_enabled | `true` | Is embeddings cache enabled? |
+| cache_dir | `".caches/embedding_cache"` | Directory with cached embeddings (_deprecated - use `core.cache_base_dir`_) |
+| cache_enabled | `true` | Is embeddings cache enabled? (_deprecated - use `core.cache_enabled`_) |

 #### Remote vs Local Embedding Models

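The deprecation notes in the hunks above point component-level `cache_dir` and `cache_enabled` to `core.cache_base_dir` and `core.cache_enabled`. A rough sketch of a migration check that flags leftover component-level keys in a loaded config is shown here. The function name `find_deprecated_cache_keys` and the `embedding` section name are assumptions for illustration. The diff does not say how the project itself handles deprecated keys.

```python
# Component-level keys the diff marks as deprecated, mapped to their replacements.
DEPRECATED_CACHE_KEYS = {
    "cache_dir": "core.cache_base_dir",
    "cache_enabled": "core.cache_enabled",
}


def find_deprecated_cache_keys(config: dict) -> list[str]:
    """Hypothetical check: report deprecated cache keys found outside `core`."""
    messages = []
    for section, values in config.items():
        # `core` holds the new global settings; non-mapping sections have no keys to check.
        if section == "core" or not isinstance(values, dict):
            continue
        for key, replacement in DEPRECATED_CACHE_KEYS.items():
            if key in values:
                messages.append(f"{section}.{key} is deprecated - use {replacement}")
    return messages


config = {
    "core": {"cache_enabled": True, "cache_base_dir": ".caches"},
    # Section name "embedding" is assumed for this example.
    "embedding": {"model": "text-embedding-3-small", "cache_dir": ".caches/embedding_cache"},
}
for message in find_deprecated_cache_keys(config):
    print(message)
```

Returning messages instead of raising keeps old config files loadable while still surfacing what needs to move under `core`.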
@@ -201,8 +205,8 @@ Note that it can be easily integrated with other APIs with a minimal change.
 | model | `"gpt-4o-mini"` | Model to use for API queries (optional) |