You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+22-18Lines changed: 22 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,8 +14,8 @@ This repository implements [Ragas](https://github.com/explodinggradients/ragas)
14
14
The goal is to provide all of Ragas' evaluation functionality over Llama Stack's eval API, while leveraging the Llama Stack's built-in APIs for inference (llms and embeddings), datasets, and benchmarks.
15
15
16
16
There are two versions of the provider:
17
-
-`inline`: runs the Ragas evaluation in the same process as the Llama Stack server.
18
-
-`remote`: runs the Ragas evaluation in a remote process, using Kubeflow Pipelines.
17
+
-`inline`: runs the Ragas evaluation in the same process as the Llama Stack server. This is always available with the base installation.
18
+
-`remote`: runs the Ragas evaluation in a remote process, using Kubeflow Pipelines. Only available when remote dependencies are installed with `pip install llama-stack-provider-ragas[remote]`.
19
19
20
20
## Prerequisites
21
21
- Python 3.12
@@ -41,12 +41,29 @@ There are two versions of the provider:
41
41
```
42
42
- The sample LS distributions (one for inline and one for remote provider) is a simple LS distribution that uses Ollama for inference and embeddings. See the provider-specific sections below for setup and run commands.
43
43
44
-
### Remote provider (default)
44
+
### Inline provider (default with base installation)
45
+
46
+
Create a `.env` file with the required environment variable:
47
+
```bash
48
+
EMBEDDING_MODEL=ollama/all-minilm:l6-v2
49
+
```
50
+
51
+
Run the server:
52
+
```bash
53
+
dotenv run uv run llama stack run distribution/run.yaml
Copy file name to clipboardExpand all lines: docs/modules/ROOT/pages/index.adoc
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,8 +15,8 @@ The goal is to provide all of Ragas' evaluation functionality over Llama Stack's
15
15
16
16
There are two versions of the provider:
17
17
18
-
* `remote`: runs the Ragas evaluation in a remote process, using Kubeflow Pipelines. This is the *default* when using the module-based import.
19
-
* `inline`: runs the Ragas evaluation in the same process as the Llama Stack server.
18
+
* `inline`: runs the Ragas evaluation in the same process as the Llama Stack server. This is always available with the base installation.
19
+
* `remote`: runs the Ragas evaluation in a remote process, using Kubeflow Pipelines. Only available when remote dependencies are installed with `pip install llama-stack-provider-ragas[remote]`.
0 commit comments