LCORE-1422: Add streaming responses test, prow config, and cleanup
- Add streaming Responses API inline RAG e2e scenario (stream: true)
- Copy inline RAG config to e2e-prow/rhoai/configs for prow compatibility
- Extract restart into explicit "The service is restarted" Gherkin step
- Replace custom _resolve_env_var with llama_stack replace_env_vars
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
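
The last bullet replaces a custom `_resolve_env_var` helper with llama_stack's `replace_env_vars`. To illustrate the kind of substitution involved, here is a minimal standalone sketch, assuming config values use the `${env.NAME}` and `${env.NAME:=default}` forms. The function name `expand_env_refs` is hypothetical, and this is neither the removed helper nor llama_stack's implementation.

```python
import os
import re

# Matches ${env.NAME} and ${env.NAME:=default}. The syntax is an assumption
# made for this sketch; it does not reproduce llama_stack's own parser.
_ENV_REF = re.compile(
    r"\$\{env\.(?P<name>[A-Za-z_][A-Za-z0-9_]*)(?::=(?P<default>[^}]*))?\}"
)


def expand_env_refs(value, env=None):
    """Replace each environment reference in `value` with its value.

    `env` defaults to os.environ; pass a dict to make the lookup testable.
    Raises KeyError when a variable is unset and has no default.
    """
    env = os.environ if env is None else env

    def _substitute(match):
        name = match.group("name")
        default = match.group("default")
        if name in env:
            return env[name]
        if default is not None:
            return default
        raise KeyError(f"environment variable {name!r} is not set")

    return _ENV_REF.sub(_substitute, value)


# HOST is unset, so its default applies; PORT comes from the mapping.
url = expand_env_refs("http://${env.HOST:=localhost}:${env.PORT}", {"PORT": "8080"})
```

Passing the environment as a mapping keeps the sketch testable without mutating the process environment.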
tests/e2e/features/inline_rag.feature
14 additions & 10 deletions
@@ -2,12 +2,13 @@ Feature: Inline RAG (BYOK) support tests
 
   Background:
     Given The service is started locally
+    And The system is in default state
+    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     And REST API service prefix is /v1
     And The service uses the lightspeed-stack-inline-rag.yaml configuration
+    And The service is restarted
 
   Scenario: Check if inline RAG source is registered
-    Given The system is in default state
-    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     When I access REST API endpoint rags using HTTP GET method
     Then The status code of the response is 200
     And the body of the response has the following structure
@@ -20,8 +21,6 @@ Feature: Inline RAG (BYOK) support tests
     """
 
   Scenario: Query with inline RAG returns relevant content
-    Given The system is in default state
-    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     When I use "query" to ask question with authorization header
     """
     {"query": "What is the title of the article from Paul?", "system_prompt": "You are an assistant. Write only lowercase letters"}
@@ -33,8 +32,6 @@ Feature: Inline RAG (BYOK) support tests
     And The response should contain non-empty rag_chunks
 
   Scenario: Inline RAG query includes referenced documents
-    Given The system is in default state
-    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     When I use "query" to ask question with authorization header
     """
     {"query": "What does Paul Graham say about great work?"}
@@ -43,8 +40,6 @@ Feature: Inline RAG (BYOK) support tests
     And The response should contain non-empty referenced_documents
 
   Scenario: Streaming query with inline RAG returns relevant content
-    Given The system is in default state
-    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     When I use "streaming_query" to ask question with authorization header
     """
     {"query": "What is the title of the article from Paul?", "system_prompt": "You are an assistant. Write only lowercase letters"}
@@ -56,8 +51,6 @@ Feature: Inline RAG (BYOK) support tests
       | great work |
 
   Scenario: Responses API with inline RAG returns relevant content
-    Given The system is in default state
-    And I set the Authorization header to Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiIxMjM0NTY3ODkwIiwibmFtZSI6Ikpva
     When I use "responses" to ask question with authorization header
     """
     {"input": "What is the title of the article from Paul?", "model": "{PROVIDER}/{MODEL}", "stream": false, "instructions": "You are an assistant. Write only lowercase letters"}
@@ -66,3 +59,14 @@ Feature: Inline RAG (BYOK) support tests
     And The response should contain following fragments
       | Fragments in LLM response |
       | great work |
+
+  Scenario: Streaming Responses API with inline RAG returns relevant content
+    When I use "responses" to ask question with authorization header
+    """
+    {"input": "What is the title of the article from Paul?", "model": "{PROVIDER}/{MODEL}", "stream": true, "instructions": "You are an assistant. Write only lowercase letters"}
+    """
+    Then The status code of the response is 200
+    And I wait for the response to be completed
+    And The streamed response should contain following fragments
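
The new streaming scenario checks fragments against a streamed body rather than a single JSON response. A rough sketch of how such a check could work is shown below. It assumes the stream arrives as server-sent events whose `data:` payloads are JSON objects, with text carried in `response.output_text.delta` events. The helper names `collect_streamed_text` and `missing_fragments` are hypothetical and are not the step implementations in this repository.

```python
import json


def collect_streamed_text(sse_lines):
    """Concatenate text deltas found in server-sent-event lines."""
    parts = []
    for raw in sse_lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip "event:" lines and blank separators
        payload = line[len("data:"):].strip()
        if not payload or payload == "[DONE]":
            continue
        try:
            event = json.loads(payload)
        except json.JSONDecodeError:
            continue  # ignore payloads that are not JSON
        # Assumed event shape: {"type": "response.output_text.delta", "delta": "..."}
        if isinstance(event, dict) and event.get("type") == "response.output_text.delta":
            parts.append(event.get("delta", ""))
    return "".join(parts)


def missing_fragments(text, fragments):
    """Return the fragments that do not occur in `text` (case-insensitive)."""
    lowered = text.lower()
    return [f for f in fragments if f.lower() not in lowered]


# Hand-written sample stream, not captured from the service.
sample = [
    "event: response.output_text.delta",
    'data: {"type": "response.output_text.delta", "delta": "how to do "}',
    "",
    'data: {"type": "response.output_text.delta", "delta": "great work"}',
    "",
    'data: {"type": "response.completed"}',
]
streamed = collect_streamed_text(sample)
```

Joining the deltas before matching matters because a fragment such as "great work" may be split across several events.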