Skip to content

Commit 417c6b7

Browse files
committed
Updated OpenAPI specification
1 parent 8361b37 commit 417c6b7

2 files changed

Lines changed: 13 additions & 3 deletions

File tree

docs/openapi.json

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -3664,7 +3664,7 @@
36643664
"rlsapi-v1"
36653665
],
36663666
"summary": "Infer Endpoint",
3667-
"description": "Handle rlsapi v1 /infer requests for stateless inference.\n\nThis endpoint serves requests from the RHEL Lightspeed Command Line Assistant (CLA).\n\nAccepts a question with optional context (stdin, attachments, terminal output,\nsystem info) and returns an LLM-generated response.\n\nArgs:\n infer_request: The inference request containing question and context.\n auth: Authentication tuple from the configured auth provider.\n\nReturns:\n RlsapiV1InferResponse containing the generated response text and request ID.\n\nRaises:\n HTTPException: 503 if the LLM service is unavailable.",
3667+
"description": "Handle rlsapi v1 /infer requests for stateless inference.\n\nThis endpoint serves requests from the RHEL Lightspeed Command Line Assistant (CLA).\n\nAccepts a question with optional context (stdin, attachments, terminal output,\nsystem info) and returns an LLM-generated response.\n\nArgs:\n infer_request: The inference request containing question and context.\n request: The FastAPI request object for accessing headers and state.\n background_tasks: FastAPI background tasks for async Splunk event sending.\n auth: Authentication tuple from the configured auth provider.\n\nReturns:\n RlsapiV1InferResponse containing the generated response text and request ID.\n\nRaises:\n HTTPException: 503 if the LLM service is unavailable.",
36683668
"operationId": "infer_endpoint_v1_infer_post",
36693669
"requestBody": {
36703670
"content": {
@@ -4290,7 +4290,7 @@
42904290
],
42914291
"summary": "Handle A2A Jsonrpc",
42924292
"description": "Handle A2A JSON-RPC requests following the A2A protocol specification.\n\nThis endpoint uses the DefaultRequestHandler from the A2A SDK to handle\nall JSON-RPC requests including message/send, message/stream, etc.\n\nThe A2A SDK application is created per-request to include authentication\ncontext while still leveraging FastAPI's authorization middleware.\n\nAutomatically detects streaming requests (message/stream JSON-RPC method)\nand returns a StreamingResponse to enable real-time chunk delivery.\n\nArgs:\n request: FastAPI request object\n auth: Authentication tuple\n mcp_headers: MCP headers for context propagation\n\nReturns:\n JSON-RPC response or streaming response",
4293-
"operationId": "handle_a2a_jsonrpc_a2a_get",
4293+
"operationId": "handle_a2a_jsonrpc_a2a_post",
42944294
"responses": {
42954295
"200": {
42964296
"description": "Successful Response",
@@ -4308,7 +4308,7 @@
43084308
],
43094309
"summary": "Handle A2A Jsonrpc",
43104310
"description": "Handle A2A JSON-RPC requests following the A2A protocol specification.\n\nThis endpoint uses the DefaultRequestHandler from the A2A SDK to handle\nall JSON-RPC requests including message/send, message/stream, etc.\n\nThe A2A SDK application is created per-request to include authentication\ncontext while still leveraging FastAPI's authorization middleware.\n\nAutomatically detects streaming requests (message/stream JSON-RPC method)\nand returns a StreamingResponse to enable real-time chunk delivery.\n\nArgs:\n request: FastAPI request object\n auth: Authentication tuple\n mcp_headers: MCP headers for context propagation\n\nReturns:\n JSON-RPC response or streaming response",
4311-
"operationId": "handle_a2a_jsonrpc_a2a_get",
4311+
"operationId": "handle_a2a_jsonrpc_a2a_post",
43124312
"responses": {
43134313
"200": {
43144314
"description": "Successful Response",
@@ -6957,6 +6957,13 @@
69576957
],
69586958
"title": "Llama Stack configuration path",
69596959
"description": "Path to configuration file used when Llama Stack is run in library mode"
6960+
},
6961+
"timeout": {
6962+
"type": "integer",
6963+
"exclusiveMinimum": 0.0,
6964+
"title": "Request timeout",
6965+
"description": "Timeout in seconds for requests to Llama Stack service. Default is 180 seconds (3 minutes) to accommodate long-running RAG queries.",
6966+
"default": 180
69606967
}
69616968
},
69626969
"additionalProperties": false,

docs/openapi.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3200,6 +3200,8 @@ system info) and returns an LLM-generated response.
32003200

32013201
Args:
32023202
infer_request: The inference request containing question and context.
3203+
request: The FastAPI request object for accessing headers and state.
3204+
background_tasks: FastAPI background tasks for async Splunk event sending.
32033205
auth: Authentication tuple from the configured auth provider.
32043206

32053207
Returns:
@@ -4860,6 +4862,7 @@ Useful resources:
48604862
| api_key | | API key to access Llama Stack service |
48614863
| use_as_library_client | | When set to true Llama Stack will be used in library mode, not in server mode (default) |
48624864
| library_client_config_path | | Path to configuration file used when Llama Stack is run in library mode |
4865+
| timeout | integer | Timeout in seconds for requests to Llama Stack service. Default is 180 seconds (3 minutes) to accommodate long-running RAG queries. |
48634866

48644867

48654868
## MCPClientAuthOptionsResponse

0 commit comments

Comments
 (0)