You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/openapi.json
+49-2Lines changed: 49 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -245,7 +245,7 @@
245
245
"models"
246
246
],
247
247
"summary": "Models Endpoint Handler",
248
-
"description": "Handle requests to the /models endpoint.\n\nProcess GET requests to the /models endpoint, returning a list of available\nmodels from the Llama Stack service.\n\nParameters:\nrequest: The incoming HTTP request.\n auth: Authentication tuple from the auth dependency.\n model_type: Optional filter to return only models matching this type.\n\nRaises:\n HTTPException: If unable to connect to the Llama Stack server or if\n model retrieval fails for any reason.\n\nReturns:\n ModelsResponse: An object containing the list of available models.",
248
+
"description": "Handle requests to the /models endpoint.\n\nProcess GET requests to the /models endpoint, returning a list of available\nmodels from the Llama Stack service. It is possible to specify \"model_type\"\nquery parameter that is used as a filter. For example, if model type is set\nto \"llm\", only LLM models will be returned:\n\n curl http://localhost:8080/v1/models?model_type=llm\n\nThe \"model_type\" query parameter is optional. When not specified, all models\nwill be returned.\n\n## Parameters:\nrequest: The incoming HTTP request.\n auth: Authentication tuple from the auth dependency.\n model_type: Optional filter to return only models matching this type.\n\n## Raises:\n HTTPException: If unable to connect to the Llama Stack server or if\n model retrieval fails for any reason.\n\n## Returns:\n ModelsResponse: An object containing the list of available models.",
"cause": "The prompt exceeds the maximum allowed length.",
3778
+
"response": "Prompt is too long"
3779
+
}
3780
+
}
3781
+
}
3782
+
}
3783
+
}
3784
+
}
3785
+
},
3766
3786
"422": {
3767
3787
"description": "Request validation failed",
3768
3788
"content": {
@@ -7201,7 +7221,7 @@
7201
7221
},
7202
7222
"type": "object",
7203
7223
"title": "Authorization headers",
7204
-
"description": "Headers to send to the MCP server. The map contains the header name and the path to a file containing the header value (secret). There are 2 special cases: 1. Usage of the kubernetes token in the header. To specify this use a string 'kubernetes' instead of the file path. 2. Usage of the clientprovided token in the header. To specify this use a string 'client' instead of the file path."
7224
+
"description": "Headers to send to the MCP server. The map contains the header name and the path to a file containing the header value (secret). There are 3 special cases: 1. Usage of the kubernetes token in the header. To specify this use a string 'kubernetes' instead of the file path. 2. Usage of the client-provided token in the header. To specify this use a string 'client' instead of the file path. 3. Usage of the oauth token in the header. To specify this use a string 'oauth' instead of the file path. "
7205
7225
},
7206
7226
"timeout": {
7207
7227
"anyOf": [
@@ -7565,6 +7585,33 @@
7565
7585
"title": "PostgreSQLDatabaseConfiguration",
7566
7586
"description": "PostgreSQL database configuration.\n\nPostgreSQL database is used by Lightspeed Core Stack service for storing\ninformation about conversation IDs. It can also be leveraged to store\nconversation history and information about quota usage.\n\nUseful resources:\n\n- [Psycopg: connection classes](https://www.psycopg.org/psycopg3/docs/api/connections.html)\n- [PostgreSQL connection strings](https://www.connectionstrings.com/postgresql/)\n- [How to Use PostgreSQL in Python](https://www.freecodecamp.org/news/postgresql-in-python/)"
7567
7587
},
7588
+
"PromptTooLongResponse": {
7589
+
"properties": {
7590
+
"status_code": {
7591
+
"type": "integer",
7592
+
"title": "Status Code"
7593
+
},
7594
+
"detail": {
7595
+
"$ref": "#/components/schemas/DetailModel"
7596
+
}
7597
+
},
7598
+
"type": "object",
7599
+
"required": [
7600
+
"status_code",
7601
+
"detail"
7602
+
],
7603
+
"title": "PromptTooLongResponse",
7604
+
"description": "413 Payload Too Large - Prompt is too long.",
7605
+
"examples": [
7606
+
{
7607
+
"detail": {
7608
+
"cause": "The prompt exceeds the maximum allowed length.",
| authorization_headers | object | Headers to send to the MCP server. The map contains the header name and the path to a file containing the header value (secret). There are 2 special cases: 1. Usage of the kubernetes token in the header. To specify this use a string 'kubernetes' instead of the file path. 2. Usage of the clientprovided token in the header. To specify this use a string 'client' instead of the file path. |
4944
+
| authorization_headers | object | Headers to send to the MCP server. The map contains the header name and the path to a file containing the header value (secret). There are 3 special cases: 1. Usage of the kubernetes token in the header. To specify this use a string 'kubernetes' instead of the file path. 2. Usage of the client-provided token in the header. To specify this use a string 'client' instead of the file path. 3. Usage of the oauth token in the header. To specify this use a string 'oauth' instead of the file path. |
4949
4945
| timeout || Timeout in seconds for requests to the MCP server. If not specified, the default timeout from Llama Stack will be used. Note: This field is reserved for future use when Llama Stack adds timeout support. |
0 commit comments