add endpoint to return prompts#1379
Conversation
|
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅ |
|
Isn't this the |
That's my understanding too. |
|
I added Are they the same thing? If so, then we should change some names and/or add comments explaining what's what, because it's confusing. |
From the IndirectSUT's pespective, requests have a prompt and a request id (which is not our notion of a prompt id). Responses have a request id and a response. Each request comes from an IndirectSUTRequest and each response gets turned into an IndirectSUTResponse. These in turn derive from our core notions of SUT requests and responses. So the terms align with our usual domain language. I think the modelbench code should stay consistent with that. It should be the BaaS server's job to translate between the BaaS user's domain model and our internal one. |
|
Are we using the request ID as the key to reconcile a prompt we sent the customer and the response to that prompt they send back to us? @wpietri |
|
@rogthefrog that's right |
|
Yes that makes sense since the request should contain more than just the prompt (e.g. max_tokens, temperature..). |
|
Great, thank you. |
|
Yes, that's also good by me. |
BaaS --get prompts--> modelrunner --get prompts--> modelbench job <-- this part