fix(detoken,openai): unhang client on detoken init failure; map "error" to API error path

- detoken: on _add_new_group_req_index failure, set FINISHED_ERROR, push an
  empty-string sentinel into out_tokens_queue at finish_token_index, set
  can_released_mark, and continue with the rest of the group instead of
  re-raising. Without this, the HTTP loop stays blocked (queue empty, no
  finish ever forwarded) and the shm req leaks until the client disconnects.
- openai: surface FINISHED_ERROR as a controlled error response. Non-stream
  chat / completions return HTTP 500; streaming chat / completions yield an
  SSE error event followed by [DONE] and stop. Previously, "error" leaked
  into ChatCompletionResponseChoice / CompletionChoice, whose finish_reason
  literals reject it, raising a Pydantic ValidationError.
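
Illustrative sketch only, not the code in this commit: the two changes above amount to a sentinel-on-failure producer paired with a consumer that turns the error status into a terminal SSE event. Every name here (init_group, sse_events, the queue layout, the error payload shape, and the "error" status string) is a hypothetical stand-in chosen for the example.

```python
import asyncio
import json

FINISHED_ERROR = "error"  # assumed status string, for illustration only


def init_group(req_ids, queues, add_req):
    """Register each request; on failure push a sentinel instead of re-raising."""
    for req_id in req_ids:
        try:
            add_req(req_id)
        except Exception:
            # Empty-string sentinel plus error status wakes the waiting consumer.
            queues[req_id].put_nowait(("", FINISHED_ERROR))
            continue  # the rest of the group is still processed
        queues[req_id].put_nowait(("ok", "stop"))


async def sse_events(queue):
    """Yield SSE lines; an error status becomes an error event, then [DONE]."""
    while True:
        text, finish = await queue.get()
        if finish == FINISHED_ERROR:
            payload = {"error": {"message": "internal error", "type": "server_error"}}
            yield "data: " + json.dumps(payload) + "\n\n"
            yield "data: [DONE]\n\n"
            return
        yield "data: " + json.dumps({"text": text, "finish_reason": finish}) + "\n\n"
        if finish is not None:
            yield "data: [DONE]\n\n"
            return


def run_demo():
    def add_req(req_id):
        # Simulate an init failure for one member of the group.
        if req_id == 2:
            raise RuntimeError("init failed")

    async def main():
        queues = {i: asyncio.Queue() for i in (1, 2, 3)}
        init_group([1, 2, 3], queues, add_req)
        results = {}
        for i, q in queues.items():
            results[i] = [event async for event in sse_events(q)]
        return results

    return asyncio.run(main())


if __name__ == "__main__":
    for req_id, events in run_demo().items():
        print(req_id, events)
```

In this sketch the failed request still terminates its stream cleanly, and the requests after it in the group are unaffected, which is the behavior the two bullets describe.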