Commit 529fdd3
feat(router): openai-harmony bypass for gpt-oss tool calls
Closes #444 #455 #468 #480 #513.
Replaces wire-text reconstruction of gpt-oss tool calls with the structured-passthrough architecture used by vLLM and SGLang. HarmonyStreamingRouter delegates to openai-harmony's StreamableParser and emits RouterEvent.tool_call as a structured (name, arguments) pair; the engine surfaces these as GenerationOutput.tool_calls; and the routes bypass regex-based parsing via _parse_tool_calls_with_parser(structured_tool_calls=...).
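A minimal sketch of the structured-passthrough idea, not the code in this commit: once the parser hands over a tool call as a (name, arguments) pair, the response entry can be built by copying fields, with no regex over generated text. `ToolCallEvent` and `to_openai_tool_calls` are hypothetical names invented for illustration, and the `call_<n>` id scheme is an assumption.

```python
from dataclasses import dataclass


@dataclass
class ToolCallEvent:
    """Hypothetical stand-in for a structured RouterEvent.tool_call payload."""
    name: str
    arguments: str  # JSON text exactly as the parser produced it


def to_openai_tool_calls(events, id_prefix="call"):
    """Map structured events to OpenAI-style tool_calls entries.

    The name and arguments are copied verbatim from each event; nothing is
    re-parsed out of the model's wire text.
    """
    calls = []
    for index, event in enumerate(events):
        calls.append({
            "index": index,
            "id": f"{id_prefix}_{index}",  # assumed id scheme, illustration only
            "type": "function",
            "function": {"name": event.name, "arguments": event.arguments},
        })
    return calls
```

Because the arguments string is passed through untouched, quoting or brace patterns inside it cannot confuse a downstream text parser.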
The streaming fast path enforces tool_calls[*].index monotonicity across router chunks and the parallel_tool_calls=false cap. After 16 rounds of codex review, the final round was rated MERGE-SAFE: 4484 unit / 553 targeted / 3×3 stress matrix green.
1 parent 7f45f0a commit 529fdd3
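The two streaming checks named in the message can be sketched as follows. This is an illustration under stated assumptions, not the commit's implementation: `ToolCallIndexGuard` is a hypothetical name, "monotonic" is taken to mean non-decreasing (so argument fragments for the same call may repeat an index), and raising on a regression versus silently dropping extra calls under the cap are both guesses at policy.

```python
class ToolCallIndexGuard:
    """Hypothetical guard applied to streamed tool_calls deltas."""

    def __init__(self, parallel_tool_calls=True):
        self.parallel_tool_calls = parallel_tool_calls
        self._last_index = -1  # highest index forwarded so far

    def filter(self, deltas):
        """Return the deltas from one router chunk that may be forwarded."""
        accepted = []
        for delta in deltas:
            index = delta["index"]
            # Indices must never go backwards across chunks (assumed policy: raise).
            if index < self._last_index:
                raise ValueError(
                    f"tool_calls index went from {self._last_index} to {index}"
                )
            # With parallel_tool_calls=False, forward only the first call
            # (assumed policy: drop the rest).
            if not self.parallel_tool_calls and index > 0:
                continue
            self._last_index = index
            accepted.append(delta)
        return accepted
```

The guard keeps state across calls to `filter`, which is what lets it catch a regression that spans two router chunks rather than only one inside a single chunk.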
14 files changed
Lines changed: 2657 additions & 79 deletions
File tree
- tests
  - parsers/regressions
- vllm_mlx
  - engine
  - routes
  - service