Commit d857c4d
runtime: lower temperature 0.6->0.3 + top_k 20->5 for XML adherence
Lab observation 2026-05-27: the model emitted 'diag/summary' as
plain text instead of the trained '<tool_call>{...}</tool_call>'
XML. At temp=0.6/top_k=20, sampling is loose enough that the
model drifts into narrative prose despite its structured-output
training. Tightening to temp=0.3/top_k=5 favours the most likely
next token (for the trained pattern, the XML tag) without going
fully greedy (top_k=1 would risk verbose deterministic loops).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

1 parent 0b8e2db commit d857c4d
1 file changed
Lines changed: 10 additions & 2 deletions
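
The effect described in the commit message can be illustrated with a minimal sketch of top-k filtering followed by temperature-scaled softmax. This is not the runtime's actual sampler, which is not shown here: the function name `sampling_distribution`, the 30-entry toy logit vector, and the choice of index 0 as a stand-in for the `<tool_call>` token are all assumptions made for illustration.

```python
import math


def sampling_distribution(logits, temperature, top_k):
    """Return per-token probabilities after top-k filtering and
    temperature scaling (hypothetical helper, not the runtime's code).
    Tokens outside the top_k get probability 0.0."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if top_k < 1:
        raise ValueError("top_k must be at least 1")

    # Indices of the top_k largest logits (ties broken by lower index).
    keep = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]

    # Divide by temperature; lower temperature sharpens the distribution.
    scaled = {i: logits[i] / temperature for i in keep}

    # Numerically stable softmax over the surviving tokens only.
    peak = max(scaled.values())
    weights = {i: math.exp(v - peak) for i, v in scaled.items()}
    total = sum(weights.values())

    probs = [0.0] * len(logits)
    for i, w in weights.items():
        probs[i] = w / total
    return probs


# Toy logits (assumed values): index 0 plays the role of the trained
# '<tool_call>' token, followed by two plausible prose tokens and a long tail.
toy_logits = [2.0, 1.5, 1.0] + [0.0] * 27

old = sampling_distribution(toy_logits, temperature=0.6, top_k=20)
new = sampling_distribution(toy_logits, temperature=0.3, top_k=5)

print(f"P(top token) at temp=0.6/top_k=20: {old[0]:.3f}")
print(f"P(top token) at temp=0.3/top_k=5:  {new[0]:.3f}")
```

With these toy numbers the top token takes a clearly larger share of the probability mass under the tighter settings, while four alternatives remain available, so sampling is still not greedy. How strongly this carries over to the real model depends on its actual logits.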
Diff (content of the changed lines was not captured): original lines 366-367 removed, new lines 366-375 added. Original lines 363-365 and 368-370 (new 363-365 and 376-378) are unchanged context.