Commit 4c2554e
Fix WDL channel order: model outputs L/D/W not W/D/L
The maia3 value head is trained with target = self_wins + 1, mapping
loss=0, draw=1, win=2. The processOutputsMaia3 function incorrectly
treated index 0 as win instead of loss, producing inverted evaluations.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 24b3e96 commit 4c2554e
1 file changed
Lines changed: 5 additions & 4 deletions
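The channel order described in the commit message can be illustrated with a minimal sketch. This is not the repository's code: the `Wdl` type and the `softmax`, `wdlFromValueLogits`, and `expectedScore` helpers are hypothetical names, and the sketch assumes the value head returns three raw logits and that `self_wins` takes values in {-1, 0, 1}, which is what the stated mapping loss=0, draw=1, win=2 implies.

```typescript
// Hypothetical sketch: names and shapes are assumptions, not the actual
// processOutputsMaia3 implementation.
type Wdl = { win: number; draw: number; loss: number };

// Numerically stable softmax over a small logit vector.
function softmax(logits: number[]): number[] {
  const max = Math.max(...logits);
  const exps = logits.map((x) => Math.exp(x - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// The value head emits channels in training-target order
// (target = self_wins + 1): index 0 = loss, index 1 = draw, index 2 = win.
function wdlFromValueLogits(logits: number[]): Wdl {
  if (logits.length !== 3) {
    throw new Error(`expected 3 value logits, got ${logits.length}`);
  }
  // Destructure as L/D/W; reading index 0 as "win" would invert the evaluation.
  const [loss, draw, win] = softmax(logits);
  return { win, draw, loss };
}

// Expected score in [0, 1] from the perspective of the side to move.
function expectedScore(wdl: Wdl): number {
  return wdl.win + 0.5 * wdl.draw;
}
```

With logits such as `[0, 0, 2]`, the largest channel is index 2, so the sketch reports a win-leaning position. Reading the same vector as W/D/L would report it as loss-leaning, which is the inversion this commit addresses.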