Commit 114ad0e
fix: use training-appropriate evaluate timeouts instead of reordering eval (#246)
Reverts the evaluate_dense reordering from #245. Local-first was too
aggressive: it skipped binary eval entirely, losing the signal when 5050
IS available.
The actual fix: set evaluate_timeout=15s and evaluate_retries=1 on the
WAALiveAdapter in the TRL wrapper. The evaluate_dense logic stays
correct (try binary first, local fallback, take max). Training speed
comes from fast failure, not from skipping evaluation paths.
- Benchmarking: 180s timeout, 3 retries (thorough, one-shot)
- Training: 15s timeout, 1 retry (fast feedback, thousands of evals)
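The two profiles and the "binary first, local fallback, take max" order described above can be sketched as follows. This is a minimal illustration, not the code from this commit: `EvalProfile`, `try_binary`, `worst_case_seconds`, and the callable signatures are hypothetical, and it assumes `evaluate_retries` counts total attempts and that a slow binary evaluator surfaces as `TimeoutError`. Only the timeout and retry values come from the commit message.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class EvalProfile:
    """Hypothetical container for the two settings named in the commit."""
    evaluate_timeout: float  # seconds allowed per binary evaluate call
    evaluate_retries: int    # assumed to mean total attempts on the binary path


# Values from the commit message; the profile names are illustrative.
BENCHMARK = EvalProfile(evaluate_timeout=180.0, evaluate_retries=3)
TRAINING = EvalProfile(evaluate_timeout=15.0, evaluate_retries=1)


def worst_case_seconds(profile: EvalProfile) -> float:
    """Upper bound on time spent waiting for an unreachable binary evaluator."""
    return profile.evaluate_timeout * profile.evaluate_retries


def try_binary(binary_eval: Callable[[float], float],
               profile: EvalProfile) -> Optional[float]:
    """Call the binary evaluator; return None once every attempt has timed out."""
    for _ in range(profile.evaluate_retries):
        try:
            return binary_eval(profile.evaluate_timeout)
        except TimeoutError:
            continue
    return None


def evaluate_dense(binary_eval: Callable[[float], float],
                   local_eval: Callable[[], float],
                   profile: EvalProfile) -> float:
    """Try binary first, compute the local score, return the max available."""
    binary_score = try_binary(binary_eval, profile)
    local_score = local_eval()
    if binary_score is None:
        return local_score
    return max(binary_score, local_score)
```

Under these assumptions an unreachable binary evaluator costs at most 540 seconds per evaluation with the benchmarking profile and 15 seconds with the training profile, which is the "fast failure" the message refers to; the evaluation order is the same in both cases.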
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 0922b0a
3 files changed
Lines changed: 58 additions & 51 deletions
File 1 of 3 (path and diff text not captured; only the line-number gutters survive):

| Hunk | Additions | Deletions |
|---|---|---|
| `@@ -602,18 +602,27 @@` | 21 | 12 |
| `@@ -622,25 +631,15 @@` | 4 | 14 |
File 2 of 3 (path and diff text not captured; only the line-number gutters survive):

| Hunk | Additions | Deletions |
|---|---|---|
| `@@ -152,6 +152,12 @@` | 6 | 0 |
File 3 of 3 (path and diff text not captured; only the line-number gutters survive):

| Hunk | Additions | Deletions |
|---|---|---|
| `@@ -224,16 +224,16 @@` | 7 | 7 |
| `@@ -244,36 +244,37 @@` | 17 | 16 |
| `@@ -282,7 +283,8 @@` | 3 | 2 |