Commit fa1a9c4
fix: switch distillation collection to WAADirect for reliable task setup (#194)
Replace RLEnvironment + WAALiveAdapter with WAADirect in the distillation
data collection script. The adapter layer fails on custom YAML task IDs
and doesn't reset the environment properly.
Key changes:
- Load task configs from --task-dir (YAML/JSON files) via TaskConfig.from_dir()
- Use WAADirect.setup_task(task_config.to_waa_config()) for environment reset
- Use WAADirect.screenshot() and execute_action() instead of env.step()
- Evaluate via evaluate_milestones_screenshot() on fresh post-episode screenshot
- Fix Anthropic API call: always use max_tokens (not max_completion_tokens)
- Add --eval-model flag for milestone VLM evaluation model
- Add --task-dir as required arg (replaces server-side task discovery)
Kept unchanged: TeacherAgent, PlannerTrajectoryLogger (keep_failed=True),
CostTracker, resume support, graceful shutdown handling.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 7c402ca commit fa1a9c4
2 files changed
Lines changed: 185 additions & 204 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
325 | 325 | | |
326 | 326 | | |
327 | 327 | | |
328 | | - | |
| 328 | + | |
329 | 329 | | |
330 | 330 | | |
331 | 331 | | |
332 | 332 | | |
| 333 | + | |
333 | 334 | | |
334 | 335 | | |
335 | 336 | | |
336 | 337 | | |
| 338 | + | |
337 | 339 | | |
338 | 340 | | |
339 | 341 | | |
340 | 342 | | |
341 | 343 | | |
342 | | - | |
| 344 | + | |
343 | 345 | | |
344 | | - | |
| 346 | + | |
| 347 | + | |
345 | 348 | | |
346 | 349 | | |
347 | 350 | | |
348 | 351 | | |
| 352 | + | |
349 | 353 | | |
350 | 354 | | |
351 | 355 | | |
352 | 356 | | |
353 | 357 | | |
354 | 358 | | |
| 359 | + | |
355 | 360 | | |
356 | 361 | | |
357 | 362 | | |
358 | 363 | | |
359 | 364 | | |
| 365 | + | |
360 | 366 | | |
361 | 367 | | |
362 | | - | |
| 368 | + | |
363 | 369 | | |
364 | 370 | | |
365 | 371 | | |
366 | 372 | | |
| 373 | + | |
367 | 374 | | |
368 | 375 | | |
369 | 376 | | |
| |||
0 commit comments