|
1 | 1 | # CHANGELOG |
2 | 2 |
|
3 | 3 |
|
| 4 | +## v0.30.1 (2026-03-04) |
| 5 | + |
| 6 | +### Bug Fixes |
| 7 | + |
| 8 | +- Correct training entry point, env config, and GPU defaults for VAGEN |
| 9 | + ([#101](https://github.com/OpenAdaptAI/openadapt-evals/pull/101), |
| 10 | + [`ac7437d`](https://github.com/OpenAdaptAI/openadapt-evals/commit/ac7437d0c5702c7d36c4bd586c3337973c41837c)) |
| 11 | + |
| 12 | +Five blockers for running verl-agent training on g5.xlarge: |
| 13 | + |
| 14 | +A) n_gpus default: 2 -> 1 (g5.xlarge has 1 GPU; multi-GPU is for g5.12xlarge) - train_verl_e2e.py |
| 15 | + argparse default - train_waa_vagen.yaml trainer.n_gpus_per_node - vm_cli.py gpu-train --n-gpus |
| 16 | + default |
| 17 | + |
| 18 | +B) n_envs: 8 -> 1 (single WAA VM; GRPO group size is rollout.n, not n_envs) - train_waa_vagen.yaml |
| 19 | + envs[0].n_envs |
| 20 | + |
| 21 | +C) Training entry point: verl.trainer.main_ppo -> vagen.main_ppo - VAGEN has its own entry point |
| 22 | + with Hydra config support - Added --config-path and --config-name Hydra args |
| 23 | + |
| 24 | +D) Generated config: full training config -> env spec only - _generate_training_config now emits |
| 25 | + only the envs section - Algorithm, trainer, and rollout settings are Hydra overrides on CLI - |
| 26 | + data.train_files/val_files point to the env spec YAML |
| 27 | + |
| 28 | +E) Rollout config: added VAGEN-required Hydra overrides - multi_turn.enable=True for multi-step |
| 29 | + desktop tasks - rollout.n={group_size} for GRPO group size - FSDP param/optimizer offload for |
| 30 | + single-GPU memory - gradient checkpointing enabled - total_training_steps replaces total_epochs |
| 31 | + (VAGEN uses steps) - Added evaluate_url to log output |
| 32 | + |
| 33 | +Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> |
| 34 | + |
| 35 | +### Documentation |
| 36 | + |
| 37 | +- Add AWS spot instance cost analysis for GPU training |
| 38 | + ([#100](https://github.com/OpenAdaptAI/openadapt-evals/pull/100), |
| 39 | + [`c7a9177`](https://github.com/OpenAdaptAI/openadapt-evals/commit/c7a9177fa421efd998ce8e0a49452b9ea86511c0)) |
| 40 | + |
| 41 | +Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> |
| 42 | + |
| 43 | +- Add first training run runbook with pre-flight checklist |
| 44 | + ([#99](https://github.com/OpenAdaptAI/openadapt-evals/pull/99), |
| 45 | + [`dd6b6fc`](https://github.com/OpenAdaptAI/openadapt-evals/commit/dd6b6fc8938f5422bb92c9cd721ec603035a8644)) |
| 46 | + |
| 47 | +Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> |
| 48 | + |
| 49 | +- Add UNIX socket bridge section to README |
| 50 | + ([#98](https://github.com/OpenAdaptAI/openadapt-evals/pull/98), |
| 51 | + [`f7d4be9`](https://github.com/OpenAdaptAI/openadapt-evals/commit/f7d4be967bc5d313b4c8a88b5be6314637b52cde)) |
| 52 | + |
| 53 | +Add concise section explaining the nsenter+socat workaround for Docker port 5050 broken by QEMU |
| 54 | + NET_ADMIN, with recovery steps and link to the detailed architecture doc. |
| 55 | + |
| 56 | +Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> |
| 57 | + |
| 58 | + |
4 | 59 | ## v0.30.0 (2026-03-04) |
5 | 60 |
|
6 | 61 | ### Bug Fixes |
|
0 commit comments