|
1 | 1 | # CHANGELOG |
2 | 2 |
|
3 | 3 |
|
| 4 | +## v0.3.0 (2026-02-05) |
| 5 | + |
| 6 | +### Bug Fixes |
| 7 | + |
| 8 | +- **cli**: Improve pool-create reliability and error handling |
| 9 | + ([`f23bd57`](https://github.com/OpenAdaptAI/openadapt-ml/commit/f23bd571a76c361d9e46d99820728ffdedb5cef5)) |
| 10 | + |
| 11 | +- Properly clean up test VM and associated resources during quota check - Use sudo for docker pull |
| 12 | + (usermod not effective in same session) - Add pool-cleanup command for orphaned resources - Show |
| 13 | + full error messages in pool creation failures |
| 14 | + |
| 15 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 16 | + |
| 17 | +- **pool**: Use WAA native task distribution with --worker_id/--num_workers |
| 18 | + ([`ef0d8c7`](https://github.com/OpenAdaptAI/openadapt-ml/commit/ef0d8c7ecf60b1644dbc5a40ed0a05b1b4c2f597)) |
| 19 | + |
| 20 | +- Fixed task distribution: WAA ignores --start_idx/--num_tasks, use native --worker_id and |
| 21 | + --num_workers parameters instead - Worker 0 gets tasks 0, N, 2N... Worker 1 gets tasks 1, N+1, |
| 22 | + 2N+1... - Use vanilla windowsarena/winarena image with correct IP (20.20.20.21) - Add container |
| 23 | + reuse check (skip restart if already running) - Pass API key via env var instead of config file - |
| 24 | + Fix QMP port exposure (7200) for QEMU control - Store Windows disk on /mnt for 300GB temp storage |
| 25 | + (D8ds_v5) |
| 26 | + |
| 27 | +Tested: 2-worker pool running 4 tasks in parallel successfully |
| 28 | + |
| 29 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 30 | + |
| 31 | +- **waa**: Use D4ds_v4 VM size for quota compatibility |
| 32 | + ([`2a51a97`](https://github.com/OpenAdaptAI/openadapt-ml/commit/2a51a976f10db135ed79971162f5605de944dd6e)) |
| 33 | + |
| 34 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 35 | + |
| 36 | +- **waa**: Use D8ds_v5 VM size for Azure ML workers |
| 37 | + ([`71a0fdd`](https://github.com/OpenAdaptAI/openadapt-ml/commit/71a0fddfa4216a49b3a37c1ff1cc2d98d1f605a1)) |
| 38 | + |
| 39 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 40 | + |
| 41 | +### Documentation |
| 42 | + |
| 43 | +- Add Azure ML log streaming and cost tracking guides |
| 44 | + ([`59c3a3e`](https://github.com/OpenAdaptAI/openadapt-ml/commit/59c3a3ef852d747599de53fe74f660aea6d5b033)) |
| 45 | + |
| 46 | +Document the new CLI commands for: - Live log streaming from Azure ML jobs - Cost tracking for |
| 47 | + compute instances - Teardown procedures |
| 48 | + |
| 49 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 50 | + |
| 51 | +### Features |
| 52 | + |
| 53 | +- **cli**: Add Azure ML log streaming, cost tracking, and teardown |
| 54 | + ([`59e3cf7`](https://github.com/OpenAdaptAI/openadapt-ml/commit/59e3cf7fbdb53f07866709efd60b05a1aa511ed5)) |
| 55 | + |
| 56 | +Add comprehensive Azure ML management commands: - azure-ml-stream: Stream logs from running jobs |
| 57 | + using Python SDK with account key auth (works around DefaultAzureCredential permission issues) - |
| 58 | + azure-ml-cost: Track compute instance uptime and estimated costs - azure-ml-teardown: Cancel jobs |
| 59 | + and delete compute instances |
| 60 | + |
| 61 | +Also improves: - azure-ml-quota: Shows both ML Dedicated quota (what Azure ML actually uses) and |
| 62 | + regular VM quota - Better error handling and logging throughout |
| 63 | + |
| 64 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 65 | + |
| 66 | +- **cli**: Add Azure ML status, VNC, and monitor commands |
| 67 | + ([`7985cff`](https://github.com/OpenAdaptAI/openadapt-ml/commit/7985cff95cc50d0313b3f3cb8ff5a1a1de039a71)) |
| 68 | + |
| 69 | +New commands for end-to-end Azure ML automation: - azure-ml-status: Show jobs and compute instances |
| 70 | + - azure-ml-vnc: Set up VNC tunnel to compute instance - azure-ml-monitor: Monitor jobs with auto |
| 71 | + VNC setup |
| 72 | + |
| 73 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 74 | + |
| 75 | +- **cli**: Add azure-ml-quota command for quota management |
| 76 | + ([`eecb3a4`](https://github.com/OpenAdaptAI/openadapt-ml/commit/eecb3a461c2b5f6882755dab52f2cfb0c9a9616a)) |
| 77 | + |
| 78 | +Semi-automated quota increase workflow: - Checks current quota for WAA-compatible VM families - |
| 79 | + Shows which families have sufficient quota - Opens Azure Portal quota page with instructions - |
| 80 | + Guides user through the request process |
| 81 | + |
| 82 | +Usage: uv run python -m openadapt_ml.benchmarks.cli azure-ml-quota |
| 83 | + |
| 84 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 85 | + |
| 86 | +- **cli**: Add multi-VM pool commands for parallel WAA evaluation |
| 87 | + ([`005664a`](https://github.com/OpenAdaptAI/openadapt-ml/commit/005664ab0ec6c425ac9ebe1af19b23e552b9bf90)) |
| 88 | + |
| 89 | +Add pool-create, pool-wait, and pool-run commands for running WAA benchmarks across multiple VMs in |
| 90 | + parallel: |
| 91 | + |
| 92 | +- pool-create --workers N: Create N VMs with Docker and WAA image - Parallel VM creation using |
| 93 | + ThreadPoolExecutor - Auto-selects available region and VM size - Configures Docker with /mnt |
| 94 | + storage - Registers pool for tracking |
| 95 | + |
| 96 | +- pool-wait: Wait for WAA to be ready on all workers - Starts WAA containers on each worker - Polls |
| 97 | + /probe endpoint until ready - Configurable timeout |
| 98 | + |
| 99 | +- pool-run --tasks N: Distribute tasks across pool - Round-robin task distribution - Parallel |
| 100 | + execution on all workers - Progress tracking in registry |
| 101 | + |
| 102 | +This enables ~5x faster benchmark completion with 5 workers, or full 154-task evaluation in ~10min |
| 103 | + with 10+ workers. |
| 104 | + |
| 105 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 106 | + |
| 107 | +### Refactoring |
| 108 | + |
| 109 | +- **waa**: Update submodule with SDK v2 migration |
| 110 | + ([`5080ad6`](https://github.com/OpenAdaptAI/openadapt-ml/commit/5080ad697e88ff297dfbb14f2c0756f53ebfd496)) |
| 111 | + |
| 112 | +Updates WindowsAgentArena submodule to include Azure ML SDK v2 migration that enables job submission |
| 113 | + from macOS ARM64. |
| 114 | + |
| 115 | +Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
| 116 | + |
| 117 | + |
4 | 118 | ## v0.2.2 (2026-01-29) |
5 | 119 |
|
6 | 120 | ### Bug Fixes |
|
0 commit comments