
Commit 373a0d3

fix(infra): update GCP runner to v2.332.0 and resolve rebase import conflicts
Runner 2.317.0 does not support node24, which actions/checkout@v6 requires, causing 'Set up job' failures on dynamically provisioned GCP instances.

Key changes:
- Bump RUNNER_VERSION from 2.317.0 to 2.332.0 in test-areal.yml and build-docker-image.yml
- Remove duplicate imports in test_rollout_controller.py introduced by the rebase with PR #996
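The version gate described in the commit message can be sketched as a tuple comparison. This is a minimal illustration and not part of the commit: `parse_version` and `meets_minimum` are hypothetical helpers, and the threshold used here is simply the version this commit pins, not a documented minimum for node24 support.

```python
def parse_version(text: str) -> tuple[int, ...]:
    """Turn a dotted version such as '2.332.0' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """Return True when `installed` is at least `minimum`."""
    return parse_version(installed) >= parse_version(minimum)


OLD_RUNNER = "2.317.0"  # value removed by this commit
NEW_RUNNER = "2.332.0"  # value pinned by this commit

# Assumption: NEW_RUNNER is used as the threshold purely for illustration.
old_ok = meets_minimum(OLD_RUNNER, NEW_RUNNER)
new_ok = meets_minimum(NEW_RUNNER, NEW_RUNNER)
```

Comparing integer tuples rather than raw strings avoids lexicographic surprises, for example a string comparison would rank '2.99.0' above '2.317.0'.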
1 parent 6845f62 commit 373a0d3

11 files changed

Lines changed: 37 additions & 456 deletions

.github/workflows/build-docker-image.yml

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@ permissions:
 env:
   GCP_PROJECT_ID: ${{ secrets.GCP_PROJECT_ID }}
   VALIDATOR_LABELS: gcp-docker-validator
-  RUNNER_VERSION: '2.317.0'
+  RUNNER_VERSION: '2.332.0'
   IMAGE_NAME: ghcr.io/inclusionai/areal-runtime
   IMAGE_TAG: test
 

.github/workflows/test-areal.yml

Lines changed: 1 addition & 1 deletion
@@ -39,7 +39,7 @@ concurrency:
 
 env:
   GCP_PROJECT_ID: ${{ secrets.GCP_PROJECT_ID }}
-  RUNNER_VERSION: '2.317.0'
+  RUNNER_VERSION: '2.332.0'
   GCP_OS_IMAGE: areal-cicd-test-202602030
 
 jobs:
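Because the same RUNNER_VERSION pin lives in two workflow files, a small consistency check can catch a bump that touches only one of them. The sketch below is an illustration, not part of the commit: `pinned_versions` is a hypothetical helper, and the workflow text is inlined as short excerpts rather than read from `.github/workflows/`.

```python
import re

# Matches a line such as "  RUNNER_VERSION: '2.332.0'" (quotes optional).
RUNNER_RE = re.compile(
    r"^\s*RUNNER_VERSION:\s*['\"]?([0-9.]+)['\"]?\s*$", re.MULTILINE
)


def pinned_versions(workflows: dict[str, str]) -> dict[str, str]:
    """Map each workflow name to the RUNNER_VERSION it pins, if any."""
    found = {}
    for name, text in workflows.items():
        match = RUNNER_RE.search(text)
        if match:
            found[name] = match.group(1)
    return found


# Inline excerpts of the two env blocks touched by this commit.
workflows = {
    "build-docker-image.yml": (
        "env:\n"
        "  VALIDATOR_LABELS: gcp-docker-validator\n"
        "  RUNNER_VERSION: '2.332.0'\n"
    ),
    "test-areal.yml": (
        "env:\n"
        "  RUNNER_VERSION: '2.332.0'\n"
        "  GCP_OS_IMAGE: areal-cicd-test-202602030\n"
    ),
}

versions = pinned_versions(workflows)
in_sync = len(set(versions.values())) == 1
```

In a real check the dictionary would be built by reading the workflow files from disk; the regex approach is chosen here only to keep the sketch free of a YAML dependency.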
Lines changed: 9 additions & 0 deletions
@@ -1,5 +1,9 @@
+# Unified GRPO test config.
+# Backend-specific values (allocation_mode, optimizer) are overridden via CLI.
 experiment_name: tests-grpo
 trial_name: trial
+scheduler:
+  type: local
 cluster:
   n_nodes: 1
   n_gpus_per_node: 2
@@ -33,6 +37,10 @@ recover:
 sglang:
   model_path: ${actor.path}
   random_seed: ${seed}
+vllm:
+  model: ${actor.path}
+  seed: ${seed}
+  skip_tokenizer_init: false
 gconfig:
   n_samples: 4
   max_new_tokens: 1024
@@ -42,6 +50,7 @@ rollout:
   trial_name: ${trial_name}
   consumer_batch_size: ${train_dataset.batch_size}
   scheduling_spec: ${actor.scheduling_spec}
+  max_head_offpolicyness: 2
 
 actor:
   experiment_name: ${experiment_name}
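The header comment in the unified config says backend-specific values are overridden via CLI. A plain-Python sketch of how dotted `key=value` overrides could be layered onto a base config is shown below. It is not the project's actual config loader: `apply_overrides` is a hypothetical helper, and the override keys and values are placeholders, since the diff does not show the real CLI arguments.

```python
import copy


def apply_overrides(config: dict, overrides: list[str]) -> dict:
    """Return a copy of `config` with dotted `key=value` overrides applied."""
    result = copy.deepcopy(config)
    for item in overrides:
        dotted_key, _, raw_value = item.partition("=")
        *parents, leaf = dotted_key.split(".")
        node = result
        for part in parents:
            # Create intermediate sections that the base config lacks.
            node = node.setdefault(part, {})
        node[leaf] = raw_value  # values are kept as strings in this sketch
    return result


# A small subset of the unified config shown in the diff.
base = {
    "experiment_name": "tests-grpo",
    "scheduler": {"type": "local"},
    "rollout": {"max_head_offpolicyness": 2},
}

# Hypothetical override strings; the real key paths and values are assumptions.
merged = apply_overrides(
    base,
    ["allocation_mode=example-mode", "actor.optimizer.type=example-optimizer"],
)
```

Keeping one base file and layering per-backend overrides on top is what lets the four backend-specific config files be deleted in this commit without losing test coverage of those backends, assuming each test passes its own overrides.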

tests/grpo/config_archon.yaml

Lines changed: 0 additions & 83 deletions
This file was deleted.

tests/grpo/config_archon_vllm.yaml

Lines changed: 0 additions & 86 deletions
This file was deleted.

tests/grpo/config_fsdp_vllm.yaml

Lines changed: 0 additions & 86 deletions
This file was deleted.

tests/grpo/config_megatron.yaml

Lines changed: 0 additions & 83 deletions
This file was deleted.
