You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: ajet/default_config/ajet_default.yaml
+11-2Lines changed: 11 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -293,7 +293,17 @@ ajet:
293
293
max_fastapi_threads: 512# 64 or 128 is fine
294
294
max_inference_tracker_threads: 64# recommend to be equal to `ajet.rollout.max_env_worker`
295
295
already_started: False # do not edit, used by `swarm`
296
-
296
+
# what is the stop condition for swarm mode sample collection
297
+
# "rollout_until_finish_enough_episodes":
298
+
# AgentJet simply counts the number of completed episodes, and stop when it has collected [>= (ajet.data.train_batch_size * ajet.rollout.num_repeat)] samples
299
+
# "rollout_until_finish_enough_tasks":
300
+
# AgentJet will identify the **task_id** of each episode, and stop when it has collected [>= ajet.data.train_batch_size] unique & FINISHED **task_id**.
301
+
# (Hint: a **task_id** is considered "FINISHED" when [>= ajet.rollout.num_repeat] episodes of this **task_id** have been completed.)
302
+
# "rollout_until_finish_enough_non_dummy_tasks":
303
+
# AgentJet will identify the **task_id** of each episode, and stop when it has collected [>= ajet.data.train_batch_size] unique & FINISHED & NON-DUMMY **task_id**.
304
+
# (Hint: a **task_id** is considered "NON-DUMMY" at least one of **episodes** of **task_id** has **different** reward value.)
Copy file name to clipboardExpand all lines: ajet/default_config/ajet_ts_default.yaml
+11Lines changed: 11 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -33,6 +33,17 @@ ajet:
33
33
max_inference_tracker_threads: 64# recommend to be equal to `ajet.rollout.max_env_worker`
34
34
already_started: False # do not edit, used by `swarm`
35
35
36
+
# the method to determine when to stop rollout in swarm mode. Options:
37
+
# "rollout_until_finish_enough_episodes":
38
+
# AgentJet simply counts the number of completed episodes, and stop when it has collected [>= (ajet.data.train_batch_size * ajet.rollout.num_repeat)] samples
39
+
# "rollout_until_finish_enough_tasks":
40
+
# AgentJet will identify the **task_id** of each episode, and stop when it has collected [>= ajet.data.train_batch_size] unique & FINISHED **task_id**.
41
+
# (Hint: a **task_id** is considered "FINISHED" when [>= ajet.rollout.num_repeat] episodes of this **task_id** have been completed.)
42
+
# "rollout_until_finish_enough_non_dummy_tasks":
43
+
# AgentJet will identify the **task_id** of each episode, and stop when it has collected [>= ajet.data.train_batch_size] unique & FINISHED & NON-DUMMY **task_id**.
44
+
# (Hint: a **task_id** is considered "NON-DUMMY" at least one of **episodes** of **task_id** has **different** reward value.)
0 commit comments