Add deterministic reversals curriculum#545

Merged
bruno-f-cruz merged 10 commits into main from feat-add-deterministic-reversal-curriculum
May 22, 2026

Conversation

@tiffanyona
Collaborator

Curriculum for deterministic reversals

Use MarkovEnvironment (renamed from EnvironmentStatistics) after PR #547
@bruno-f-cruz force-pushed the feat-add-deterministic-reversal-curriculum branch from aaa042d to f544db3 on May 19, 2026 20:59
@bruno-f-cruz force-pushed the feat-add-deterministic-reversal-curriculum branch from 431afca to feebca6 on May 19, 2026 21:06
@bruno-f-cruz changed the title from "Feat add deterministic reversal curriculum" to "Add deterministic reversals curriculum" on May 19, 2026
@bruno-f-cruz
Member

@tiffanyona can you confirm you get identical results after the changes I made?

@tiffanyona
Collaborator Author

I will check it in a bit

Member

@bruno-f-cruz bruno-f-cruz left a comment

Small question about getting rid of a potentially unnecessary reward function. Not sure if this is correct, but worth checking.

rule=task_logic.RewardFunctionRule.ON_REWARD,
)
reset_function = task_logic.OnThisPatchEntryRewardFunction(
probability=task_logic.SetValueFunction(value=task_logic.scalar_value(1)),
Member

Do you really need this? Since the rule is on_choice, I think this is redundant.

else:
reward_available = 100
reset_function = task_logic.OnThisPatchEntryRewardFunction(
probability=task_logic.SetValueFunction(value=task_logic.scalar_value(1)),
Member

Same as above

Collaborator Author

I never tried it without this, so I'm not sure of the behavior.

Member

OK, I will leave it for now since it helps in understanding the logic anyway.

@bruno-f-cruz merged commit 1f43a8d into main on May 22, 2026
4 checks passed
@bruno-f-cruz deleted the feat-add-deterministic-reversal-curriculum branch on May 22, 2026 17:36
2 participants