Skip to content

Add open-r1/OpenR1-Math-220k dataset and nvidia/OpenMathReasoning to RL and fix reward function#3629

Merged
copybara-service[bot] merged 1 commit intomainfrom
rl-debug
Apr 23, 2026
Merged

Add open-r1/OpenR1-Math-220k dataset and nvidia/OpenMathReasoning to RL and fix reward function#3629
copybara-service[bot] merged 1 commit intomainfrom
rl-debug

Commits

Commits on Apr 23, 2026