Skip to content

update rl_utils_test to use values in config

084c8ee
Select commit
Loading
Failed to load commit list.
Closed

log lr hyperparams; add exact match rewards; fix qwen3-base configs; use user/system parts in prompts #3417

update rl_utils_test to use values in config
084c8ee
Select commit
Loading
Failed to load commit list.