log lr hyperparams; add exact match rewards; fix qwen3-base configs; use user/system parts in prompts #3417

Closed
andytwigg wants to merge 13 commits into main from atwigg/log_lr_hyperparam