log lr hyperparams; add exact match rewards; fix qwen3-base configs; use user/system parts in prompts#3417
Closed
log lr hyperparams; add exact match rewards; fix qwen3-base configs; use user/system parts in prompts#3417
Commits
Commits on Mar 14, 2026
- committed
- committed
- committed
- committed
- authored
- committed
- committed
- committed
- committed