log lr hyperparams; add exact match rewards; fix qwen3-base configs; use user/system parts in prompts #3417
Google CLA / cla/google
succeeded
Mar 16, 2026 in 13s
✅ All contributors are covered under a CLA with Google
See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).
ℹ️ Googlers: Go here to view more details and manage scans for this pull request.
Details
The following contributors were found for this pull request:
✅ 084c8ee Author: @andytwigg <and*****gg@gmail.com>, <at***g@google.com>
(Only the first commit for a unique contributor is listed.)
Loading