fix(opd): score teacher logprobs at rollout temperature, not 0#2085
Open
EazyReal wants to merge 2 commits into
Open
fix(opd): score teacher logprobs at rollout temperature, not 0#2085EazyReal wants to merge 2 commits into
EazyReal wants to merge 2 commits into