Consolidate Gemma RMSNorm +1 offset into single block after load
Address review feedback (shewu-quic): move the Gemma norm weight
adjustment out of both if/else branches into a single block that
runs regardless of checkpoint source. Also fix the misleading comment
and simplify torch.ones() to scalar + 1.0.
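
As a rough illustration of the refactor described above (not the actual diff), the sketch below loads weights from one of two sources and then applies the +1 offset in a single block. It assumes the stored Gemma norm weights are offsets from 1 and that the target RMSNorm multiplies by the weight directly. Plain Python lists stand in for torch tensors, and `load_weights`, `apply_gemma_norm_offset`, `load_gemma`, the key names, and the `norm.weight` suffix are all hypothetical.

```python
# Sketch only: lists stand in for torch tensors; every name here is hypothetical.

NORM_SUFFIX = "norm.weight"  # assumed naming convention for RMSNorm weight keys


def load_weights(use_converted_checkpoint):
    """Stand-in for the two checkpoint sources; both return unadjusted weights."""
    if use_converted_checkpoint:
        return {
            "layers.0.input_layernorm.weight": [0.5, -0.25],
            "layers.0.q_proj.weight": [2.0],
        }
    return {
        "layers.0.input_layernorm.weight": [0.0, 0.0],
        "layers.0.q_proj.weight": [1.0],
    }


def apply_gemma_norm_offset(state_dict):
    """Return a copy in which every norm weight has the scalar 1.0 added."""
    adjusted = {}
    for name, values in state_dict.items():
        if name.endswith(NORM_SUFFIX):
            # Scalar add; with tensors this would be `weight + 1.0`,
            # with no need to build a torch.ones() tensor first.
            adjusted[name] = [v + 1.0 for v in values]
        else:
            adjusted[name] = list(values)
    return adjusted


def load_gemma(use_converted_checkpoint):
    state_dict = load_weights(use_converted_checkpoint)
    # Single adjustment block: runs regardless of which source produced the weights.
    return apply_gemma_norm_offset(state_dict)
```

Keeping the adjustment outside the branches means a new checkpoint source cannot silently skip it, and only keys matching the norm suffix are touched.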