You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Raise UnsupportedFeatureError for FP8 on sm80 family
- Relax tf32 mma test tolerance for sm80 family
- Fix rmsnorm kernel to add zero padding when load out of bound
- Reduce tile size on sm80 family for persistent rmsnorm benchmark
Signed-off-by: Jay Gu <jagu@nvidia.com>
0 commit comments