You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add rdnaWaves occupancy filter to greedy tuning phases 1 and 2 (#2311)
Greedy tuning for attention kernels on RDNA was missing the rdnaWaves
filter that exhaustive tuning already has, causing a massive search
space explosion (~8k configs/problem vs ~300-600 in exhaustive).
This made greedy ~13x slower than exhaustive on RDNA targets.
0 commit comments