Commit ef4ff46
[rocm-libraries] ROCm/rocm-libraries#5842 (commit 04c5690)
[CK][CK Tile] Force padding for atomic_add bf16 C tensor
(#5842)
## Motivation
Force padding for atomic_add bf16 C tensor to avoid memfaults.
## Technical Details
- add global atomic add for bf16 and enable them
- add padding for atomic add bf16 due to the lack of oob
- remove padding for not continous dims in conv for other cases
- minor bwd data conv fixes
## Test Plan
test_grouped_conv_*_tile
## Test Result
pending
## Submission Checklist
- [x] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.1 parent 66dc81d commit ef4ff46
7 files changed
Lines changed: 174 additions & 171 deletions
File tree
- include/ck_tile
- core
- arch
- tensor
- ops
- gemm/kernel
- grouped_convolution/kernel
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
21 | 25 | | |
22 | 26 | | |
23 | 27 | | |
| |||
2317 | 2321 | | |
2318 | 2322 | | |
2319 | 2323 | | |
| 2324 | + | |
| 2325 | + | |
| 2326 | + | |
| 2327 | + | |
| 2328 | + | |
| 2329 | + | |
| 2330 | + | |
| 2331 | + | |
| 2332 | + | |
| 2333 | + | |
| 2334 | + | |
| 2335 | + | |
| 2336 | + | |
| 2337 | + | |
| 2338 | + | |
| 2339 | + | |
| 2340 | + | |
| 2341 | + | |
| 2342 | + | |
| 2343 | + | |
| 2344 | + | |
| 2345 | + | |
| 2346 | + | |
| 2347 | + | |
| 2348 | + | |
| 2349 | + | |
| 2350 | + | |
| 2351 | + | |
2320 | 2352 | | |
2321 | 2353 | | |
2322 | 2354 | | |
| |||
2325 | 2357 | | |
2326 | 2358 | | |
2327 | 2359 | | |
2328 | | - | |
2329 | | - | |
| 2360 | + | |
| 2361 | + | |
| 2362 | + | |
| 2363 | + | |
| 2364 | + | |
2330 | 2365 | | |
2331 | 2366 | | |
2332 | 2367 | | |
| |||
2931 | 2966 | | |
2932 | 2967 | | |
2933 | 2968 | | |
2934 | | - | |
2935 | | - | |
2936 | | - | |
2937 | | - | |
| 2969 | + | |
| 2970 | + | |
| 2971 | + | |
| 2972 | + | |
| 2973 | + | |
| 2974 | + | |
| 2975 | + | |
| 2976 | + | |
| 2977 | + | |
| 2978 | + | |
| 2979 | + | |
| 2980 | + | |
| 2981 | + | |
| 2982 | + | |
2938 | 2983 | | |
| 2984 | + | |
2939 | 2985 | | |
2940 | | - | |
| 2986 | + | |
2941 | 2987 | | |
2942 | | - | |
2943 | | - | |
| 2988 | + | |
| 2989 | + | |
2944 | 2990 | | |
2945 | 2991 | | |
2946 | 2992 | | |
2947 | 2993 | | |
2948 | 2994 | | |
2949 | 2995 | | |
2950 | 2996 | | |
| 2997 | + | |
| 2998 | + | |
| 2999 | + | |
2951 | 3000 | | |
2952 | 3001 | | |
2953 | 3002 | | |
| |||
Lines changed: 57 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
21 | 25 | | |
22 | 26 | | |
23 | 27 | | |
| |||
2143 | 2147 | | |
2144 | 2148 | | |
2145 | 2149 | | |
| 2150 | + | |
| 2151 | + | |
| 2152 | + | |
| 2153 | + | |
| 2154 | + | |
| 2155 | + | |
| 2156 | + | |
| 2157 | + | |
| 2158 | + | |
| 2159 | + | |
| 2160 | + | |
| 2161 | + | |
| 2162 | + | |
| 2163 | + | |
| 2164 | + | |
| 2165 | + | |
| 2166 | + | |
| 2167 | + | |
| 2168 | + | |
| 2169 | + | |
| 2170 | + | |
| 2171 | + | |
| 2172 | + | |
| 2173 | + | |
| 2174 | + | |
| 2175 | + | |
| 2176 | + | |
2146 | 2177 | | |
2147 | 2178 | | |
2148 | 2179 | | |
| |||
2151 | 2182 | | |
2152 | 2183 | | |
2153 | 2184 | | |
2154 | | - | |
2155 | | - | |
| 2185 | + | |
| 2186 | + | |
| 2187 | + | |
| 2188 | + | |
| 2189 | + | |
2156 | 2190 | | |
2157 | 2191 | | |
2158 | 2192 | | |
| |||
2759 | 2793 | | |
2760 | 2794 | | |
2761 | 2795 | | |
2762 | | - | |
2763 | | - | |
| 2796 | + | |
| 2797 | + | |
| 2798 | + | |
| 2799 | + | |
| 2800 | + | |
| 2801 | + | |
| 2802 | + | |
| 2803 | + | |
| 2804 | + | |
| 2805 | + | |
| 2806 | + | |
| 2807 | + | |
| 2808 | + | |
| 2809 | + | |
2764 | 2810 | | |
2765 | | - | |
| 2811 | + | |
2766 | 2812 | | |
2767 | 2813 | | |
2768 | | - | |
| 2814 | + | |
2769 | 2815 | | |
2770 | | - | |
2771 | | - | |
| 2816 | + | |
| 2817 | + | |
2772 | 2818 | | |
2773 | 2819 | | |
2774 | 2820 | | |
2775 | 2821 | | |
2776 | 2822 | | |
2777 | 2823 | | |
2778 | 2824 | | |
| 2825 | + | |
| 2826 | + | |
| 2827 | + | |
2779 | 2828 | | |
2780 | 2829 | | |
2781 | 2830 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
630 | 630 | | |
631 | 631 | | |
632 | 632 | | |
633 | | - | |
| 633 | + | |
634 | 634 | | |
635 | 635 | | |
636 | 636 | | |
| |||
642 | 642 | | |
643 | 643 | | |
644 | 644 | | |
645 | | - | |
| 645 | + | |
646 | 646 | | |
647 | 647 | | |
648 | 648 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1021 | 1021 | | |
1022 | 1022 | | |
1023 | 1023 | | |
| 1024 | + | |
| 1025 | + | |
| 1026 | + | |
| 1027 | + | |
| 1028 | + | |
1024 | 1029 | | |
1025 | 1030 | | |
1026 | 1031 | | |
1027 | 1032 | | |
1028 | 1033 | | |
1029 | 1034 | | |
1030 | 1035 | | |
1031 | | - | |
| 1036 | + | |
1032 | 1037 | | |
1033 | 1038 | | |
1034 | 1039 | | |
1035 | 1040 | | |
1036 | 1041 | | |
1037 | 1042 | | |
1038 | | - | |
| 1043 | + | |
1039 | 1044 | | |
1040 | 1045 | | |
1041 | 1046 | | |
| |||
0 commit comments