Commit 5eb8fff
ggml-webgpu: updated matrix-vector multiplication (ggml-org#21738)
* merged properly, but slow q3_k and q5_k with u32 indexing
* Start on new mat-vec
* New format float paths working
* Working q4_0
* Work on remaining legacy q-types
* port k-quants to new matvec
* remove old shader
* Remove old constants, format
* remove accidental file
---------
Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local>
Co-authored-by: Reese Levine <reeselevine1@gmail.com>1 parent f8ea065 commit 5eb8fff
4 files changed
Lines changed: 788 additions & 383 deletions
File tree
- ggml/src/ggml-webgpu
- wgsl-shaders
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | | - | |
52 | | - | |
53 | | - | |
54 | | - | |
55 | | - | |
56 | | - | |
57 | | - | |
58 | | - | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
59 | 50 | | |
60 | 51 | | |
61 | 52 | | |
| |||
78 | 69 | | |
79 | 70 | | |
80 | 71 | | |
| 72 | + | |
81 | 73 | | |
82 | 74 | | |
83 | 75 | | |
| |||
575 | 567 | | |
576 | 568 | | |
577 | 569 | | |
578 | | - | |
579 | 570 | | |
580 | 571 | | |
581 | 572 | | |
| |||
1326 | 1317 | | |
1327 | 1318 | | |
1328 | 1319 | | |
1329 | | - | |
| 1320 | + | |
1330 | 1321 | | |
1331 | 1322 | | |
1332 | 1323 | | |
| |||
1337 | 1328 | | |
1338 | 1329 | | |
1339 | 1330 | | |
1340 | | - | |
| 1331 | + | |
| 1332 | + | |
1341 | 1333 | | |
1342 | 1334 | | |
1343 | 1335 | | |
| |||
1386 | 1378 | | |
1387 | 1379 | | |
1388 | 1380 | | |
1389 | | - | |
1390 | 1381 | | |
1391 | 1382 | | |
1392 | 1383 | | |
1393 | | - | |
1394 | 1384 | | |
1395 | 1385 | | |
1396 | | - | |
1397 | 1386 | | |
1398 | 1387 | | |
1399 | 1388 | | |
1400 | 1389 | | |
1401 | | - | |
1402 | 1390 | | |
| 1391 | + | |
| 1392 | + | |
| 1393 | + | |
| 1394 | + | |
| 1395 | + | |
1403 | 1396 | | |
1404 | | - | |
| 1397 | + | |
1405 | 1398 | | |
1406 | 1399 | | |
1407 | | - | |
1408 | 1400 | | |
1409 | 1401 | | |
1410 | 1402 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
181 | 181 | | |
182 | 182 | | |
183 | 183 | | |
| 184 | + | |
184 | 185 | | |
185 | 186 | | |
186 | 187 | | |
| |||
1164 | 1165 | | |
1165 | 1166 | | |
1166 | 1167 | | |
1167 | | - | |
1168 | | - | |
1169 | | - | |
1170 | | - | |
1171 | 1168 | | |
1172 | 1169 | | |
1173 | | - | |
1174 | | - | |
| 1170 | + | |
| 1171 | + | |
| 1172 | + | |
1175 | 1173 | | |
1176 | 1174 | | |
1177 | 1175 | | |
| |||
1182 | 1180 | | |
1183 | 1181 | | |
1184 | 1182 | | |
1185 | | - | |
1186 | | - | |
1187 | | - | |
| 1183 | + | |
| 1184 | + | |
| 1185 | + | |
| 1186 | + | |
1188 | 1187 | | |
| 1188 | + | |
1189 | 1189 | | |
1190 | 1190 | | |
1191 | 1191 | | |
| |||
1287 | 1287 | | |
1288 | 1288 | | |
1289 | 1289 | | |
1290 | | - | |
| 1290 | + | |
| 1291 | + | |
1291 | 1292 | | |
1292 | 1293 | | |
1293 | 1294 | | |
| |||
3040 | 3041 | | |
3041 | 3042 | | |
3042 | 3043 | | |
| 3044 | + | |
| 3045 | + | |
3043 | 3046 | | |
3044 | 3047 | | |
3045 | 3048 | | |
| |||
3072 | 3075 | | |
3073 | 3076 | | |
3074 | 3077 | | |
3075 | | - | |
3076 | 3078 | | |
3077 | 3079 | | |
3078 | 3080 | | |
3079 | 3081 | | |
| 3082 | + | |
| 3083 | + | |
| 3084 | + | |
| 3085 | + | |
3080 | 3086 | | |
3081 | 3087 | | |
3082 | 3088 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
48 | 55 | | |
49 | 56 | | |
50 | 57 | | |
| |||
0 commit comments