Commit 6aa9619
CPU fused kernel (#1804)
* add template to support more dtypes
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update cmake list
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix compile cpu
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* make different dtypes work
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* use bf16 on CPU
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix state2 dtype
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* remove torch
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* rm torch
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable float to bf16
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* rm dequantizeBlockwise4bitCpu
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable dequant 4bit kernel
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix dequantize
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* test
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* change input param
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix input param
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* split 8bit and 4bit
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix input params
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix input params
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable dequant4bit
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix reverse
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix dequant 4bit fallback path
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix fp4 dequant
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* rm _Float16
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* tmp codes
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable gemv
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* change to 4bit dequant
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix def
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix type
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix absmax dtype
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix type
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix compile and type
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable gemv
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix shape
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix lib name
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* debug
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable gemv 4bit bf16
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable avx512 check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix endif
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix def
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix position
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* rm duplicated func
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* rm useless code comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix out shape
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* add reverse format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* check avx512bf16
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix has_avx512bf16
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix absmax shape
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix compile
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix test_gemv
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* disable binsearch
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix lint
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix save
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
1 parent bd028b8 commit 6aa9619
File tree
10 files changed, +755 -81 lines changed
- bitsandbytes
  - autograd
  - backends/cpu
  - nn
- csrc
- tests