Commit cf24e84
kv-cache : support attention rotation for heterogeneous iSWA (ggml-org#21513)
* kv-cache : support attention rotation for heterogeneous iSWA
* cont : remove assert
1 parent 115311f commit cf24e84
4 files changed (src): +58 -17 lines changed

| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
511 | 511 | | |
512 | 512 | | |
513 | 513 | | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
514 | 522 | | |
515 | 523 | | |
516 | 524 | | |
| |||
681 | 689 | | |
682 | 690 | | |
683 | 691 | | |
| 692 | + | |
| 693 | + | |
| 694 | + | |
| 695 | + | |
| 696 | + | |
| 697 | + | |
| 698 | + | |
| 699 | + | |
684 | 700 | | |
685 | 701 | | |
686 | 702 | | |
| |||
2328 | 2344 | | |
2329 | 2345 | | |
2330 | 2346 | | |
2331 | | - | |
2332 | | - | |
| 2347 | + | |
| 2348 | + | |
| 2349 | + | |
| 2350 | + | |
| 2351 | + | |
| 2352 | + | |
| 2353 | + | |
2333 | 2354 | | |
2334 | | - | |
| 2355 | + | |
2335 | 2356 | | |
2336 | 2357 | | |
2337 | | - | |
| 2358 | + | |
2338 | 2359 | | |
2339 | | - | |
| 2360 | + | |
2340 | 2361 | | |
2341 | 2362 | | |
2342 | 2363 | | |
| |||
2354 | 2375 | | |
2355 | 2376 | | |
2356 | 2377 | | |
2357 | | - | |
2358 | | - | |
2359 | 2378 | | |
2360 | 2379 | | |
2361 | 2380 | | |
| |||
2406 | 2425 | | |
2407 | 2426 | | |
2408 | 2427 | | |
2409 | | - | |
2410 | | - | |
| 2428 | + | |
| 2429 | + | |
2411 | 2430 | | |
2412 | 2431 | | |
2413 | 2432 | | |
| |||
2509 | 2528 | | |
2510 | 2529 | | |
2511 | 2530 | | |
| 2531 | + | |
| 2532 | + | |
| 2533 | + | |
2512 | 2534 | | |
2513 | 2535 | | |
2514 | 2536 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
308 | 308 | | |
309 | 309 | | |
310 | 310 | | |
311 | | - | |
| 311 | + | |
312 | 312 | | |
313 | 313 | | |
314 | 314 | | |
| |||
388 | 388 | | |
389 | 389 | | |
390 | 390 | | |
391 | | - | |
392 | 391 | | |
393 | 392 | | |
394 | 393 | | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
395 | 397 | | |
396 | 398 | | |
397 | 399 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
194 | 194 | | |
195 | 195 | | |
196 | 196 | | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
197 | 209 | | |
198 | 210 | | |
199 | 211 | | |
| |||
437 | 449 | | |
438 | 450 | | |
439 | 451 | | |
| 452 | + | |
440 | 453 | | |
441 | | - | |
442 | 454 | | |
443 | 455 | | |
444 | 456 | | |
445 | 457 | | |
| 458 | + | |
446 | 459 | | |
447 | | - | |
448 | 460 | | |
449 | 461 | | |
450 | | - | |
451 | | - | |
| 462 | + | |
| 463 | + | |
452 | 464 | | |
453 | 465 | | |
454 | 466 | | |
455 | 467 | | |
456 | | - | |
| 468 | + | |
457 | 469 | | |
458 | 470 | | |
459 | 471 | | |
| |||
1535 | 1547 | | |
1536 | 1548 | | |
1537 | 1549 | | |
1538 | | - | |
| 1550 | + | |
1539 | 1551 | | |
1540 | 1552 | | |
1541 | 1553 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
248 | 248 | | |
249 | 249 | | |
250 | 250 | | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
251 | 256 | | |
252 | 257 | | |
253 | 258 | | |
| |||