Commit af088ae
committed
Add --no_transposed_cache CLI flag for export pipeline
Add CLI argument to control transposed KV cache layout during export.
By default transposed cache is used (is_seq_at_dim_2=True). Pass
--no_transposed_cache to disable it for baseline comparison.
Differential Revision: [D99677684](https://our.internmc.facebook.com/intern/diff/D99677684/)
[ghstack-poisoned]1 parent 6f31ee3 commit af088ae
4 files changed
Lines changed: 95 additions & 44 deletions
File tree
- examples/models/llama
- source_transformation
- extension/llm/export/config
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
334 | 334 | | |
335 | 335 | | |
336 | 336 | | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
337 | 352 | | |
338 | 353 | | |
339 | 354 | | |
| |||
766 | 781 | | |
767 | 782 | | |
768 | 783 | | |
| 784 | + | |
769 | 785 | | |
770 | 786 | | |
771 | 787 | | |
| |||
1605 | 1621 | | |
1606 | 1622 | | |
1607 | 1623 | | |
| 1624 | + | |
1608 | 1625 | | |
1609 | 1626 | | |
1610 | 1627 | | |
| |||
1642 | 1659 | | |
1643 | 1660 | | |
1644 | 1661 | | |
| 1662 | + | |
1645 | 1663 | | |
1646 | 1664 | | |
1647 | 1665 | | |
| |||
1737 | 1755 | | |
1738 | 1756 | | |
1739 | 1757 | | |
| 1758 | + | |
| 1759 | + | |
| 1760 | + | |
| 1761 | + | |
| 1762 | + | |
| 1763 | + | |
| 1764 | + | |
| 1765 | + | |
| 1766 | + | |
| 1767 | + | |
| 1768 | + | |
| 1769 | + | |
1740 | 1770 | | |
1741 | | - | |
| 1771 | + | |
1742 | 1772 | | |
1743 | | - | |
1744 | | - | |
1745 | | - | |
1746 | 1773 | | |
1747 | 1774 | | |
1748 | | - | |
| 1775 | + | |
1749 | 1776 | | |
1750 | 1777 | | |
1751 | 1778 | | |
1752 | | - | |
| 1779 | + | |
1753 | 1780 | | |
1754 | 1781 | | |
1755 | 1782 | | |
| |||
Lines changed: 40 additions & 29 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
334 | 334 | | |
335 | 335 | | |
336 | 336 | | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
337 | 345 | | |
338 | 346 | | |
339 | 347 | | |
340 | 348 | | |
341 | 349 | | |
342 | 350 | | |
343 | 351 | | |
344 | | - | |
| 352 | + | |
| 353 | + | |
345 | 354 | | |
346 | | - | |
347 | 355 | | |
| 356 | + | |
| 357 | + | |
348 | 358 | | |
349 | | - | |
350 | | - | |
351 | | - | |
352 | | - | |
353 | | - | |
354 | 359 | | |
355 | 360 | | |
356 | 361 | | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
357 | 365 | | |
358 | | - | |
| 366 | + | |
359 | 367 | | |
360 | 368 | | |
361 | | - | |
| 369 | + | |
362 | 370 | | |
363 | 371 | | |
364 | 372 | | |
| |||
368 | 376 | | |
369 | 377 | | |
370 | 378 | | |
371 | | - | |
372 | | - | |
373 | | - | |
374 | | - | |
| 379 | + | |
375 | 380 | | |
376 | 381 | | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
377 | 386 | | |
378 | 387 | | |
379 | | - | |
| 388 | + | |
380 | 389 | | |
381 | 390 | | |
382 | | - | |
| 391 | + | |
383 | 392 | | |
384 | 393 | | |
385 | | - | |
386 | | - | |
| 394 | + | |
| 395 | + | |
387 | 396 | | |
388 | | - | |
389 | | - | |
390 | | - | |
391 | | - | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
392 | 401 | | |
393 | 402 | | |
394 | | - | |
| 403 | + | |
395 | 404 | | |
396 | 405 | | |
397 | | - | |
398 | | - | |
399 | | - | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
400 | 410 | | |
401 | 411 | | |
402 | 412 | | |
403 | 413 | | |
404 | | - | |
| 414 | + | |
405 | 415 | | |
406 | 416 | | |
407 | | - | |
| 417 | + | |
408 | 418 | | |
409 | 419 | | |
410 | 420 | | |
| |||
421 | 431 | | |
422 | 432 | | |
423 | 433 | | |
424 | | - | |
| 434 | + | |
| 435 | + | |
425 | 436 | | |
426 | 437 | | |
427 | 438 | | |
428 | | - | |
| 439 | + | |
429 | 440 | | |
430 | 441 | | |
431 | 442 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
| 31 | + | |
| 32 | + | |
31 | 33 | | |
32 | 34 | | |
33 | 35 | | |
| |||
40 | 42 | | |
41 | 43 | | |
42 | 44 | | |
| 45 | + | |
| 46 | + | |
43 | 47 | | |
44 | | - | |
| 48 | + | |
45 | 49 | | |
46 | 50 | | |
47 | 51 | | |
48 | | - | |
49 | | - | |
50 | 52 | | |
51 | 53 | | |
52 | 54 | | |
| |||
58 | 60 | | |
59 | 61 | | |
60 | 62 | | |
61 | | - | |
62 | | - | |
63 | | - | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
64 | 66 | | |
65 | 67 | | |
66 | 68 | | |
| |||
70 | 72 | | |
71 | 73 | | |
72 | 74 | | |
73 | | - | |
74 | | - | |
75 | | - | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
76 | 78 | | |
77 | 79 | | |
78 | 80 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
178 | 178 | | |
179 | 179 | | |
180 | 180 | | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
181 | 187 | | |
182 | 188 | | |
183 | 189 | | |
| |||
199 | 205 | | |
200 | 206 | | |
201 | 207 | | |
| 208 | + | |
202 | 209 | | |
203 | 210 | | |
204 | 211 | | |
| |||
686 | 693 | | |
687 | 694 | | |
688 | 695 | | |
| 696 | + | |
| 697 | + | |
| 698 | + | |
| 699 | + | |
689 | 700 | | |
690 | 701 | | |
691 | 702 | | |
| |||
0 commit comments