Commit d522412
committed
[None][feat] reuse triton slicing kernel for GDN prefill transpose
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>1 parent 1045f38 commit d522412
File tree
2 files changed
+50
-16
lines changed- tensorrt_llm/_torch/modules/mamba
2 files changed
+50
-16
lines changedLines changed: 31 additions & 11 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
54 | 84 | | |
55 | 85 | | |
56 | 86 | | |
| |||
63 | 93 | | |
64 | 94 | | |
65 | 95 | | |
66 | | - | |
67 | | - | |
68 | | - | |
69 | | - | |
70 | | - | |
71 | | - | |
| 96 | + | |
72 | 97 | | |
73 | | - | |
74 | 98 | | |
75 | | - | |
76 | 99 | | |
77 | 100 | | |
78 | | - | |
79 | | - | |
80 | 101 | | |
81 | | - | |
82 | 102 | | |
83 | 103 | | |
84 | 104 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
| 31 | + | |
31 | 32 | | |
32 | 33 | | |
33 | 34 | | |
| |||
544 | 545 | | |
545 | 546 | | |
546 | 547 | | |
547 | | - | |
548 | | - | |
| 548 | + | |
| 549 | + | |
| 550 | + | |
| 551 | + | |
| 552 | + | |
| 553 | + | |
| 554 | + | |
| 555 | + | |
549 | 556 | | |
550 | 557 | | |
551 | 558 | | |
552 | 559 | | |
553 | 560 | | |
554 | 561 | | |
555 | 562 | | |
556 | | - | |
| 563 | + | |
557 | 564 | | |
558 | 565 | | |
559 | 566 | | |
| |||
588 | 595 | | |
589 | 596 | | |
590 | 597 | | |
| 598 | + | |
591 | 599 | | |
592 | 600 | | |
| 601 | + | |
| 602 | + | |
| 603 | + | |
| 604 | + | |
| 605 | + | |
| 606 | + | |
593 | 607 | | |
594 | | - | |
| 608 | + | |
595 | 609 | | |
596 | 610 | | |
597 | 611 | | |
598 | 612 | | |
599 | 613 | | |
600 | 614 | | |
601 | 615 | | |
602 | | - | |
| 616 | + | |
603 | 617 | | |
604 | 618 | | |
605 | 619 | | |
| |||
0 commit comments