Commit 514ed5c
authored
* [Op][Optimization]Kernel fusion: cast+sigmoid+bias+noauxtc (#7777)
[Cherry-Pick][Op][Optimization]Kernel fusion: cast+sigmoid+bias+noauxtc (#7777)
* Bug fixes and modifications to the fused kernel switch.
* fix replicated env args
1 parent d71bdda commit 514ed5c
7 files changed
Lines changed: 1319 additions & 21 deletions
File tree
- custom_ops
- gpu_ops
- fastdeploy
- engine
- model_executor/layers/moe
- tests/operators
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
691 | 691 | | |
692 | 692 | | |
693 | 693 | | |
| 694 | + | |
| 695 | + | |
| 696 | + | |
| 697 | + | |
| 698 | + | |
| 699 | + | |
| 700 | + | |
| 701 | + | |
| 702 | + | |
694 | 703 | | |
695 | 704 | | |
696 | 705 | | |
| |||
1704 | 1713 | | |
1705 | 1714 | | |
1706 | 1715 | | |
| 1716 | + | |
| 1717 | + | |
1707 | 1718 | | |
1708 | 1719 | | |
1709 | 1720 | | |
| |||
0 commit comments