Skip to content

Commit be1f694

Browse files
committed
Rename attention to scaled_dot_product_attention and separate the kernels into modular packages
1 parent a33a14c commit be1f694

7 files changed

Lines changed: 398 additions & 383 deletions

File tree

attention.py

Lines changed: 0 additions & 382 deletions
This file was deleted.

infer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@
22

33
from transformers import AutoModelForCausalLM, AutoTokenizer
44

5-
from attention import Attention
65
from fused_rms_norm import RMSNorm
76
from linear import Linear
7+
from scaled_dot_product_attention import Attention
88
from silu import SiLU
99
from utils import replace_module
1010

0 commit comments

Comments
 (0)