-
Notifications
You must be signed in to change notification settings - Fork 133
Add torch scaled dot product attention (FlashAttention) #1798
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
dorian-K
wants to merge
38
commits into
master
Choose a base branch
from
doriank-sdpa
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from all commits
Commits
Show all changes
38 commits
Select commit
Hold shift + click to select a range
b814a5f
WIP scaled dot product attention
dorian-K b69bbfd
wip
dorian-K 1194018
wip
dorian-K 9a2cae2
wip
dorian-K e5f636c
fix formatting
dorian-K a0627ec
more
dorian-K 90316e9
more
dorian-K b91ea9d
fix tests
dorian-K 54beb8f
more
dorian-K f9adbf3
more
dorian-K eada973
fix
dorian-K 62f825e
fix pycharm
dorian-K 6c8fd3f
Merge branch 'master' into doriank-sdpa
dorian-K c8bd06b
remove debug prints, update
dorian-K f08d16a
more
dorian-K 2de26f8
use is_causal=True
dorian-K ecb79ed
add back a test
dorian-K a1b6c5d
more tests
dorian-K 6eb0906
fix test
dorian-K c8b7c11
more tests
dorian-K b31e9b8
Merge branch 'master' into doriank-sdpa
dorian-K d6e25f9
fix formatting
dorian-K 8480c3d
fix import
dorian-K 6b35b43
remove some formatting only changes
dorian-K ef324be
more sdpa vs fallback tests
dorian-K 4480aa5
Add benchmarks
dorian-K 5469e1e
Merge branch 'master' into doriank-sdpa
dorian-K 547d139
force flash att
dorian-K 4962ced
also upgrade cross attention
dorian-K f91e5be
add back att_dropout_broadcast
dorian-K 33a045b
convert some asserts to exceptions
dorian-K 322112c
fix private member
dorian-K 96470b4
oops
dorian-K 6d3a585
reduce line length
dorian-K 0162ef3
Make tensors contigous to enable fused kernels
dorian-K 7da3b66
_embed_dim -> _feat_dim
dorian-K 315fd86
pycharm
dorian-K a388048
Merge branch 'master' into doriank-sdpa
dorian-K File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.