[VectorDistribute] Enable intrinsic usage for unaligned attention #24510

Draft
sommerlukas wants to merge 3 commits into iree-org:main from sommerlukas:attention-unaligned-intrinsics

@sommerlukas
Contributor

Enable attention to use intrinsics for matrix multiplication even if the attention shape is not a multiple of the intrinsic size.
Previous work on padding/masking makes it possible to use intrinsics in these cases.
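
For illustration only, here is a minimal Python sketch of the general padding/masking idea. It is not the IREE implementation: the tile size, the shape, and all helper names (`round_up`, `pad_row`, `softmax`) are hypothetical. It pads one row of attention scores up to the next multiple of an assumed tile size and fills the padding with `-inf`, so the padded positions contribute nothing to the softmax.

```python
import math


def round_up(value, multiple):
    """Round value up to the next multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


def pad_row(row, padded_len, fill):
    """Return a copy of row extended to padded_len with the fill value."""
    return row + [fill] * (padded_len - len(row))


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    row_max = max(row)
    exps = [math.exp(x - row_max) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


# Assumed sizes: 5 key positions, hypothetical tile (intrinsic) size of 4.
K2 = 5
TILE = 4

scores = [0.5, -1.0, 2.0, 0.0, 1.5]

# Mask the padded positions with -inf so exp() maps them to zero weight.
padded = pad_row(scores, round_up(K2, TILE), float("-inf"))

reference = softmax(scores)       # unpadded result
masked = softmax(padded)[:K2]     # padded result, sliced back to K2
```

Under these assumptions, the padded row has a tile-aligned length while the first `K2` softmax weights match the unpadded computation.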

Additionally, this PR adds configuration and pipeline tests for unaligned shapes, mainly based on the shapes reported in #24221 and #24319. The pipeline tests cover both cases that use intrinsics and smaller shapes that do not.

Assisted-by: Codex

Signed-off-by: Lukas Sommer <lukas.sommer@amd.com>