Uh oh!

There was an error while loading. Please reload this page.

fla-org / flash-linear-attention Public

Notifications You must be signed in to change notification settings
Fork 569
Star 5.3k

Code
Issues 40
Pull requests 28
Discussions
Actions
Projects
Wiki
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security and quality
Insights

Pull requests: fla-org/flash-linear-attention

Labels 18 Milestones 3

New pull request New

28 Open 616 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[Fix] fix benchamrk issues and add ub management for npu

#1001 opened Jul 1, 2026 by sunyi0505 Contributor

Loading…

[GDN] Restrict Blackwell gated delta bwd autotune

#1000 opened Jul 1, 2026 by HQuanShaWu

Loading…

[GDN] Add FlashQLA backend dispatch

#998 opened Jul 1, 2026 by Erix025

Loading…

7 tasks done

[GDN2] Add fused BT=16 inference kernels for GDN2 prefill

#990 opened Jun 29, 2026 by Ghd-2077

Loading…

[Attn] Remove erroneous @torch.compile on ParallelAttentionFunction class

#980 opened Jun 25, 2026 by arbi-dev • Draft

[Perf] Generalize fused q/k/v short convolution across layers

#977 opened Jun 23, 2026 by zhiyuan1i Collaborator

Loading…

[Fix] generate() prefills the full prompt instead of only the last token

#962 opened Jun 20, 2026 by Sunt-ing Contributor

Loading…

[Fix] causal_conv1d_bwd silent dweight corruption with cu_seqlens and B>1

#955 opened Jun 19, 2026 by TimDarcet • Draft

[Misc] Add generic concat + conv + split vs N separate grouped convs benchmark

#952 opened Jun 18, 2026 by Costa-SM Contributor

Loading…

[Model] Add Preconditioned Gated DeltaNet (PGDN) and KDA (PKDA)

#950 opened Jun 17, 2026 by ntumm120

Loading…

[GDN] Fix GDN precision on Blackwell

#948 opened Jun 14, 2026 by syeehyn

Loading…

[GDN] Restrict chunk_delta_h Blackwell autotune stages

#946 opened Jun 12, 2026 by IgorYashch

Loading…

[KDA] Add fused BT=16 inference kernels for KDA prefill

#915 opened May 22, 2026 by kuoihao

Loading…

[Fix] Zero-init chunk-mode backward gradient buffers to prevent NaN propagation

#892 opened May 12, 2026 by xylian86

Loading…

[Fix] Fix shared memory race in tilelang chunk_bwd dg_last accumulation help wanted

Extra attention is needed

#890 opened May 11, 2026 by Erix025

Loading…

[SSE] Add SSE integration

#882 opened May 9, 2026 by Pan-Yuqi Contributor

Loading…

[KDA][AMD]for kda kernel,fix core dump on AMD GPU and tune the config for AMD branch

#869 opened Apr 29, 2026 by binding7012

Loading…

[Ops] Fix int32 overflow in pointer arithmetic across all Triton kernels

#818 opened Apr 8, 2026 by tmct Contributor • Draft

Add MALA (Magnitude-Aware Linear Attention) to FLA

#809 opened Apr 3, 2026 by drdanielwuwu

Loading…

feat: add Quasar Attention and standalone model implementation

#805 opened Mar 31, 2026 by troy12x

Loading…

[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform

#797 opened Mar 28, 2026 by hypnopump Contributor

Loading…

5 tasks

[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs

#796 opened Mar 28, 2026 by mpurland Contributor

Loading…

5 tasks done

Add fused short convolution kernel with L2 norm

#661 opened Nov 24, 2025 by sustcsonglin Collaborator

Loading…

[kda] add recursive block intra implementation

#656 opened Nov 22, 2025 by sustcsonglin Collaborator

Loading…

[Deltaformer] kernel improvement; if-else optimization; change w to fp32; add 1e-9 to avoid nan

#603 opened Sep 30, 2025 by foreverpiano

Loading…

Previous 1 2 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!