Skip to content

tokenspeed-attention: add TokenSpeed MHA/MLA Triton kernels#990

Draft
adarshxs wants to merge 5 commits into
huggingface:mainfrom
adarshxs:tokenspeed
Draft

tokenspeed-attention: add TokenSpeed MHA/MLA Triton kernels#990
adarshxs wants to merge 5 commits into
huggingface:mainfrom
adarshxs:tokenspeed

Conversation

@adarshxs

Copy link
Copy Markdown
Contributor

What does this PR do?

Adds tokenspeed MHA/MLA triton kernels

Summary

Adds a new tokenspeed-attention Hub kernel package with portable Triton attention kernels ported from LightSeek TokenSpeed.

Exported APIs:

  • mha_prefill
  • mha_extend_with_kvcache
  • mha_decode_with_kvcache
  • mla_prefill
  • mla_decode_with_kvcache
  • attn_merge_state

@danieldk

Copy link
Copy Markdown
Member

/kernel-bot build-and-stage tokenspeed-attention

@github-actions

github-actions Bot commented Jun 23, 2026

Copy link
Copy Markdown

Build request processed.

Command: /kernel-bot build-and-stage tokenspeed-attention
Mode: build and stage
Target branch: pr-990
PR head SHA: 2f6e9a9b6e5560a8c4de3626bedda75127b58f95
Workflows: build.yaml, build-mac.yaml, build-windows.yaml

Dispatched (2):

@drbh drbh added area: build-system build.toml, Nix flakes, packaging, and kernel-builder integration area: docs README, CARD.md, guides, and repository documentation area: tests Tests, validation code, and benchmark harnesses backend: triton Vendor-neutral Triton kernels new-kernel A brand-new kernel package is added size: XL Diff > 1000 lines type: feature New functionality / capability labels Jun 30, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area: build-system build.toml, Nix flakes, packaging, and kernel-builder integration area: docs README, CARD.md, guides, and repository documentation area: tests Tests, validation code, and benchmark harnesses backend: triton Vendor-neutral Triton kernels new-kernel A brand-new kernel package is added size: XL Diff > 1000 lines type: feature New functionality / capability

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants