-
Notifications
You must be signed in to change notification settings - Fork 145
Pull requests: pytorch/helion
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[compiler][autotuner] Compiler seed heuristics
CLA Signed
This label is managed by the Meta Open Source bot.
#2392
opened May 10, 2026 by
ethche
Contributor
Loading…
[autotune] Add observed heuristic seeds
CLA Signed
This label is managed by the Meta Open Source bot.
Attention Perf: Transpose blocked K right before QK instead of pre-transposing before the kernel
CLA Signed
This label is managed by the Meta Open Source bot.
#2374
opened May 9, 2026 by
AmesingFlank
Contributor
Loading…
Attention Perf: Multiply Q in-loop to avoid memory spillage
CLA Signed
This label is managed by the Meta Open Source bot.
#2373
opened May 9, 2026 by
AmesingFlank
Contributor
Loading…
Seeding the configs for aymmetric skinny matmuls
CLA Signed
This label is managed by the Meta Open Source bot.
#2357
opened May 7, 2026 by
umechand-amd
Collaborator
Loading…
Avoid dynamic shape recompiles for 0/1 tensor dimensions
CLA Signed
This label is managed by the Meta Open Source bot.
#2353
opened May 7, 2026 by
oulgen
Contributor
Loading…
[runtime:pallas] migrate to torch_tpu's new Pallas buffer donation API
CLA Signed
This label is managed by the Meta Open Source bot.
#2351
opened May 7, 2026 by
cota
Collaborator
Loading…
Add multi tile loop support to autodiff
CLA Signed
This label is managed by the Meta Open Source bot.
#2338
opened May 7, 2026 by
karthickai
Contributor
•
Draft
[DO NOT MERGE] [Pallas] Manual full-coverage TPU benchmark trigger
CLA Signed
This label is managed by the Meta Open Source bot.
Add RemoteCacheBackend ABC for pluggable remote autotune caching
CLA Signed
This label is managed by the Meta Open Source bot.
#2317
opened May 6, 2026 by
fulvius31
Collaborator
Loading…
[Pallas] Skip factory tensor padding for Pallas backend
CLA Signed
This label is managed by the Meta Open Source bot.
[TPU][Pallas] relax tolerances and fix Pallas autotuning OOM in layer_norm
CLA Signed
This label is managed by the Meta Open Source bot.
#2272
opened May 5, 2026 by
yarongmu-google
Collaborator
•
Draft
A jagged_hstu_attention example that works on Pallas TPU
CLA Signed
This label is managed by the Meta Open Source bot.
#2218
opened May 3, 2026 by
AmesingFlank
Contributor
Loading…
[Pallas] Use LONG_INT_TYPE for jagged offsets in examples and tests
CLA Signed
This label is managed by the Meta Open Source bot.
[Autotuner] Long-lived worker pool for parallel precompile
CLA Signed
This label is managed by the Meta Open Source bot.
[Autotuner] Raise default min_improvement_delta to 0.003
CLA Signed
This label is managed by the Meta Open Source bot.
autotuner: cap tile size for imbalanced 2D grid dims
CLA Signed
This label is managed by the Meta Open Source bot.
#2102
opened Apr 24, 2026 by
umechand-amd
Collaborator
Loading…
[Pallas] Switch gather to jnp.take_along_axis (for JAX issue filing)
CLA Signed
This label is managed by the Meta Open Source bot.
#2061
opened Apr 20, 2026 by
AmesingFlank
Contributor
•
Draft
[Pallas] Lower aten gather using one_hot + sum for TPU compatibility, unblocking cross_entropy
CLA Signed
This label is managed by the Meta Open Source bot.
#2060
opened Apr 20, 2026 by
AmesingFlank
Contributor
Loading…
Use torch.gather instead of generic int indexing for cross_entropy example
CLA Signed
This label is managed by the Meta Open Source bot.
#2058
opened Apr 20, 2026 by
AmesingFlank
Contributor
•
Draft
[Pallas] Implement indirect gather via exact one-hot matmul
CLA Signed
This label is managed by the Meta Open Source bot.
[TPU][Pallas]Fix example/cross_entropy.py on Pallas TPU
CLA Signed
This label is managed by the Meta Open Source bot.
#2019
opened Apr 14, 2026 by
yarongmu-google
Collaborator
•
Draft
WIP: fp8 all gather matmul
CLA Signed
This label is managed by the Meta Open Source bot.
#1974
opened Apr 7, 2026 by
shunting314
Contributor
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.