-
Notifications
You must be signed in to change notification settings - Fork 23
Pull requests: AMD-AGI/Primus-Turbo
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(gemm): avoid physical transpose in flydsl fp8 dense gemm trans_c
ci:gpu
#406
opened Jul 4, 2026 by
kyle-256
Collaborator
Loading…
5 of 12 tasks
fix(triton): scope AMD gfx950 compiler knobs to GEMM kernel launches
ci:gpu
#405
opened Jul 3, 2026 by
kyle-256
Collaborator
Loading…
6 of 12 tasks
[WIP][feat] Optimize flydsl MXFP8 quant kernel
#403
opened Jul 1, 2026 by
kyle-256
Collaborator
Loading…
12 tasks
feat: add quantized tensor support for mxfp8 grouped gemm
ci:gpu
#401
opened Jun 30, 2026 by
RuibinCheung
Collaborator
Loading…
5 of 12 tasks
optimize by optimize/optimize_turbo_blockwise_fp8_gemm_with_flydsl_ba_202606261106
#399
opened Jun 29, 2026 by
ChengYao-amd
Collaborator
Loading…
12 tasks
[WIP]feat(gemm): add fp8 blockwise gemm flydsl backend
#393
opened Jun 23, 2026 by
ChengYao-amd
Collaborator
Loading…
feat: refine quant config arguments
ci:gpu
#379
opened Jun 12, 2026 by
RuibinCheung
Collaborator
Loading…
3 of 12 tasks
[WIP] feat: support build on gfx1250
ci:gpu
#374
opened Jun 9, 2026 by
RuibinCheung
Collaborator
•
Draft
6 of 12 tasks
feat(grouped_gemm): add CK work-stealing variant with schedule API
ci:gpu
#348
opened May 27, 2026 by
wenchenvincent
Contributor
Loading…
7 of 12 tasks
[WIP] [Feature] Add Turbo MXFP8 Grouped GEMM (gfx950) for MoE
#330
opened May 7, 2026 by
kyle-256
Collaborator
Loading…
6 of 12 tasks
feat: add more activation func
#329
opened May 7, 2026 by
RuibinCheung
Collaborator
Loading…
8 of 9 tasks
opt(gemm): add hipBLASLt algorithm cache and thread-local workspace
#321
opened Apr 30, 2026 by
jasainio
Contributor
Loading…
6 of 12 tasks
Refactor: moe dispatch combine autotune
ci:gpu
#312
opened Apr 24, 2026 by
zhenhuang12
Collaborator
Loading…
7 of 12 tasks
feat: enable hybrid FP8 dtypes on Triton grouped GEMM backends
#288
opened Apr 15, 2026 by
sarthak-amd
•
Draft
perf: optimize hipBLASLt grouped GEMM with algo tuning, enable grouped_gemm autotune hipblaslt support
#284
opened Apr 14, 2026 by
kyle-256
Collaborator
Loading…
feat(benchmark): per-model/GPU batch sizes and vocab projection for GEMM bench
#265
opened Mar 31, 2026 by
Z-Y00
Loading…
refactor: reorganize moe ops and kernels
#243
opened Mar 5, 2026 by
zhenhuang12
Collaborator
Loading…
ProTip!
Filter pull requests by the default branch with base:main.