Improvement of 2D thread-partitioned GEMM for M << N case by nakagawa-fj · Pull Request #5276 · OpenMathLib/OpenBLAS

nakagawa-fj · 2025-05-21T12:30:13Z

Closes #5270
The 2D thread partitioning in GEMM (PR#4655) requires nthreads_m % 2 == 0. This can prevent optimal nthreads_m and nthreads_n combinations on architectures like A64FX (48 cores) or Grace (144 cores) when M<<N, due to core counts having divisors other than 2.
Specifically, when matrix size N is significantly larger than M, the number of threads for N direction should be increased.
However, if nthreads_m includes divisors other than 2, such as 3, the increase of nthreads_n is prevented by ' nthreads_m % 2 == 0 '.
This modification removes the nthreads_m % 2 == 0 restriction and selects the combination that minimizes the following objective function 'n * nthreads_m + m * nthreads_n'.
This change improves the performance of multi-threaded GEMM for M << N cases.

martin-frbg · 2025-05-21T21:41:44Z

Thank you

Update 2D thread-partitioned GEMM for M << N case.

2351a98

martin-frbg added this to the 0.3.30 milestone May 21, 2025

martin-frbg merged commit e2e6a4d into OpenMathLib:develop May 21, 2025
82 of 86 checks passed

martin-frbg mentioned this pull request May 21, 2025

Fix compilation with pre-C99 compilers #5278

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvement of 2D thread-partitioned GEMM for M << N case#5276

Improvement of 2D thread-partitioned GEMM for M << N case#5276
martin-frbg merged 1 commit intoOpenMathLib:developfrom
nakagawa-fj:gemm_2d_thread_partitioning

nakagawa-fj commented May 21, 2025 •

edited

Loading

Uh oh!

martin-frbg commented May 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nakagawa-fj commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martin-frbg commented May 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nakagawa-fj commented May 21, 2025 •

edited

Loading