kernel/riscv64:Optimized the implementation of axpby on TARGET=RISCV64_ZVL256B. by guoyuanplct · Pull Request #5288 · OpenMathLib/OpenBLAS

guoyuanplct · 2025-05-29T11:09:01Z

The specific improvements are shown in the figure below.

abhishek-iitmadras · 2025-08-23T14:57:03Z

Just out of curiosity and for my learning, i have below question :

What are the key practical scenarios or algorithms where a dedicated AXPBY kernel from openBLAS provides a significant performance advantage given that if we already have a highly optimized AXPY ?

Thanks

@martin-frbg @guoyuanplct

martin-frbg · 2025-08-23T15:39:53Z

We may not have a highly optimized AXPY on all architectures, and the current default for AXPBY is a naive C loop instead of combining calls to SCAL and AXPY in the interface. (The git log suggests that axpby was added a decade ago for compatibility with MKL - #285 - and nobody looked at it - or its performance - ever since)

martin-frbg · 2025-08-23T15:54:06Z

(small correction - the Loongson crew did add optimized kernels for their hardware in late 2023, so this is not entirely without precedent. There are no callers in Reference-LAPACK, and the only user in OpenBLAS itself seems to be the generic GEADD, so this may have gone mostly unnoticed)

guoyuanplct added 2 commits May 29, 2025 17:50

Optimized the axpby function.

45fd2d9

del lines

d2003dc

guoyuanplct changed the title ~~Optimized the implementation of axpby on TARGET=RISCV64_ZVL256B.~~ kernel/riscv64:Optimized the implementation of axpby on TARGET=RISCV64_ZVL256B. May 29, 2025

martin-frbg added this to the 0.3.30 milestone May 29, 2025

martin-frbg merged commit 02267d8 into OpenMathLib:develop May 29, 2025
84 of 86 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kernel/riscv64:Optimized the implementation of axpby on TARGET=RISCV64_ZVL256B.#5288

kernel/riscv64:Optimized the implementation of axpby on TARGET=RISCV64_ZVL256B.#5288
martin-frbg merged 2 commits intoOpenMathLib:developfrom
guoyuanplct:develop

guoyuanplct commented May 29, 2025

Uh oh!

Uh oh!

abhishek-iitmadras commented Aug 23, 2025 •

edited

Loading

Uh oh!

martin-frbg commented Aug 23, 2025

Uh oh!

martin-frbg commented Aug 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

guoyuanplct commented May 29, 2025

Uh oh!

Uh oh!

abhishek-iitmadras commented Aug 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martin-frbg commented Aug 23, 2025

Uh oh!

martin-frbg commented Aug 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

abhishek-iitmadras commented Aug 23, 2025 •

edited

Loading