Skip to content

Further performance improvements of [SD]GEMV on A64FX and Neoverse V1. #5210

@iha-taisei

Description

@iha-taisei

Pull Request #5157 did excellent work and the performance of non-transposed [SD]GEMV improved.
I think Neoverse V1 has room for further performance improvement and the A64FX has a better optimal loop unrolling number. So, I would like to propose a patch for the A64FX and Neoverse V1.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions