|
1 | 1 | OpenBLAS ChangeLog |
| 2 | +==================================================================== |
| 3 | +Version 0.3.33 |
| 4 | +23-Apr-2026 |
| 5 | + |
| 6 | +general: |
| 7 | + - fixed an incorrect cast in the SBGEMM test case that could lead to spurious test failures |
| 8 | + - fixed an invalid memory access in the converted C version of the CBLAS tests |
| 9 | + - made the BIGNUMA setting automatic when the number of cores exceeds 256 |
| 10 | + - Imported recent updates from Reference-LAPACK to realign with its upcoming 3.13.0 release: |
| 11 | + - Implement ?LARF1F and ?ORM2R (Reference-LAPACK PRs 1019,1020,1196,1257) |
| 12 | + - Change loop order in ?GETC2 to improve performance (Reference-LAPACK PR 1023) |
| 13 | + - Change WORK array dimension in ?GELQS/?GEQRS (Reference-LAPACK PR 1094) |
| 14 | + - Add NaN checks for input matrix A in ?GEEV (Reference-LAPACK PR 1136) |
| 15 | + - Fix support for jobu/v in LAPACKE_?GESVDQ_WORK (Reference-LAPACK PRs 1146,1221) |
| 16 | + - Fix display of version number in LAPACK testsuite (Reference-LAPACK PR 1149) |
| 17 | + - Fix DGGES test seed to avoid bad matrix cases (Reference-LAPACK PR 1187) |
| 18 | + - Fix truncation of large WORK array sizes in ZHE (Reference-LAPACK PR 1195) |
| 19 | + - Fix overwriting of LDSWORK parameter in ?TRSYL3 (Reference-LAPACK PR 1206) |
| 20 | + - Fix overwriting of error states in some EIG tests (Reference-LAPACK PR 1207) |
| 21 | + - Remove unused parameter in DORBDB3/ZUNBDB3 (Reference-LAPACK PR 1209) |
| 22 | + - Re-enable testing of ?BB and ?GG driver functions (Reference-LAPACK PR 1211) |
| 23 | + - Fix workspace size calculation in ?TGSEN (Reference-LAPACK PR 774) |
| 24 | + - Fix typos in the EIG DMD tests and initialized the cutoff variable (PR 1212,1228) |
| 25 | + - Optimized looping in ?LACPY/?LASCL/?LANTR with fat matrix and UPLO=L (PR 1251) |
| 26 | + |
| 27 | +arm64: |
| 28 | + - worked around a serious miscompilation of the DDOT kernel by GCC15, affecting |
| 29 | + most non-SVE targets, and SVE targets in the case of non-unit array stride) |
| 30 | + - fixed an accuracy issue in the GEMV kernel for Neoverse V1 and other SVE targets |
| 31 | + - fixed broken STRMM and SSYMM in DYNAMIC_ARCH builds when running on non-SME hardware |
| 32 | + - added an optimized SHGEMM kernel for Neoverse N2 |
| 33 | + - fixed DYNAMIC_ARCH builds under Windows on Arm |
| 34 | + - Added autodetection of Cortex A75/A76 in DYNAMIC_ARCH builds |
| 35 | + - Added autodetection of Neoverse V3, currently supported through V2 kernels |
| 36 | + - Re-added support for the "VORTEX" target in DYNAMIC_ARCH builds with DYNAMIC_LIST |
| 37 | + - Fixed CMake-based builds that use the "Ninja" generator |
| 38 | + |
| 39 | +loongarch64: |
| 40 | + - fixed a build failure due to missing support for the new half-precision float type |
| 41 | + - fixed a long-standing bug in asserting 64bit capability in the c_check helper script |
| 42 | + |
| 43 | +x86_64: |
| 44 | + - added a workaround for miscompilation of the AVX512 GEMM kernels by LLVM on Windows |
| 45 | + - fixed a build failure in the LAED3 code when compiling with MinGW on Windows |
| 46 | + - fixed CMake-based compilation with the NVIDIA HPC compiler |
| 47 | + - Fixed CMake-based builds that use the "Ninja" generator |
| 48 | + |
| 49 | +wasm: |
| 50 | + - added optimized kernels for STRSM and DTRSM |
| 51 | + |
2 | 52 | ==================================================================== |
3 | 53 | Version 0.3.32 |
4 | 54 | 23-Mar-2026 |
|
0 commit comments