Commit 1e48eca

Merge pull request #5695 from martin-frbg/update_readme_wasm
README.md: Update cpu section and add WASM support
2 parents 81d1029 + faa06bd commit 1e48eca

1 file changed

Lines changed: 12 additions & 1 deletion

README.md

@@ -148,11 +148,12 @@ Please read `GotoBLAS_01Readme.txt` for older CPU models already supported by th
 - **Intel Haswell**: Optimized Level-3 and Level-2 BLAS with AVX2 and FMA on x86-64.
 - **Intel Skylake-X**: Optimized Level-3 and Level-2 BLAS with AVX512 and FMA on x86-64.
 - **Intel Cooper Lake**: as Skylake-X with improved BFLOAT16 support.
+- **Intel Sapphire Rapids**: as Cooper Lake with improved BFLOAT16 SBGEMM kernel.
 - **AMD Bobcat**: Used GotoBLAS2 Barcelona codes.
 - **AMD Bulldozer**: x86-64 ?GEMM FMA4 kernels. (Thanks to Werner Saar)
 - **AMD PILEDRIVER**: Uses Bulldozer codes with some optimizations.
 - **AMD STEAMROLLER**: Uses Bulldozer codes with some optimizations.
-- **AMD ZEN**: Uses Haswell codes with some optimizations for Zen 2/3 (use SkylakeX for Zen4)
+- **AMD ZEN**: Uses Haswell codes with some optimizations for Zen 2/3, SkylakeX for Zen4, Cooperlake for Zen5
 
 #### MIPS32
 
@@ -186,9 +187,13 @@ Please read `GotoBLAS_01Readme.txt` for older CPU models already supported by th
 - **EMAG 8180**: preliminary support based on A57
 - **Neoverse N1**: (AWS Graviton2) preliminary support
 - **Neoverse V1**: (AWS Graviton3) optimized Level-3 BLAS
+- **Neoverse N2**: preliminary support
+- **Neoverse V2**: preliminary support
 - **Apple Vortex**: preliminary support based on ThunderX2/3
+- **Apple VortexM4**: preliminary support based on ThunderX2/3, SME kernels for SGEMM,SSYMM,STRMM,SSYRK,SSYR2K
 - **A64FX**: preliminary support, optimized Level-3 BLAS
 - **ARMV8SVE**: any ARMV8 cpu with SVE extensions
+- **ARMV9SME**: any ARMV9 cpu with SVE and SME extensions
 
 #### PPC/PPC64
 
@@ -249,6 +254,12 @@ e.g.:
 ```
 The old-style TARGET=LOONGSON3R5 is still supported
 
+#### WASM
+Not a cpu target in the strict sense, but portable WebAssembly for browser-based applications and the like. See emscripten.org for the compiler and related information
+
+- **WASM128_GENERIC**: Optimized SGEMM,DGEMM, DAXPY, SSUM/DSUM, SDOT/DDOT and SROT/DROT
+
+
 ### Support for multiple targets in a single library
 
 OpenBLAS can be built for multiple targets with runtime detection of the target cpu by specifying `DYNAMIC_ARCH=1` in Makefile.rule, on the gmake command line or as `-DDYNAMIC_ARCH=TRUE` in cmake.
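
As a rough illustration of the last context line of the diff, the two command-line forms of a `DYNAMIC_ARCH` build might be scripted as below. This is a sketch, not part of the commit: the `run` wrapper, the function names and the `build` directory are hypothetical, and only `DYNAMIC_ARCH=1` and `-DDYNAMIC_ARCH=TRUE` come from the README text. Commands are only printed unless `RUN=1` is set, so the script is safe to try outside an OpenBLAS checkout.

```shell
#!/bin/sh
# Sketch only: the `run` wrapper and the function names are illustrative
# and not part of OpenBLAS. The DYNAMIC_ARCH settings are the ones named
# in the README text.

# Print a command; execute it only when RUN=1
# (intended for use inside an OpenBLAS source checkout).
run() {
    echo "+ $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    fi
}

# make-based build with runtime detection of the target cpu
# (the README refers to GNU make as gmake)
build_dynamic_make() {
    run make DYNAMIC_ARCH=1
}

# cmake-based configuration with the equivalent option;
# the "build" directory name is an assumption
build_dynamic_cmake() {
    run cmake -S . -B build -DDYNAMIC_ARCH=TRUE
}

build_dynamic_make
build_dynamic_cmake
```

Running the script with `RUN=1` from the top of an OpenBLAS source tree would execute both commands instead of only echoing them.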
