In my recent dgemm comparison benchmarks (on a Zen3 AMD Milan system) I find that Octavian is essentially not scaling at all from single-socket to dual-socket. Below, 64 cores corresponds to a full single socket and 128 cores to the full dual-socket system.
| BLAS | # cores | size | GFLOPS |
|---|---|---|---|
| Intel MKL v2022.0.0 | 128 cores | 10240 | 3279 |
| Intel MKL v2022.0.0 | 64 cores | 10240 | 1684 |
| BLIS 0.9.0 | 128 cores | 10240 | 3893 |
| BLIS 0.9.0 | 64 cores | 10240 | 2014 |
| Octavian 0.3.15 | 128 cores | 10240 | 1843 |
| Octavian 0.3.15 | 64 cores | 10240 | 1802 |
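To put a number on the scaling, here is a small Python sketch that computes the dual-socket speedup (128-core GFLOPS divided by 64-core GFLOPS) from the table. The GFLOPS values are copied from the table; the variable and function names are only illustrative, and the script is not part of the benchmark itself.

```python
# GFLOPS values copied from the table above (matrix size 10240).
gflops = {
    "Intel MKL v2022.0.0": {64: 1684, 128: 3279},
    "BLIS 0.9.0": {64: 2014, 128: 3893},
    "Octavian 0.3.15": {64: 1802, 128: 1843},
}


def dual_socket_speedup(results):
    """Ratio of 128-core (dual-socket) to 64-core (single-socket) GFLOPS."""
    return results[128] / results[64]


speedups = {name: dual_socket_speedup(r) for name, r in gflops.items()}

for name, s in speedups.items():
    # Ideal scaling from one socket to two would give a speedup of 2.0,
    # so s / 2 is the parallel efficiency relative to a single socket.
    print(f"{name}: {s:.2f}x speedup, {s / 2:.0%} efficiency")
```

This gives roughly 1.95x for MKL and 1.93x for BLIS, but only about 1.02x for Octavian, i.e. almost no gain from the second socket.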
Would be great to see Octavian perform better here :)