File tree Expand file tree Collapse file tree
hardware-tests/best-stack-followup-2026-05-17 Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -51,10 +51,14 @@ This bundle exercises a different combination on the same hardware:
5151| cell | dream-server ROCm 7 (this bundle) | canonical Vulkan b9151 |
5252| ---| ---:| ---:|
5353| ctx=1024 gen=128 decode | 7.666 ± 0.003 | ~ 7.82 peak |
54- | ctx=1024 gen=512 decode | 7.614 ± 0.001 | — |
55- | ctx=1024 gen=2048 decode | 7.575 ± 0.004 | — |
56- | ctx=4096 gen=128 decode | 7.525 ± 0.002 | — |
54+ | ctx=1024 gen=512 decode | 7.614 ± 0.001 | 7.784 ± 0.004 |
55+ | ctx=1024 gen=2048 decode | 7.575 ± 0.004 | 7.780 ± 0.001 |
56+ | ctx=4096 gen=128 decode | 7.525 ± 0.002 | 7.771 ± 0.003 |
57+ | ctx=4096 gen=512 decode | 7.472 ± 0.001 | 7.724 ± 0.001 |
58+ | ctx=4096 gen=2048 decode | 7.439 ± 0.0003 | 7.706 ± 0.001 |
59+ | ctx=16384 gen=128 decode | 7.062 ± 0.002 | 7.549 ± 0.003 |
5760| ctx=4096 gen=128 ** prefill** | ** 111.94 ± 0.002** | ** ~ 292** (peak across cells) |
61+ | ctx=16384 gen=128 ** TTFT / prefill** | ** 185.6 s / 84.08 tok/s** | ** 59.3 s / 263.1 tok/s** |
5862
5963** Decode:** essentially the same as canonical Vulkan within run-to-run noise. ROCm 7 isn't faster, isn't slower.
6064
You can’t perform that action at this time.
0 commit comments