Skip to content

Commit 2b0c1dd

Browse files
author
Michael Bradley
committed
Polish Strix follow-up findings table
1 parent 3aec6fc commit 2b0c1dd

1 file changed

Lines changed: 7 additions & 3 deletions

File tree

  • hardware-tests/best-stack-followup-2026-05-17

hardware-tests/best-stack-followup-2026-05-17/findings.md

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -51,10 +51,14 @@ This bundle exercises a different combination on the same hardware:
5151
| cell | dream-server ROCm 7 (this bundle) | canonical Vulkan b9151 |
5252
|---|---:|---:|
5353
| ctx=1024 gen=128 decode | 7.666 ± 0.003 | ~7.82 peak |
54-
| ctx=1024 gen=512 decode | 7.614 ± 0.001 ||
55-
| ctx=1024 gen=2048 decode | 7.575 ± 0.004 ||
56-
| ctx=4096 gen=128 decode | 7.525 ± 0.002 ||
54+
| ctx=1024 gen=512 decode | 7.614 ± 0.001 | 7.784 ± 0.004 |
55+
| ctx=1024 gen=2048 decode | 7.575 ± 0.004 | 7.780 ± 0.001 |
56+
| ctx=4096 gen=128 decode | 7.525 ± 0.002 | 7.771 ± 0.003 |
57+
| ctx=4096 gen=512 decode | 7.472 ± 0.001 | 7.724 ± 0.001 |
58+
| ctx=4096 gen=2048 decode | 7.439 ± 0.0003 | 7.706 ± 0.001 |
59+
| ctx=16384 gen=128 decode | 7.062 ± 0.002 | 7.549 ± 0.003 |
5760
| ctx=4096 gen=128 **prefill** | **111.94 ± 0.002** | **~292** (peak across cells) |
61+
| ctx=16384 gen=128 **TTFT / prefill** | **185.6 s / 84.08 tok/s** | **59.3 s / 263.1 tok/s** |
5862

5963
**Decode:** essentially the same as canonical Vulkan within run-to-run noise. ROCm 7 isn't faster, isn't slower.
6064

0 commit comments

Comments
 (0)