Skip to content

Commit d1430d9

Browse files
committed
fixed benchmark in docs
1 parent 6aa85f0 commit d1430d9

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

docs/source/en/training/distributed_inference.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -360,4 +360,4 @@ We ran a benchmark with Ulysess, Ring, and Unified Attention with [this script](
360360
| ring | 13076.492 | 3.82 | 56.02 |
361361
| unified_balanced | 11068.705 | 4.52 | 33.85 |
362362

363-
From the above table, it's clear that Unified Attention as a CP backend provides the best trade-off between speed and memory.
363+
From the above table, it's clear that Ulysses provides better throughput, but the number of devices it can use remains limited to number of attention-heads, a limitation that is solved by unified attention.

0 commit comments

Comments
 (0)