Commit f28be62

Add Output TPUT per GPU and Input TPUT per GPU to summary table
Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>
1 parent d227611 commit f28be62

2 files changed

Lines changed: 7 additions & 4 deletions

utils/process_result.py

Lines changed: 2 additions & 1 deletion
@@ -26,7 +26,8 @@
 'framework': framework,
 'precision': precision,
 'tput_per_gpu': float(bmk_result['total_token_throughput']) / tp_size,
-'output_tput_per_gpu': float(bmk_result['output_throughput']) / tp_size
+'output_tput_per_gpu': float(bmk_result['output_throughput']) / tp_size,
+'input_tput_per_gpu': float(bmk_result['input_throughput']) / tp_size
 }
 
 if mtp_mode: # MTP
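
Reviewer note: the per-GPU normalization in this hunk can be sketched in isolation as follows. This is a minimal illustration, not the actual contents of utils/process_result.py: the helper name `per_gpu_throughput` and the sample numbers are hypothetical, and only the dictionary keys and the division by `tp_size` are taken from the diff.

```python
def per_gpu_throughput(bmk_result: dict, tp_size: int) -> dict:
    """Divide each aggregate throughput by tp_size, as the hunk above does.

    Hypothetical helper for illustration; the real script builds these
    entries inline in a larger dict.
    """
    return {
        'tput_per_gpu': float(bmk_result['total_token_throughput']) / tp_size,
        'output_tput_per_gpu': float(bmk_result['output_throughput']) / tp_size,
        'input_tput_per_gpu': float(bmk_result['input_throughput']) / tp_size,
    }


# Made-up benchmark values, chosen only to make the arithmetic easy to follow.
sample = {
    'total_token_throughput': 8000.0,
    'output_throughput': 6000.0,
    'input_throughput': 2000.0,
}
row = per_gpu_throughput(sample, tp_size=8)
print(row)
```

A missing `input_throughput` key in the benchmark result would raise `KeyError` here, so the benchmark output must include that field for the new column to be populated.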

utils/summarize.py

Lines changed: 5 additions & 3 deletions
@@ -12,8 +12,8 @@
 results.sort(key=lambda r: (r['hw'], r.get('framework', 'vllm'), r.get('precision', 'fp8'), r['tp'], r['ep'], r['conc']))
 
 summary_header = f'''\
-| Hardware | Framework | Precision | TP | EP | DP Attention | Conc | TTFT (ms) | TPOT (ms) | E2EL (s) | TPUT per GPU |
-| :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: |\
+| Hardware | Framework | Precision | TP | EP | DP Attention | Conc | TTFT (ms) | TPOT (ms) | E2EL (s) | TPUT per GPU | Output TPUT per GPU | Input TPUT per GPU |
+| :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: |\
 '''
 print(summary_header)
 
@@ -31,5 +31,7 @@
 f"| {(result['median_ttft'] * 1000):.4f} "
 f"| {(result['median_tpot'] * 1000):.4f} "
 f"| {result['median_e2el']:.4f} "
-f"| {result['tput_per_gpu']:.4f} |"
+f"| {result['tput_per_gpu']:.4f} "
+f"| {result['output_tput_per_gpu']:.4f} "
+f"| {result['input_tput_per_gpu']:.4f} |"
 )
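
Reviewer note: because the header and the row f-string are edited separately, it is easy for their column counts to drift apart. The sketch below shows one way to check that they agree. It is an assumption-laden illustration rather than code from utils/summarize.py: `count_cells` and `format_row` are hypothetical helpers, the first seven cells of the row are not visible in the diff and are guessed from the header and the sort key, and the `dp_attention` key name and the sample values are invented.

```python
HEADER = (
    "| Hardware | Framework | Precision | TP | EP | DP Attention | Conc "
    "| TTFT (ms) | TPOT (ms) | E2EL (s) | TPUT per GPU "
    "| Output TPUT per GPU | Input TPUT per GPU |"
)


def count_cells(line: str) -> int:
    """Count the cells in a pipe-delimited Markdown table line."""
    return len(line.strip().strip('|').split('|'))


def format_row(result: dict) -> str:
    """Render one summary row (hypothetical helper).

    The leading seven cells are assumed; the last six follow the diff.
    """
    return (
        f"| {result['hw']} "
        f"| {result.get('framework', 'vllm')} "
        f"| {result.get('precision', 'fp8')} "
        f"| {result['tp']} "
        f"| {result['ep']} "
        f"| {result['dp_attention']} "  # assumed key name
        f"| {result['conc']} "
        f"| {(result['median_ttft'] * 1000):.4f} "
        f"| {(result['median_tpot'] * 1000):.4f} "
        f"| {result['median_e2el']:.4f} "
        f"| {result['tput_per_gpu']:.4f} "
        f"| {result['output_tput_per_gpu']:.4f} "
        f"| {result['input_tput_per_gpu']:.4f} |"
    )


# Invented sample result, for illustration only.
sample_result = {
    'hw': 'gpu-x', 'tp': 8, 'ep': 1, 'dp_attention': 'false', 'conc': 64,
    'median_ttft': 0.25, 'median_tpot': 0.02, 'median_e2el': 12.5,
    'tput_per_gpu': 1000.0, 'output_tput_per_gpu': 750.0,
    'input_tput_per_gpu': 250.0,
}
sample_row = format_row(sample_result)
assert count_cells(HEADER) == count_cells(sample_row)
print(sample_row)
```

A check like this could run in a unit test so that a future column addition that updates only the header, or only the row, fails fast.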
