Commit 29ac412
authored
benchmark_serving: fail run when request failure rate exceeds 5% (#1379)
Gate the benchmark after results are written so the artifact still
uploads, then exit non-zero if (num_prompts - completed) / num_prompts
> 0.05. Surfaces partial-failure runs that currently get reported as
successful jobs.1 parent 766b097 commit 29ac412
1 file changed
Lines changed: 10 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
900 | 900 | | |
901 | 901 | | |
902 | 902 | | |
| 903 | + | |
| 904 | + | |
| 905 | + | |
| 906 | + | |
| 907 | + | |
| 908 | + | |
| 909 | + | |
| 910 | + | |
| 911 | + | |
| 912 | + | |
903 | 913 | | |
904 | 914 | | |
905 | 915 | | |
| |||
0 commit comments