Commit 405a86a
agentic: bump aiperf Configure Profiling timeout 900s -> 1800s
R3 H200 marathon hit aiperf's 900s Configure Profiling timeout on every
job (14/14). Root cause: H200 /tmp is ~3x slower than B300's, and 14
parallel jobs contending for the same shared /tmp pushed dataset config
(load + reconstruct + 62GB mmap + 4min inputs.json) past the 900s
budget. vLLM started cleanly in every case — aiperf itself timed out
during its own dataset materialization step.
Bump the timeout to 1800s (30 min) so worst-case slow-/tmp + contention
scenarios still fit. Post-setup measurement window unaffected.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>1 parent cc922de commit 405a86a
1 file changed
Lines changed: 7 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
956 | 956 | | |
957 | 957 | | |
958 | 958 | | |
959 | | - | |
960 | | - | |
961 | | - | |
962 | | - | |
963 | | - | |
| 959 | + | |
| 960 | + | |
| 961 | + | |
| 962 | + | |
| 963 | + | |
| 964 | + | |
964 | 965 | | |
965 | 966 | | |
966 | | - | |
| 967 | + | |
967 | 968 | | |
968 | 969 | | |
969 | 970 | | |
| |||
0 commit comments