Skip to content

Commit 07fc926

Browse files
committed
stop vllm to stdout after it stops pt 2
1 parent a99755d commit 07fc926

1 file changed

Lines changed: 3 additions & 5 deletions

File tree

benchmarks/gptoss_fp4_h100_docker.sh

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -29,18 +29,16 @@ vllm serve $MODEL --host=0.0.0.0 --port=$PORT \
2929
--gpu-memory-utilization=0.9 \
3030
--tensor-parallel-size=$TP \
3131
--max-num-seqs=$CONC \
32-
--disable-log-requests &
32+
--disable-log-requests 2>&1 | tee $(mktemp /tmp/server-XXXXXX.log) &
3333

34-
SERVER_PID=$!
34+
VLLM_PID=$!
3535
set +x
36-
tail -f /tmp/vllm_server.log &
37-
TAIL_PID=$!
3836

3937
until curl --output /dev/null --silent --fail http://localhost:$PORT/health; do
4038
sleep 5
4139
done
4240

43-
kill $TAIL_PID 2>/dev/null
41+
pkill -P $$ tee 2>/dev/null
4442

4543
pip install -q datasets pandas
4644
git clone https://github.com/kimbochen/bench_serving.git

0 commit comments

Comments
 (0)