Commit 5e2d39e

run vllm serve in background
1 parent dc6d76d commit 5e2d39e

1 file changed

Lines changed: 2 additions & 2 deletions

benchmarks/gptoss_fp4_h100_docker.sh

@@ -29,10 +29,10 @@ vllm serve $MODEL --host=0.0.0.0 --port=$PORT \
 --gpu-memory-utilization=0.9 \
 --tensor-parallel-size=$TP \
 --max-num-seqs=$CONC \
---disable-log-requests
+--disable-log-requests &
 
 set +x
-until curl --output /dev/null --silent --head --fail http://localhost:$PORT/health; do
+until curl --output /dev/null --silent --fail http://localhost:$PORT/health; do
 sleep 5
 done
 
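The two changes work together: the trailing `&` puts `vllm serve` in the background so the script can reach the polling loop at all, and dropping `--head` makes curl send a plain GET to `/health`. The commit does not say why `--head` was removed; a plausible reason is that the endpoint does not answer HEAD requests, in which case `--fail` would never succeed. The loop as committed waits forever if the server crashes during startup. The following is a minimal sketch of a more defensive wait, not part of the commit. The function name `wait_for_health`, its arguments, the return codes, and the default timeout are all hypothetical choices for illustration.

```shell
# Hypothetical helper (not in the commit): poll a health URL until it responds,
# the background server process exits, or a deadline passes.
# Usage: wait_for_health URL PID [TIMEOUT_SECONDS] [INTERVAL_SECONDS]
# Returns 0 when healthy, 1 if the process exited, 2 on timeout.
wait_for_health() {
  wfh_url=$1
  wfh_pid=$2
  wfh_timeout=${3:-600}   # assumed default, tune for model load time
  wfh_interval=${4:-5}    # same 5-second sleep as the original loop
  wfh_deadline=$(( $(date +%s) + wfh_timeout ))
  while :; do
    # Give up early if the background process is no longer running.
    if ! kill -0 "$wfh_pid" 2>/dev/null; then
      echo "process $wfh_pid exited before becoming healthy" >&2
      return 1
    fi
    # Plain GET (no --head), matching the updated loop in the diff.
    if curl --output /dev/null --silent --fail --max-time 5 "$wfh_url"; then
      return 0
    fi
    if [ "$(date +%s)" -ge "$wfh_deadline" ]; then
      echo "timed out after ${wfh_timeout}s waiting for $wfh_url" >&2
      return 2
    fi
    sleep "$wfh_interval"
  done
}

# Example use in a script like this one (variables as in the diff):
#   vllm serve "$MODEL" --host=0.0.0.0 --port="$PORT" ... &
#   SERVER_PID=$!
#   wait_for_health "http://localhost:$PORT/health" "$SERVER_PID" || exit 1
```

Capturing `$!` right after the backgrounded command is what lets the helper tell a slow startup apart from a server that has already died.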
