Skip to content

Commit fc21e40

Browse files
committed
fix(profile): disable Flash vLLM torch compile
1 parent 4f1f0fa commit fc21e40

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

benchmarks/single_node/dsv4_fp4_b300_vllm_mtp.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ COMPILATION_ARGS=(
6161
--max-cudagraph-capture-size 2048
6262
)
6363
if [[ "$MODEL" == "deepseek-ai/DeepSeek-V4-Flash" ]]; then
64-
COMPILATION_ARGS=(--compilation-config '{"cudagraph_mode":"NONE","custom_ops":["all"]}')
64+
COMPILATION_ARGS=(--compilation-config '{"mode":0,"cudagraph_mode":"NONE","custom_ops":["all"]}')
6565
fi
6666

6767
BENCHMARK_MAX_MODEL_LEN=$MAX_MODEL_LEN

0 commit comments

Comments
 (0)