Commit f22f054

Disable pipeline graph mode; focus on GEMM-only graph vs BF16
1 parent 2fe1032 commit f22f054

1 file changed: +4 -5 lines changed

bench_moe_pipeline.py

Lines changed: 4 additions & 5 deletions
@@ -372,11 +372,10 @@ def run_config(name, num_experts, K, N, tokens_per_expert):
     except Exception as e:
         results["nvfp4_gemm_graph"] = None
 
-    # 5. NVFP4 pipeline (graph)
-    try:
-        results["nvfp4_pipe_graph"] = bench_nvfp4_pipeline_graph(layer, x, expert_offsets)
-    except Exception as e:
-        results["nvfp4_pipe_graph"] = None
+    # 5. NVFP4 pipeline (graph) — disabled: mode 4's direct C init
+    # conflicts with mode 5's cached init, and zero_() in graph capture
+    # needs further investigation. Mode 4 gives the GEMM throughput ceiling.
+    results["nvfp4_pipe_graph"] = None
 
     results["padded_flops"] = padded_flops
     return results
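
After this change, results["nvfp4_pipe_graph"] is always None, so any code that consumes the results dict has to treat None as "not measured". A minimal sketch of one way to handle that follows. The helpers run_mode and summarize are hypothetical and not part of bench_moe_pipeline.py, and the sketch assumes each successful mode stores a single float (for example a time in milliseconds), which the diff does not confirm.

```python
from typing import Callable, Dict, List, Optional


def run_mode(
    results: Dict[str, Optional[float]],
    key: str,
    bench_fn: Callable[[], float],
    enabled: bool = True,
) -> None:
    """Store bench_fn() under key; store None if the mode is disabled or raises.

    Hypothetical helper: mirrors the try/except pattern in the diff, with an
    explicit switch so a mode can be turned off without deleting its call.
    """
    if not enabled:
        results[key] = None
        return
    try:
        results[key] = bench_fn()
    except Exception:
        # Same fallback as the benchmark script: a failed mode records None.
        results[key] = None


def summarize(results: Dict[str, Optional[float]]) -> List[str]:
    """Format each entry, reporting None entries as 'skipped'."""
    lines = []
    for key, value in results.items():
        text = "skipped" if value is None else f"{value:.3f}"
        lines.append(f"{key}: {text}")
    return lines


if __name__ == "__main__":
    demo: Dict[str, Optional[float]] = {}
    # Placeholder lambdas stand in for the real benchmark functions.
    run_mode(demo, "nvfp4_gemm_graph", lambda: 1.25)
    run_mode(demo, "nvfp4_pipe_graph", lambda: 2.0, enabled=False)
    print("\n".join(summarize(demo)))
```

An explicit enabled switch keeps the disabled call site in place, so re-enabling mode 5 once the zero_() capture issue is understood would be a one-argument change rather than a revert.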
