Commit 8eb2f1e

add return_expert_indices=True in ep_prefill
1 parent 82f0d7f commit 8eb2f1e

1 file changed

Lines changed: 1 addition & 0 deletions

fastdeploy/model_executor/layers/moe/fused_moe_cutlass_backend.py

@@ -168,6 +168,7 @@ def apply_ep_prefill(
 tokens_per_expert=[],
 padding_alignment=128,
 override_buffer_size=token_all_num,
+return_expert_indices=True,
 )
 )
