Skip to content

rollout ep#9392

Open
hjh0119 wants to merge 5 commits into
modelscope:mainfrom
hjh0119:rollout-ep
Open

rollout ep#9392
hjh0119 wants to merge 5 commits into
modelscope:mainfrom
hjh0119:rollout-ep

Conversation

@hjh0119
Copy link
Copy Markdown
Collaborator

@hjh0119 hjh0119 commented May 21, 2026

No description provided.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces the vllm_enable_expert_parallel parameter to support Expert Parallel (EP) for MoE models, including documentation updates and validation logic in deploy_args.py. Feedback was provided regarding the ambiguity of the warning message when automatically overriding configurations and the inconsistency of this auto-fix approach compared to other argument validations that raise errors for incompatible settings.

Comment on lines +163 to +164
logger.warning('vllm_enable_expert_parallel is only supported with vllm_use_async_engine, '
'set vllm_use_async_engine to True.')
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The warning message is slightly ambiguous as it sounds like an instruction to the user, even though the code has already performed the override. It would be clearer to state that the setting is being changed automatically.

Additionally, this auto-fix logic is inconsistent with the handling of multi_turn_scheduler in _check_args (lines 136-137), which raises a ValueError for the same incompatibility. For a more robust and consistent implementation, consider integrating this dependency into _set_default_engine_type for defaulting and _check_args for validation.

References
  1. Maintain consistency in argument validation and defaulting logic across similar parameters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant