feat: add silu clamp limit to shared expert for DeepSeek-V4 #1665

Merged
AlpinDale merged 1 commit into main from feat/dsv4-silu-clamp on May 1, 2026

@AlpinDale
Collaborator

No description provided.

Signed-off-by: AlpinDale <alpindale@gmail.com>
Contributor

@gemini-code-assist (Bot) left a comment

Code Review

This pull request introduces the SiluAndMulWithClamp activation function, which implements SwiGLU with input clamping, and integrates it into the DeepseekV4 model. The changes include new CUDA kernels for clamped activations and a new DeepseekV4MLP class. The review flags one issue: the CPU MoE implementation instantiates the SiluAndMul class inside a lambda, which is inefficient and could also lead to initialization errors.
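For readers unfamiliar with the operation, a clamped SwiGLU can be sketched in plain Python as below. This is only an illustration, not the PR's kernel: the function name `silu_and_mul_with_clamp`, the gate-then-up input layout, and the exact clamp rule (gate capped at `+limit`, up projection clamped to `[-limit, limit]`) are all assumptions, since the diff is not shown on this page.

```python
import math


def silu(x: float) -> float:
    """SiLU (swish): x * sigmoid(x), written to avoid overflow in exp()."""
    if x >= 0.0:
        return x / (1.0 + math.exp(-x))
    e = math.exp(x)  # x < 0, so e is in (0, 1) and cannot overflow
    return x * e / (1.0 + e)


def silu_and_mul_with_clamp(row, limit):
    """Hypothetical reference for a clamped SwiGLU applied to one row.

    Assumed layout: the first half of `row` is the gate projection and
    the second half is the up projection.
    Assumed clamp rule: gate is capped at +limit, up is clamped to
    [-limit, limit]. The PR's kernels may differ.
    """
    if len(row) % 2 != 0:
        raise ValueError("row length must be even (gate half + up half)")
    d = len(row) // 2
    out = []
    for gate, up in zip(row[:d], row[d:]):
        gate = min(gate, limit)             # cap the gate from above
        up = max(-limit, min(up, limit))    # clamp up symmetrically
        out.append(silu(gate) * up)
    return out
```

With a very large `limit` the function reduces to an ordinary SiLU-and-multiply, which makes it easy to compare against an unclamped reference.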

Comment thread aphrodite/model_executor/layers/fused_moe/cpu_fused_moe.py
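The general pattern behind this review comment can be illustrated as follows. The code in `cpu_fused_moe.py` is not reproduced on this page, so the sketch uses a hypothetical `Activation` class as a stand-in for a layer whose constructor does real work; it only shows why building the object inside a lambda repeats that work on every call, and how hoisting the construction avoids it.

```python
class Activation:
    """Hypothetical stand-in for an activation layer with setup cost."""

    instances = 0  # counts how many times the constructor has run

    def __init__(self):
        Activation.instances += 1

    def __call__(self, x):
        # Placeholder math (ReLU), not the real SiLU-and-multiply.
        return [max(v, 0.0) for v in x]


# Pattern of the kind the review flags: a new object on every call.
per_call = lambda x: Activation()(x)

# Alternative: construct once up front, then reuse the same instance.
_shared = Activation()
hoisted = lambda x: _shared(x)
```

If the constructor depends on state that is only available during model setup, the per-call form can also fail at inference time, which matches the initialization concern raised in the review.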
@AlpinDale merged commit e958628 into main on May 1, 2026
1 check failed
@AlpinDale deleted the feat/dsv4-silu-clamp branch on May 2, 2026, 02:26