
Commit 1e3d29b

aymuos15 and ericspod authored
Fix GEGLU docstring: Sigmoid -> GELU (#8696)
## Summary

- Fixed the GEGLU docstring, which incorrectly stated the activation function was Sigmoid.
- The code correctly uses GELU, as specified in the original GEGLU paper.

## Details

- GLU uses Sigmoid: GLU(x) = σ(xW) ⊗ xV
- GEGLU uses GELU: GEGLU(x) = GELU(xW) ⊗ xV

Reference: https://arxiv.org/abs/2002.05202

Signed-off-by: Soumya Snigdha Kundu <soumya_snigdha.kundu@kcl.ac.uk>
Co-authored-by: Eric Kerfoot <17726042+ericspod@users.noreply.github.com>
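The two formulas above differ only in the gating nonlinearity. As a quick illustration, here is a dependency-free Python sketch of the GEGLU gating, using the exact error-function form of GELU; MONAI's actual layer operates on torch tensors, so treat this as a formula check rather than the library implementation:

```python
import math

def gelu(x: float) -> float:
    # Exact GELU: x * Phi(x), where Phi is the standard normal CDF.
    return x * 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def geglu(x: list[float]) -> list[float]:
    # GEGLU(x) = x1 * GELU(x2): split the input in half along the
    # last dimension and gate one half with GELU of the other.
    assert len(x) % 2 == 0, "last dimension must be even"
    half = len(x) // 2
    x1, x2 = x[:half], x[half:]
    return [a * gelu(b) for a, b in zip(x1, x2)]

print(geglu([1.0, 2.0, 0.0, 1.0]))  # first output is 0.0 since GELU(0) = 0
```

Swapping `gelu` for the logistic sigmoid in the last line of `geglu` recovers plain GLU, which is exactly the distinction the docstring fix is about.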
1 parent 342bd7a commit 1e3d29b

File tree

1 file changed: +9 / -1 lines changed


monai/networks/blocks/activation.py

Lines changed: 9 additions & 1 deletion
@@ -168,7 +168,7 @@ class GEGLU(nn.Module):
     r"""Applies the element-wise function:
 
     .. math::
-        \text{GEGLU}(x) = x_1 * \text{Sigmoid}(x_2)
+        \text{GEGLU}(x) = x_1 * \text{GELU}(x_2)
 
     where :math:`x_1` and :math:`x_2` are split from the input tensor along the last dimension.
 
@@ -177,6 +177,14 @@ class GEGLU(nn.Module):
     Shape:
         - Input: :math:`(N, *, 2 * D)`
         - Output: :math:`(N, *, D)`, where `*` means, any number of additional dimensions
+
+    Examples::
+
+        >>> import torch
+        >>> from monai.networks.layers.factories import Act
+        >>> m = Act['geglu']()
+        >>> input = torch.randn(2, 8)  # last dim must be even
+        >>> output = m(input)
     """
 
     def forward(self, input: torch.Tensor):
