We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 15589f3 commit 08777afCopy full SHA for 08777af
1 file changed
examples/models/qwen3_5_moe/model.py
@@ -613,10 +613,6 @@ def forward(
613
for layer in self.layers:
614
x = layer(x, input_pos)
615
x = self.norm(x)
616
- # When no sampling is requested, return the full ``[B, T, V]``
617
- # logits so callers (eval, custom samplers) can inspect every
618
- # position. Otherwise apply the prefill optimization and only
619
- # materialize ``[B, V]`` for the last token.
620
if temperature is None:
621
return self.lm_head(x) # [B, T, V] in model dtype
622
logits = self.lm_head(x[:, -1, :]).float() # [B, V] float32
0 commit comments