fix(gemma3, gemma4): default token_type_ids to zeros for text-only training #45222

Open
jashshah999 wants to merge 1 commit into huggingface:main from jashshah999:fix/gemma-text-only-training
Conversation

@jashshah999
Contributor

Summary

When using Gemma 3 or Gemma 4 for text-only supervised fine-tuning (no images), the forward pass raises a ValueError because token_type_ids / mm_token_type_ids is not provided. This happens because AutoTokenizer does not produce these fields — only the multimodal Processor does.

The fix defaults these fields to all-zeros when token_type_ids / mm_token_type_ids is None during training, instead of raising. With all zeros, is_vision is False everywhere, so the bidirectional vision-mask branch is skipped and a standard causal mask is produced — exactly what text-only input requires.
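The behavior above can be sketched as follows. Note this is an illustrative reconstruction, not the actual Transformers source: the helper name `resolve_token_type_ids` and the assumption that nonzero token types mark image tokens are mine.

```python
import torch

def resolve_token_type_ids(input_ids, token_type_ids=None):
    """Hypothetical sketch of the fix: default a missing
    token_type_ids to all-zeros instead of raising ValueError."""
    if token_type_ids is None:
        # Text-only batch from AutoTokenizer: treat every position as text.
        token_type_ids = torch.zeros_like(input_ids)
    # Nonzero token types mark image tokens (assumption for this sketch).
    # All zeros means no vision tokens, so the bidirectional vision-mask
    # branch is skipped and only the causal mask applies.
    is_vision = token_type_ids != 0
    return token_type_ids, is_vision

input_ids = torch.tensor([[2, 17, 309, 4]])
tt, is_vision = resolve_token_type_ids(input_ids)
print(is_vision.any().item())  # False: causal-mask path only
```

A usage note: with this guard in place, a plain `AutoTokenizer` batch (which never carries `token_type_ids` for these models) flows through training unchanged, while Processor-produced multimodal batches still take the vision branch.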

Changes

  • modeling_gemma4.py / modular_gemma4.py: default mm_token_type_ids to torch.zeros(...) instead of raising ValueError
  • modeling_gemma3.py / modular_gemma3.py: same fix for token_type_ids (same root cause)

Fixes #45200

@github-actions

github-actions bot commented Apr 3, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma3, gemma4



Development

Successfully merging this pull request may close these issues.

[Gemma 4] mm_token_type_ids required for text-only fine-tuning - should default to zeros

1 participant