Fix Gemma 4 quantized per-layer projection loading by spicyneuron · Pull Request #935 · Blaizzy/mlx-vlm

spicyneuron · 2026-04-05T13:10:51Z

While trying to run unsloth/gemma-4-E2B-it-UD-MLX-4bit, I hit a loading error:

ValueError: Unable to quantize model of type <class 'mlx_lm.models.gemma4_text.ScaledLinear'>

After cross-checking this against the Gemma 4 implementation in Transformers (constructor, projection path), I replaced the custom ScaledLinear wrapper used for per_layer_model_projection with a standard bias-free nn.Linear and moved the hidden_size**-0.5 scale into the projection path explicitly.

The math stays the same, but the layer now works with MLX's normal quantization and loading flow.

For reference, here's the same fix on mlx-lm: ml-explore/mlx-lm#1112

Blaizzy

LGTM, thanks!

spicyneuron and others added 4 commits April 5, 2026 21:05

Fix Gemma 4 quantized per-layer projection loading

7d85739

Merge branch 'main' into fix-gemma-4

3194f59

Format

1f9e930

Merge branch 'main' into fix-gemma-4

883c3fc

Blaizzy approved these changes Apr 7, 2026

View reviewed changes

Blaizzy merged commit b2cffea into Blaizzy:main Apr 7, 2026
1 check passed

Chedrian07 mentioned this pull request Apr 8, 2026

bump mlx-lm to 0.31.3 and mlx-vlm to latest main jundot/omlx#675

Closed

5 tasks

spicyneuron deleted the fix-gemma-4 branch April 30, 2026 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Gemma 4 quantized per-layer projection loading#935

Fix Gemma 4 quantized per-layer projection loading#935
Blaizzy merged 4 commits into
Blaizzy:mainfrom
spicyneuron:fix-gemma-4

spicyneuron commented Apr 5, 2026

Uh oh!

Blaizzy left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

spicyneuron commented Apr 5, 2026

Uh oh!

Blaizzy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants