fix: use alpha/rank scaling in LoRaLayer (standard LoRA convention) by kikoncuo · Pull Request #846 · Blaizzy/mlx-vlm

kikoncuo · 2026-03-21T19:26:38Z

Summary

LoRaLayer uses raw alpha as the scaling factor instead of alpha / rank. This makes the LoRA contribution rank times larger than the standard convention used by PEFT, the original LoRA paper, and mlx-lm.

The bug

# Current (lora.py line 38):
return y + (self.alpha * lora_update)     # scale = 16

# Should be:
return y + (self.scale * lora_update)     # scale = alpha/rank = 2

Proof

With alpha=16, rank=8 and deterministic weights:

from mlx_vlm.trainer.lora import LoRaLayer

linear = nn.Linear(4, 4)
lora = LoRaLayer(linear, rank=8, alpha=16.0)
lora.A = mx.ones((4, 8))
lora.B = mx.ones((8, 4))
x = mx.ones((1, 4))

base = linear(x)
actual = lora(x)
contribution = (actual - base)[0, 0].item()
# Current:  512.0  (alpha * x @ A @ B = 16 * 32)
# Expected:  64.0  (alpha/rank * x @ A @ B = 2 * 32)
# Ratio: 8x too large

Fix

Store self.scale = alpha / rank in __init__ (instead of self.alpha = alpha)
Use self.scale in __call__ and replace_lora_with_linear

Tests added

4 new tests in test_trainer_utils.py:

test_scale_is_alpha_over_rank — verifies scale = alpha/rank
test_scale_with_rank_equals_alpha — verifies alpha=rank gives scale=1
test_forward_scaling_matches_peft — verifies forward pass matches (alpha/rank) * delta
test_default_alpha_rank_gives_2x — verifies default settings give 2x, not 16x

All 8 tests pass.

References

Original LoRA paper: "We then scale ΔWx by α/r"
PEFT: self.scaling = lora_alpha / r
mlx-lm: uses alpha / rank

Fixes #845

LoRaLayer used raw `alpha` as the scaling factor instead of `alpha / rank`. With the default alpha=16, rank=8, this made the LoRA contribution 8x larger than PEFT, the original LoRA paper, and mlx-lm. Before: scale = alpha = 16.0 After: scale = alpha / rank = 2.0 Also fixes replace_lora_with_linear to use the same corrected scale. Added tests verifying: - scale = alpha / rank - Forward pass produces (alpha/rank) * (x @ A @ B) - Default settings give 2x scaling, not 16x Fixes Blaizzy#845

Copilot

Pull request overview

Fixes LoRA scaling in LoRaLayer to follow the standard LoRA convention (alpha / rank) so LoRA updates match PEFT / LoRA paper expectations and reduce unintended amplification.

Changes:

Compute and store self.scale = alpha / rank in LoRaLayer and use it in the forward pass.
Update LoRA weight-merge logic to use scale when producing the merged delta.
Add unit tests validating scaling math and forward-pass contribution.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`mlx_vlm/trainer/lora.py`	Switches from raw `alpha` to `alpha/rank` scaling in forward and merge code.
`mlx_vlm/tests/test_trainer_utils.py`	Adds tests to validate LoRA scaling behavior and expected contribution size.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Goekdeniz-Guelmez · 2026-03-26T16:03:47Z

@kikoncuo LGTM, did you run the test? might have to change due to the missing B=0 test

Verifies that when B is zeros (default init), the LoRA layer output equals the base linear layer output exactly (no LoRA contribution). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

kikoncuo · 2026-03-26T20:24:08Z

I ran the existing tests, all 8 passed.
I added a B=0 test as well, the LoRA layer just passes through the original linear output unchanged. No LoRA contribution at all cause B starts at zero, so x @ A @ B is always zero regardless of the scaling factor. Confirms the fix doesn't break it.

Blaizzy · 2026-03-27T19:09:24Z

@Goekdeniz-Guelmez need your sign off here before merging

Goekdeniz-Guelmez · 2026-03-27T21:25:48Z

I think it would be better to move the new tests inside test_trainer.py file and not create a new test file. Other then that LGTM

Move TestLoRaScaling class from test_trainer_utils.py into test_trainer.py as suggested in review, and revert test_trainer_utils.py to its original state. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

kikoncuo · 2026-03-29T20:19:56Z

Test moved!

Goekdeniz-Guelmez · 2026-03-30T11:52:27Z

@Blaizzy I think this can be merged now!

kikoncuo · 2026-04-09T08:19:36Z

@Goekdeniz-Guelmez @Blaizzy any progress here? do you need anything from me?

Blaizzy

LGTM, thanks!

Copilot AI review requested due to automatic review settings March 21, 2026 19:26

Copilot started reviewing on behalf of kikoncuo March 21, 2026 19:27 View session

Copilot AI reviewed Mar 21, 2026

View reviewed changes

Comment thread mlx_vlm/trainer/lora.py

Comment thread mlx_vlm/tests/test_trainer_utils.py Outdated

test: add B=0 initialization test for LoRaLayer

3013a97

Verifies that when B is zeros (default init), the LoRA layer output equals the base linear layer output exactly (no LoRA contribution). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

refactor: move LoRA scaling tests into test_trainer.py

94359c6

Move TestLoRaScaling class from test_trainer_utils.py into test_trainer.py as suggested in review, and revert test_trainer_utils.py to its original state. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge branch 'main' into fix/lora-alpha-scaling

a6e0fec

Blaizzy approved these changes Apr 18, 2026

View reviewed changes

Blaizzy merged commit 6b0ad8f into Blaizzy:main Apr 18, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: use alpha/rank scaling in LoRaLayer (standard LoRA convention)#846

fix: use alpha/rank scaling in LoRaLayer (standard LoRA convention)#846
Blaizzy merged 4 commits into
Blaizzy:mainfrom
kikoncuo:fix/lora-alpha-scaling

kikoncuo commented Mar 21, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Goekdeniz-Guelmez commented Mar 26, 2026

Uh oh!

kikoncuo commented Mar 26, 2026 •

edited

Loading

Uh oh!

Blaizzy commented Mar 27, 2026

Uh oh!

Goekdeniz-Guelmez commented Mar 27, 2026

Uh oh!

kikoncuo commented Mar 29, 2026

Uh oh!

Goekdeniz-Guelmez commented Mar 30, 2026

Uh oh!

kikoncuo commented Apr 9, 2026

Uh oh!

Blaizzy left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

kikoncuo commented Mar 21, 2026

Summary

The bug

Proof

Fix

Tests added

References

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Goekdeniz-Guelmez commented Mar 26, 2026

Uh oh!

kikoncuo commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Blaizzy commented Mar 27, 2026

Uh oh!

Goekdeniz-Guelmez commented Mar 27, 2026

Uh oh!

kikoncuo commented Mar 29, 2026

Uh oh!

Goekdeniz-Guelmez commented Mar 30, 2026

Uh oh!

kikoncuo commented Apr 9, 2026

Uh oh!

Blaizzy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kikoncuo commented Mar 26, 2026 •

edited

Loading