MiSS update #3194
Conversation
I use this model 'unsloth/Llama-3.2-3B'; maybe this is wrong.
So IIUC, this change should not influence the results in any way; it's just a change for better readability. Therefore, we should expect the results to be identical. To test this, don't change the base model: we always want to use the same one, or else the results are not comparable. Instead, run one of the existing experiments and then re-run the same experiment with your changes applied on top:
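For example, using the default MiSS experiment that is also used for the benchmark below, run the identical experiment once per branch:

```sh
# on the main branch:
python run.py -v experiments/miss/llama-3.2-3B-default/
# then again with this PR's changes applied on top:
python run.py -v experiments/miss/llama-3.2-3B-default/
```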
Sorry, I don't have permission/license for Llama-3.2-3B.
Ouch, I thought you pretty much get auto permission if you request it. I can't check right now, but I'll check next week and let you know if I see any difference. LMK if there is any setting in particular that I should test. Meanwhile, please revert the change to the default training params. If you want to test a different model, you can always create a new experiment, e.g.:
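For instance (the new directory name here is hypothetical; the point is to copy an existing experiment as a starting point rather than edit the default one):

```sh
# hypothetical new experiment directory, copied from the default MiSS config:
cp -r experiments/miss/llama-3.2-3B-default experiments/miss/llama-3.2-3B-unsloth
# then edit the copied experiment's config to point at the other base model
```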
I've added the code for converting MiSS to LoRA. Please take a look.
BenjaminBossan
left a comment
Thanks for the update. I ran the MetaMathQA benchmark on my machine with the main branch and with your changes, using `python run.py -v experiments/miss/llama-3.2-3B-default/` (i.e. the default MiSS setting). The train loss is basically identical:
Max memory for both was identical, and train times were 833 vs 856 sec, which is reasonably close. So overall, I think this shows that the results stay the same. LMK if I should test something else.
As for the conversion, thanks a lot for adding the MiSS-specific path. I converted the trained MiSS model from the benchmark to LoRA using a relatively small rank of 32 and it got a test accuracy of 50.3%, so basically the same as the MiSS adapter. That's quite a nice result.
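For context, the generic recipe behind such a conversion is truncated SVD on the adapter's delta weight: keep the top-r singular components and split them into LoRA's B @ A form. A minimal plain-torch sketch of that idea (not the PR's actual code in src/peft/tuners/lora/conversion.py):

```python
import torch

def delta_to_lora(delta_weight: torch.Tensor, r: int) -> tuple[torch.Tensor, torch.Tensor]:
    """Factor a (out_features, in_features) delta weight into LoRA A/B via truncated SVD."""
    U, S, Vh = torch.linalg.svd(delta_weight, full_matrices=False)
    sqrt_s = torch.sqrt(S[:r])
    lora_B = U[:, :r] * sqrt_s             # (out_features, r)
    lora_A = sqrt_s.unsqueeze(1) * Vh[:r]  # (r, in_features)
    return lora_A, lora_B

# If the delta's true rank is <= r, the factorization is exact up to float error:
delta = torch.randn(64, 16) @ torch.randn(16, 128)  # rank <= 16
lora_A, lora_B = delta_to_lora(delta, r=32)
assert torch.allclose(lora_B @ lora_A, delta, atol=1e-3)
```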
Since there is this special MiSS conversion path now, we should add a unit test for it. The easiest way would be to take this LoKr test, copy it, and replace the lokr_model with a MiSS model.
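Until then, here is a rough, self-contained sketch of the property such a test pins down (plain torch only, so it runs as-is; the actual test should start from the LoKr test above and exercise the real MiSS model and conversion entry point):

```python
# Standalone sketch of the test's shape: a rank-r delta converted to LoRA
# factors must leave the layer's forward output unchanged. The real unit
# test would build a MiSS model and call the PR's conversion path instead.
import torch

def test_conversion_preserves_forward():
    torch.manual_seed(0)
    out_f, in_f, r = 16, 32, 4
    base = torch.randn(out_f, in_f)
    delta = torch.randn(out_f, r) @ torch.randn(r, in_f)  # rank-r delta

    # exact rank-r factors via truncated SVD
    U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
    lora_B, lora_A = U[:, :r] * S[:r], Vh[:r]

    x = torch.randn(8, in_f)
    y_before = x @ (base + delta).T
    y_after = x @ (base + lora_B @ lora_A).T
    assert torch.allclose(y_before, y_after, atol=1e-4)
```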
done
@Joluck Thanks for adding the tests. Ruff is complaining about the use of the variable name
What name do you think would be suitable?
Doesn't really matter to me, perhaps
done
@Joluck Could you please run `make style`?
Ruff is still complaining; the required changes are:

```diff
modified   src/peft/tuners/lora/conversion.py
@@ -51,8 +51,8 @@ def _convert_miss_module_to_lora(
 ) -> tuple[torch.Tensor, torch.Tensor, int]:
     """Convert a single MiSS layer to LoRA A and B matrices.
 
-    For standard and mini modes, the MiSS forward pass (reshape+sum @ miss) is already a rank-r
-    factorization, so the exact factors are returned directly without SVD.
+    For standard and mini modes, the MiSS forward pass (reshape+sum @ miss) is already a rank-r factorization, so the
+    exact factors are returned directly without SVD.
 
     For bat mode, the delta weight depends on the base weight, so SVD is used.
     """
```

```diff
modified   src/peft/tuners/miss/layer.py
@@ -313,8 +313,12 @@ class MissLinear(nn.Module, MissLayer):
             aligned_size = n_blocks * r
             W_aligned = orig_weight[:, :aligned_size].reshape(-1, n_blocks, r).permute(1, 2, 0)
-            orig_weight[:, :aligned_size] = (W_aligned + sign * miss_B).permute(2, 0, 1).reshape(*orig_weight[:, :aligned_size].shape)
-            orig_weight[:, aligned_size:] = orig_weight[:, aligned_size:] + sign * miss_B.transpose(0, 1)[:, :remainder]
+            orig_weight[:, :aligned_size] = (
+                (W_aligned + sign * miss_B).permute(2, 0, 1).reshape(*orig_weight[:, :aligned_size].shape)
+            )
+            orig_weight[:, aligned_size:] = (
+                orig_weight[:, aligned_size:] + sign * miss_B.transpose(0, 1)[:, :remainder]
+            )
 
             output_tensor = orig_weight
         else:
             W_blocks = orig_weight.reshape(-1, orig_weight.size(1) // r, r).permute(1, 2, 0)
```
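As a side note on the second hunk: the `permute(2, 0, 1).reshape(...)` is the exact inverse of the `reshape(-1, n_blocks, r).permute(1, 2, 0)` used to build W_aligned, so the blocked update maps back to the original layout without mixing elements. A quick standalone check (shapes chosen arbitrarily):

```python
import torch

out_features, n_blocks, r = 8, 3, 4
aligned_size = n_blocks * r
W = torch.randn(out_features, aligned_size)

# forward mapping from the diff: (out, n_blocks * r) -> (n_blocks, r, out)
W_aligned = W.reshape(-1, n_blocks, r).permute(1, 2, 0)
# inverse mapping from the diff: back to (out, n_blocks * r)
W_back = W_aligned.permute(2, 0, 1).reshape(out_features, aligned_size)

assert torch.equal(W_back, W)  # exact round trip
```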
I don't know why there are so many changes when I run `make style`. Does it require a specific version?
BenjaminBossan
left a comment
Thanks for updating MiSS and adding the LoRA conversion code, LGTM.

The optimized version improves readability, but when I tested it with method_comparison, the results were incorrect. Could you help me test it?