Fix OFT Conv2d weight reshaping for non-square kernels by Chessing234 · Pull Request #3150 · huggingface/peft

Chessing234 · 2026-04-12T06:06:18Z

Summary

The OFT Conv2d layer computes the filter dimension and reshapes weights using kernel_size[0] * kernel_size[0], squaring the kernel height instead of computing height × width (kernel_size[0] * kernel_size[1]). Similarly, the 4D reshape uses kernel_size[0], kernel_size[0] instead of kernel_size[0], kernel_size[1].

This works by accident for square kernels (3×3, 5×5, etc.) but produces wrong dimensions for any asymmetric kernel (e.g., 3×1, 1×7, 3×5), causing RuntimeError: shape mismatch during forward/merge/unmerge.
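The arithmetic behind the failure can be checked without PEFT or torch. The sketch below is illustrative only: `filter_dim_buggy`, `filter_dim_fixed`, and `fits_weight` are hypothetical helpers, not functions from `oft/layer.py`. It assumes the standard Conv2d weight layout `(out_channels, in_channels, kernel_height, kernel_width)` with `groups=1` and compares element counts for the two formulas.

```python
from math import prod


def filter_dim_buggy(in_channels, kernel_size):
    # Pre-fix arithmetic as described in the PR: the kernel height is squared.
    return in_channels * kernel_size[0] * kernel_size[0]


def filter_dim_fixed(in_channels, kernel_size):
    # Post-fix arithmetic: height times width.
    return in_channels * kernel_size[0] * kernel_size[1]


def fits_weight(out_channels, in_channels, kernel_size, filter_dim):
    # A 4D conv weight can only be viewed as (out_channels, filter_dim)
    # if both shapes hold the same number of elements.
    weight_shape = (out_channels, in_channels, *kernel_size)
    return prod(weight_shape) == out_channels * filter_dim


if __name__ == "__main__":
    for ks in [(3, 3), (3, 1), (1, 7), (3, 5)]:
        buggy_ok = fits_weight(16, 3, ks, filter_dim_buggy(3, ks))
        fixed_ok = fits_weight(16, 3, ks, filter_dim_fixed(3, ks))
        print(ks, "buggy fits:", buggy_ok, "fixed fits:", fixed_ok)
```

Of the four kernel sizes in the loop, only the square 3×3 kernel passes the check with the buggy formula; all four pass with the fixed one.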

Fix: Replace all 5 occurrences of the second kernel_size[0] with kernel_size[1] in oft/layer.py.

Note: The same pattern exists in boft/layer.py and hra/layer.py. Happy to extend this fix to those files if desired.

Test plan

Existing OFT tests with square kernels should continue to pass
OFT with a non-square Conv2d kernel (e.g., nn.Conv2d(3, 16, (3, 1))) should now work

🤖 Generated with Claude Code

All weight reshape operations in the OFT Conv2d layer use kernel_size[0] * kernel_size[0], squaring the height dimension instead of computing height * width. This gives wrong filter dimensions for non-square kernels (e.g. 3x1, 1x7), causing shape mismatches in forward/merge/unmerge. It works by accident for square kernels.

Note: the same pattern exists in boft/layer.py and hra/layer.py.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

BenjaminBossan · 2026-04-13T12:27:32Z

@Chessing234 Thanks for the PR. Could you please extend the unit tests to cover this case? I.e. add a test case that fails with the current main branch but is fixed with your branch? It should be fine to add a one-off test for this in test_custom_models.py inside the TestPeftCustomModel class.

> The same pattern exists in boft/layer.py and hra/layer.py. Happy to extend this fix to those files if desired.

This would be really appreciated, better to have that in one PR than multiple smaller ones. The unit test can be parametrized to cover these PEFT methods too.

Chessing234 · 2026-04-14T02:19:00Z

Added a standalone test test_oft_conv2d_non_square_kernel in TestPeftCustomModel: it builds a Conv2d with kernel (3, 5), wraps it in OFT, runs a forward pass, then calls merge_and_unload and checks the merged output matches. Fails on main (shape mismatch from the kernel_size[0] ** 2 miscalculation); passes with this PR.
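The merge path this test exercises depends on flattening the 4D conv weight to 2D and reshaping it back. A stdlib-only sketch of that shape round trip follows, using the (3, 5) kernel and 5 input channels from the test; the 16 output channels are an arbitrary illustrative choice. `flatten_filters` and `unflatten_filters` are hypothetical names, not PEFT functions, and the sketch raises `ValueError` where torch would raise `RuntimeError`.

```python
from math import prod


def flatten_filters(weight_shape):
    # View (out, in, kh, kw) as (out, in * kh * kw).
    out_channels, in_channels, kh, kw = weight_shape
    return (out_channels, in_channels * kh * kw)


def unflatten_filters(flat_shape, in_channels, kernel_size, use_bug=False):
    # Reshape (out, filter_dim) back to 4D. With use_bug=True the width
    # is replaced by the height, mimicking the pre-fix reshape.
    out_channels, filter_dim = flat_shape
    kh, kw = kernel_size
    target = (out_channels, in_channels, kh, kh if use_bug else kw)
    if prod(target) != out_channels * filter_dim:
        raise ValueError(f"cannot reshape {flat_shape} into {target}")
    return target


if __name__ == "__main__":
    original = (16, 5, 3, 5)  # out_channels, in_channels, kh, kw
    flat = flatten_filters(original)
    print(flat, "->", unflatten_filters(flat, 5, (3, 5)))
    try:
        unflatten_filters(flat, 5, (3, 5), use_bug=True)
    except ValueError as exc:
        print("buggy reshape fails:", exc)
```

For a square kernel both code paths return the same shape, which is why the existing tests did not catch the problem.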

BenjaminBossan

Thanks for adding the test, but it's in the wrong place. Please check the comment. You can ensure that the test passes by running:

pytest tests/test_custom_models.py -k test_oft_conv2d_non_square_kernel

Also, as mentioned above, it would be much better to have the BOFT and HRA fixes in the same PR, so please add those as well.

BenjaminBossan · 2026-04-14T12:30:10Z

            "base_model.model.lin0.oft_R.adapter1.weight",
        )

+    def test_oft_conv2d_non_square_kernel(self):


This test is in the wrong place. Please move it to the end of TestPeftCustomModel.

BenjaminBossan · 2026-04-14T12:31:53Z

+        peft_model = get_peft_model(model, config)
+
+        X = torch.arange(5 * 5 * 3 * 5, dtype=torch.float, device=self.torch_device).reshape(5, 5, 3, 5)
+        output = peft_model(X)


Let's add a comment: "# Ensure that the forward pass does not raise". This is what we actually want to test. Equality of non-merged vs merged is nice to test too, but not the critical issue.

Add OFT Conv2d non-square kernel test

247029a

BenjaminBossan requested changes Apr 14, 2026

Chessing234 wants to merge 2 commits into huggingface:main from Chessing234:fix/oft-conv2d-nonsquare-kernel
