Implement Prefix Tuning for Gemma models. by copybara-service[bot] · Pull Request #631 · google-deepmind/gemma

copybara-service · 2026-04-22T21:51:18Z

Implement Prefix Tuning for Gemma models.

google-cla · 2026-04-22T21:51:35Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

PiperOrigin-RevId: 899479928

Applying prefix tuning to Gemma 4 failed because the model uses a different head dimension for layers that use sliding window attention. Prefix tuning initializes only a single projection matrix that is shared by all layers, so this led to a shape mismatch. The solution is to "overprovision" the matrix and then slice the prefix down to size if the layer is smaller. This is not quite as parameter-efficient as it could be, but the overhead shouldn't be too large. For robustness, we also skip layers for which the matrix is underprovisioned, but we warn about it and raise an error if all layers are skipped. Alternatively, we could implement one projection per layer, each with the right size, like in google-deepmind/gemma#631. However, this would be a big refactor and very hard to make backwards compatible with existing checkpoints, so going with the less efficient solution is preferable. This PR also contains an independent, single-line fix to a prefix tuning test that was referencing a nonexistent model.
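The overprovision-and-slice approach described above can be sketched roughly as follows. This is an illustrative sketch in plain Python, with nested lists standing in for tensors; the function name `slice_prefix_per_layer`, its arguments, and the example dimensions are hypothetical and are not taken from PEFT or the Gemma codebase.

```python
import warnings


def slice_prefix_per_layer(prefix, layer_head_dims):
    """Fit one shared prefix to layers with different head dimensions.

    prefix: list of virtual-token vectors, each of length `provisioned_dim`
        (assumed here to be sized for the largest head dimension).
    layer_head_dims: head dimension expected by each layer (hypothetical values).
    Returns one entry per layer: the sliced prefix, or None if the layer is skipped.
    """
    provisioned_dim = len(prefix[0])
    per_layer = []
    for layer_idx, head_dim in enumerate(layer_head_dims):
        if head_dim > provisioned_dim:
            # Underprovisioned: skip this layer, but tell the user about it.
            warnings.warn(
                f"Skipping layer {layer_idx}: head_dim {head_dim} exceeds "
                f"provisioned prefix dim {provisioned_dim}."
            )
            per_layer.append(None)
        else:
            # Overprovisioned or exact fit: keep only the leading head_dim entries.
            per_layer.append([vec[:head_dim] for vec in prefix])
    if all(p is None for p in per_layer):
        raise ValueError("Prefix is too small for every layer; nothing to tune.")
    return per_layer


# Two virtual tokens, provisioned for the largest head dimension (4 here).
prefix = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.6, 0.7, 0.8]]
sliced = slice_prefix_per_layer(prefix, layer_head_dims=[4, 2, 4])
print([len(p[0]) for p in sliced])  # prints [4, 2, 4]
```

The trade-off matches the one named in the description: the entries sliced away for the smaller layers are still stored and trained, which costs some parameter efficiency but keeps a single shared matrix.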

copybara-service Bot force-pushed the test_899479928 branch from dd303ca to 5eaee92 · April 23, 2026 14:28

copybara-service Bot changed the title from ~~Prefix tuning support for Gemma models.~~ to Implement Prefix Tuning for Gemma models. · Apr 23, 2026

copybara-service Bot force-pushed the test_899479928 branch 3 times, most recently from 1e518ad to 9a0f5ed · April 23, 2026 19:22

Implement Prefix Tuning for Gemma models.

7c83e75

PiperOrigin-RevId: 899479928

copybara-service Bot force-pushed the test_899479928 branch from 9a0f5ed to 7c83e75 · April 28, 2026 17:57

stharrold mentioned this pull request Apr 29, 2026

PrefixTuningConfig fails on google/gemma-4-e2b-it: tensor expand size mismatch in attention forward (peft 0.19.1, transformers 5.6.2) huggingface/peft#3201

Closed

BenjaminBossan mentioned this pull request Apr 30, 2026

FIX Error when prefix tuning Gemma 4 huggingface/peft#3205

Merged

copybara-service[bot] wants to merge 1 commit into main from test_899479928
