Fix LoRA fine-tuning for Gemma4 models by copybara-service[bot] · Pull Request #644 · google-deepmind/gemma

copybara-service · 2026-05-08T13:32:15Z

Fix LoRA fine-tuning for Gemma4 models

google-cla · 2026-05-08T13:32:32Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

darknecrocities

This is a substantial and well-executed change. It makes checkpoint restore more robust to structural drift in the parameter tree caused by LoRA wrapping, multimodal extensions, and nn.share_scope variations. The introduction of _needs_reconciliation and _reconcile_tree is particularly valuable, as it isolates the structural repair logic without affecting the default Gemma3 / legacy restore path.
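The gate-then-repair pattern described above can be sketched roughly as follows. This is a minimal illustration on plain nested dicts, not the code from the PR: the function names without a leading underscore, the `restore` wrapper, and the rule "an empty dict is a stub" are assumptions made for the example.

```python
from typing import Any


def needs_reconciliation(tree: Any) -> bool:
    """Returns True if any nested dict is empty (treated here as a LoRA stub)."""
    if not isinstance(tree, dict):
        return False
    if not tree:
        return True
    return any(needs_reconciliation(v) for v in tree.values())


def reconcile_tree(tree: Any) -> Any:
    """Returns a copy of `tree` with empty dicts removed, bottom-up."""
    if not isinstance(tree, dict):
        return tree
    out = {}
    for key, value in tree.items():
        value = reconcile_tree(value)
        if isinstance(value, dict) and not value:
            continue  # Drop the stub (or a parent emptied by stub removal).
        out[key] = value
    return out


def restore(tree: Any) -> Any:
    """Hypothetical entry point: only repairs trees that need it."""
    if not needs_reconciliation(tree):
        return tree  # Default path: the input object is returned untouched.
    return reconcile_tree(tree)


params = {'layer_0': {'attn': {'w': 1, 'lora': {}}}, 'embedder': {'w': 2}}
repaired = restore(params)
```

Because the check runs first, a tree with no stubs is returned as the same object, which mirrors the claim that the legacy restore path is unaffected.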

The multimodal parameter handling is also more systematic now, with consistent support for both vision and audio encoders and their associated projection/norm keys. Consolidating these into _MM_TOP_LEVEL_KEYS and _MM_EMBEDDER_KEYS improves maintainability and reduces the risk of silent parameter drops when new modalities are introduced.
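As a rough illustration of why centralising these keys helps, the sketch below filters multimodal parameters out of a nested dict using two tuples. The constant names come from the comment above, but their contents, the `'embedder'` key, and the `strip_multimodal` helper are placeholders invented for this example, not the values used in the PR.

```python
from typing import Any

# Placeholder contents: the real key names are not shown in this conversation.
_MM_TOP_LEVEL_KEYS = ('vision_encoder', 'audio_encoder')
_MM_EMBEDDER_KEYS = (
    'vision_input_projection',
    'vision_norm',
    'audio_input_projection',
    'audio_norm',
)


def strip_multimodal(
    params: dict[str, Any], embedder_key: str = 'embedder'
) -> dict[str, Any]:
    """Returns a copy of `params` without multimodal entries."""
    out = {k: v for k, v in params.items() if k not in _MM_TOP_LEVEL_KEYS}
    embedder = out.get(embedder_key)
    if isinstance(embedder, dict):
        out[embedder_key] = {
            k: v for k, v in embedder.items() if k not in _MM_EMBEDDER_KEYS
        }
    return out


params = {
    'vision_encoder': {'w': 0},
    'audio_encoder': {'w': 1},
    'embedder': {'input_embedding': 2, 'vision_norm': 3, 'audio_norm': 4},
    'layer_0': {'w': 5},
}
text_only = strip_multimodal(params)
```

With this layout, supporting a new modality would mean extending the two tuples rather than editing each place that inspects the tree.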

The added LoRA support in _SUPPORTED_MODULES and the expanded test coverage are strong improvements, especially given the complexity introduced by multiple Einsum variants across Gemma3n, Gemma4, and nano layers. The reconciliation tests are particularly thorough and effectively validate both stub removal and leaf-vs-dict normalization behaviors.
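The conversation does not show how leaf-vs-dict normalization is implemented. One possible reading, sketched here under that caveat, is that a restored value is wrapped or unwrapped so its nesting matches a target structure. The `normalize_to_target` helper and the single-key rule are assumptions for illustration only.

```python
from typing import Any


def normalize_to_target(restored: Any, target: Any) -> Any:
    """Wraps or unwraps single-entry dicts so `restored` nests like `target`."""
    target_is_dict = isinstance(target, dict)
    restored_is_dict = isinstance(restored, dict)
    if target_is_dict and restored_is_dict:
        # Recurse on shared keys; keys unknown to the target pass through.
        return {
            k: normalize_to_target(v, target[k]) if k in target else v
            for k, v in restored.items()
        }
    if target_is_dict:
        # Target expects a dict but the checkpoint holds a bare leaf.
        if len(target) != 1:
            raise ValueError('Cannot wrap a leaf into a multi-key dict.')
        ((key, sub_target),) = target.items()
        return {key: normalize_to_target(restored, sub_target)}
    if restored_is_dict:
        # Target expects a leaf but the checkpoint holds a one-entry dict.
        if len(restored) != 1:
            raise ValueError('Cannot unwrap a multi-key dict into a leaf.')
        (value,) = restored.values()
        return normalize_to_target(value, target)
    return restored


wrapped = normalize_to_target({'attn': 5}, {'attn': {'w': 0}})
unwrapped = normalize_to_target({'attn': {'w': 5}}, {'attn': 0})
```

Ambiguous cases raise an error instead of guessing, so a structural mismatch cannot pass silently.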

A few minor considerations. First, _needs_reconciliation performs recursive structural checks that may become expensive for very large parameter trees; if this path is hit frequently, it may be worth benchmarking it or caching intermediate structural signatures. Second, the reconciliation logic assumes that an empty dict always represents a LoRA stub. That holds for current usage, but a more explicit tagging mechanism would avoid accidental false positives if other empty scopes are introduced in the future.
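The explicit tagging idea could look something like the sketch below, where a stub is marked with a sentinel key instead of being recognised by emptiness. The `LORA_STUB_TAG` name and both helpers are hypothetical and are not part of the PR.

```python
from typing import Any

# Hypothetical marker key; nothing in the PR defines this name.
LORA_STUB_TAG = '__lora_stub__'


def is_lora_stub(node: Any) -> bool:
    """True only for dicts that carry the explicit stub marker."""
    return isinstance(node, dict) and node.get(LORA_STUB_TAG) is True


def drop_tagged_stubs(tree: Any) -> Any:
    """Returns a copy of `tree` with tagged stubs removed."""
    if not isinstance(tree, dict):
        return tree
    return {
        k: drop_tagged_stubs(v) for k, v in tree.items() if not is_lora_stub(v)
    }


params = {
    'attn': {'w': 1, 'lora': {LORA_STUB_TAG: True}},
    'empty_scope': {},  # Not a LoRA stub, so it must survive.
}
cleaned = drop_tagged_stubs(params)
```

Under this scheme an untagged empty scope is preserved, which is the false positive the comment above is concerned about.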

PiperOrigin-RevId: 933421530

copybara-service Bot force-pushed the test_911763673 branch 2 times, most recently from 197cb16 to 6209b2e · May 12, 2026 02:00

darknecrocities reviewed May 25, 2026


copybara-service Bot force-pushed the test_911763673 branch from 6209b2e to d8ce711 · June 17, 2026 01:53

copybara-service Bot changed the title ~~Internal~~ Fix LoRA fine-tuning for Gemma4 models Jun 17, 2026

Fix LoRA fine-tuning for Gemma4 models

05652d5

PiperOrigin-RevId: 933421530

copybara-service Bot force-pushed the test_911763673 branch from d8ce711 to 05652d5 · June 17, 2026 02:02

copybara-service Bot merged commit 05652d5 into main Jun 17, 2026

copybara-service Bot deleted the test_911763673 branch June 17, 2026 02:02

copybara-service[bot] merged 1 commit into main from test_911763673
