Extend LoRA for Gemma4 by RexBearIU · Pull Request #3969 · AI-Hypercomputer/maxtext

RexBearIU · 2026-05-22T07:31:35Z

Description

This PR extends the recent LoRA support to accurately target and process Gemma 4 architectures (including MoE).

Gemma 4 introduces complex nested structures (like scanned_blocks and layers_remainder) and unique chat template behaviors (such as the <|channel>thought block) that are incompatible with standard LoRA targeting and data processing. Furthermore, MoE models require dynamic metadata synchronization during forward passes, which aggressive NNX graph caching breaks.
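As a rough illustration of why nested containers complicate LoRA targeting, the sketch below selects module paths with a regular expression. Only the container names scanned_blocks and layers_remainder come from the description above; the example paths, the layer naming, and both patterns are hypothetical and are not taken from lora_module_path.yml.

```python
import re

# Hypothetical module paths; the real Gemma 4 parameter tree may differ.
MODULE_PATHS = [
    "decoder/layers/self_attention/query",
    "decoder/scanned_blocks/layers_0/self_attention/query",
    "decoder/layers_remainder/layers_1/self_attention/value",
    "decoder/scanned_blocks/layers_0/mlp/wi_0",
]

# A flat pattern that assumes attention sits directly under decoder/layers.
FLAT_PATTERN = re.compile(r"decoder/layers/self_attention/(query|key|value|out)")

# A pattern that also accepts the nested containers named in the description.
NESTED_PATTERN = re.compile(
    r"decoder/(layers|(scanned_blocks|layers_remainder)/layers_\d+)"
    r"/self_attention/(query|key|value|out)"
)


def select_targets(pattern, paths):
    """Return the paths that the pattern matches in full."""
    return [p for p in paths if pattern.fullmatch(p)]


flat_targets = select_targets(FLAT_PATTERN, MODULE_PATHS)
nested_targets = select_targets(NESTED_PATTERN, MODULE_PATHS)
```

In this toy tree the flat pattern misses every attention projection that lives under a nested container, while the nested pattern picks them up and still leaves the MLP weight untouched.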

This PR addresses these challenges by:

Adding accurate regex mapping for Gemma 4 standard and MoE LoRA targets in lora_module_path.yml.
Implementing a thought channel bypass in input_pipeline_utils.py to prevent validation failures when the generation prompt includes the <|channel>thought block.
Dynamically disabling NNX graph caching in train_sft.py specifically for MoE models (where experts > 1) to allow necessary metadata synchronization.
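The third item can be pictured as a small predicate on the model configuration. This is a minimal sketch under stated assumptions: ModelConfig, its num_experts field, and use_graph_caching are hypothetical names chosen for illustration, and the sketch does not show how train_sft.py actually toggles NNX graph caching.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Hypothetical stand-in for the training config; field names are assumptions."""

    model_name: str
    num_experts: int = 1


def use_graph_caching(config: ModelConfig) -> bool:
    """Return False for MoE models (more than one expert), True otherwise."""
    return config.num_experts <= 1


dense = ModelConfig(model_name="dense-example", num_experts=1)
moe = ModelConfig(model_name="moe-example", num_experts=8)

dense_caching = use_graph_caching(dense)  # caching stays enabled for dense models
moe_caching = use_graph_caching(moe)      # caching is turned off for MoE models
```

Keeping the decision in one predicate means dense models retain the cached fast path, and only configurations with more than one expert pay the cost of retracing.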

Tests

Added unit tests for the Gemma 4 tokenizer bypass in tests/post_training/unit/sft_data_processing_test.py (test_tokenizer_gemma4_thought_channel_bypass).
Verified caching behavior changes by running Gemma-4 MoE LoRA tuning on TPU.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

codecov · 2026-05-22T07:37:22Z

Codecov Report

❌ Patch coverage is 80.00000% with 2 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/input_pipeline/input_pipeline_utils.py	80.00%	1 Missing and 1 partial ⚠️


SurbhiJainUSC · 2026-05-27T16:25:44Z

  def test_tokenizer_wo_generation_prompt(self):
    verify_chat_template_generation_prompt_logic(self.llama2_tokenizer)

+  def test_tokenizer_gemma4_thought_channel_bypass(self):


This test expects to not fail with TemplateError or ValueError. Can you add an assertion for this so that it is readable what this test actually verifies?

I updated verify_chat_template_generation_prompt_logic to return True on success, and wrapped all three test cases in explicit self.assertTrue() assertions.

This cleanly verifies that validation succeeds and keeps all tests uniform. All tests pass successfully!
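The pattern described in this reply, a validator that returns True on success and tests that assert on that return value, might look roughly like the sketch below. The helper verify_prompt_logic and its string-based check are hypothetical simplifications; they are not the real verify_chat_template_generation_prompt_logic, which operates on a tokenizer.

```python
import unittest


def verify_prompt_logic(prompt_with_gen: str, prompt_without_gen: str) -> bool:
    """Hypothetical validator: returns True on success, raises ValueError otherwise."""
    if not prompt_with_gen.startswith(prompt_without_gen):
        raise ValueError("generation prompt does not extend the base prompt")
    return True


class PromptLogicTest(unittest.TestCase):

    def test_valid_template_passes(self):
        # The explicit assertion states what the test verifies.
        self.assertTrue(verify_prompt_logic("<user>hi<assistant>", "<user>hi"))

    def test_invalid_template_raises(self):
        with self.assertRaises(ValueError):
            verify_prompt_logic("<assistant>", "<user>hi")
```

Returning True adds nothing to the failure path, since a failure still raises, but it gives each test an explicit assertion to read.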

SurbhiJainUSC · 2026-05-28T16:08:38Z

  actual_prefix_in_full_turn = full_turn_ids[len(prompt_wo_gen_ids) : len(prompt_wo_gen_ids) + len(assistant_prefix)]

  if actual_prefix_in_full_turn != assistant_prefix:
+    # Allow the generation prompt to include a thought channel block (e.g., for Gemma 4).


This logic looks like a hacky approach to support Gemma4. I am working on generalized logic to support any model that requires specific prefix shifting. I will send out the PR soon.

Agreed, it was definitely a workaround. Thanks for taking the lead on a generalized solution! I'll track #4010 and we can use that approach instead.

I've rebased this PR on top of #4010
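For readers following this thread, the prefix check under discussion can be sketched on plain token ID lists. The variable names follow the diff excerpt above; the optional_suffix parameter and the trimming step are assumptions made for illustration, and the sketch reproduces neither the original workaround nor the generalized logic in #4010.

```python
def prefix_matches(full_turn_ids, prompt_wo_gen_ids, assistant_prefix, optional_suffix=None):
    """Check that the assistant prefix follows the prompt in the full turn.

    optional_suffix is a hypothetical list of token IDs (for example a thought
    channel block) that the generation prompt may end with even though the
    full turn does not contain it at that position.
    """
    start = len(prompt_wo_gen_ids)
    actual = full_turn_ids[start : start + len(assistant_prefix)]
    if actual == assistant_prefix:
        return True
    # Relaxed path: drop the optional trailing block and compare what is left.
    if optional_suffix and assistant_prefix[-len(optional_suffix):] == optional_suffix:
        trimmed = assistant_prefix[: -len(optional_suffix)]
        return full_turn_ids[start : start + len(trimmed)] == trimmed
    return False


# Made-up token IDs: 10, 11 open the assistant turn; 90, 91 stand for a thought block.
prompt_wo_gen_ids = [1, 2, 3]
assistant_prefix = [10, 11, 90, 91]
full_turn_ids = [1, 2, 3, 10, 11, 50, 51, 12]

strict_ok = prefix_matches(full_turn_ids, prompt_wo_gen_ids, assistant_prefix)
relaxed_ok = prefix_matches(full_turn_ids, prompt_wo_gen_ids, assistant_prefix, optional_suffix=[90, 91])
```

The strict comparison rejects this example because the generation prompt carries two extra tokens, while the relaxed comparison accepts it once the optional block is trimmed.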

RexBearIU changed the title ~~Jackyf/gemma4 lora~~ Extend LoRA for Gemma4 May 22, 2026

RexBearIU mentioned this pull request May 22, 2026

feat: QLoRA support for Dense/MoE models across Pathways and McJAX #3702

Closed

4 tasks

RexBearIU force-pushed the jackyf/gemma4-lora branch from 2bc8632 to ab61640 Compare May 22, 2026 07:38

SurbhiJainUSC reviewed May 27, 2026


RexBearIU force-pushed the jackyf/gemma4-lora branch 2 times, most recently from 61626bd to ef50ff7 Compare May 28, 2026 08:41

SurbhiJainUSC reviewed May 28, 2026


Fix SFT prompt masking for Gemma4

dd610f2

feat: Gemma4 LoRA Extension

5fd616b

RexBearIU force-pushed the jackyf/gemma4-lora branch from ef50ff7 to 5fd616b Compare May 29, 2026 07:48

RexBearIU wants to merge 2 commits into main from jackyf/gemma4-lora

SurbhiJainUSC May 27, 2026
RexBearIU May 28, 2026
SurbhiJainUSC May 28, 2026
SurbhiJainUSC May 28, 2026
RexBearIU May 29, 2026
RexBearIU May 29, 2026


2 participants
