
feat: support convert_lora_to_hf without merge #2234

Merged
yuki-97 merged 7 commits into main from ruit/convert_lora_ckpt on Apr 21, 2026
Conversation

@RayenTian
Contributor

@RayenTian RayenTian commented Apr 8, 2026

Summary

  • New --adapter-only flag: exports only the LoRA adapter weights in HuggingFace PEFT format (no base model required at export time), enabling workflows like vLLM's split base+adapter serving
  • Bug fix in merge_lora_to_hf: adapter weights are now loaded via dist_checkpointing.load with a filtered sharded state dict (via apply_peft_adapter_filter_to_state_dict), fixing a KeyError that occurred with the previous _load_model_weights_from_checkpoint path, which silently drops adapter parameters
  • Functional test: test_converter_roundtrip.py now covers the adapter-only export path and verifies that re-merging the exported PEFT adapter via merge_lora_to_hf produces weights identical to the direct merge path
  • Docs: updated checkpointing.md and sft.md with both export modes (Option A: merged, Option B: adapter-only)
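To make the bug fix concrete, here is an illustrative sketch of the filtering idea. The repo's actual helper is apply_peft_adapter_filter_to_state_dict, whose real signature is internal to this project; the function name, key markers, and toy state dict below are all hypothetical stand-ins.

```python
# Hypothetical sketch (NOT the repo's implementation): filter a sharded
# state dict down to adapter entries before loading, so that a load from an
# adapter-only checkpoint never requests base-model keys it does not contain.

def filter_adapter_state_dict(state_dict, markers=("lora_a", "lora_b", "adapter")):
    """Keep only entries whose key names identify LoRA/PEFT adapter weights."""
    return {
        key: value
        for key, value in state_dict.items()
        if any(marker in key.lower() for marker in markers)
    }

# Toy state dict: one base-model entry and two adapter entries.
full_sd = {
    "decoder.layers.0.self_attention.linear_qkv.weight": "base",
    "decoder.layers.0.self_attention.linear_qkv.adapter.lora_a.weight": "A",
    "decoder.layers.0.self_attention.linear_qkv.adapter.lora_b.weight": "B",
}
adapter_sd = filter_adapter_state_dict(full_sd)
print(sorted(adapter_sd))  # only the two adapter keys survive
```

Loading against only the surviving keys is what avoids the KeyError: the adapter checkpoint holds adapter parameters only, so any base-model key in the requested state dict would be missing.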

What does this PR do?

Adds an --adapter-only flag so convert_lora_to_hf can export LoRA adapter weights in HuggingFace PEFT format without merging them into the base model, and fixes adapter-weight loading in merge_lora_to_hf.

Issues

closes #2190

Usage

  • Pass the new --adapter-only flag to examples/converters/convert_lora_to_hf.py to export only the PEFT-format adapter; omit the flag to merge the adapter into the base model as before.
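A sketch of the two export modes described in the summary. The script path and the --adapter-only flag come from this PR; the checkpoint paths are placeholders and the other flag names are illustrative assumptions, not the script's confirmed CLI.

```shell
# Option A: merge the LoRA adapter into the base model (existing behavior).
# Paths are placeholders; flags other than --adapter-only are illustrative.
python examples/converters/convert_lora_to_hf.py \
    --lora-ckpt-path /path/to/lora_checkpoint \
    --output-path /path/to/merged_hf_model

# Option B (new): export only the adapter weights in HuggingFace PEFT format,
# e.g. for vLLM-style split base+adapter serving.
python examples/converters/convert_lora_to_hf.py \
    --lora-ckpt-path /path/to/lora_checkpoint \
    --output-path /path/to/peft_adapter \
    --adapter-only
```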

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.


@copy-pr-bot

copy-pr-bot Bot commented Apr 8, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@RayenTian RayenTian force-pushed the ruit/convert_lora_ckpt branch from 43f4642 to 0642c7a on April 9, 2026 09:12
@github-actions github-actions Bot added the Documentation Improvements or additions to documentation label Apr 9, 2026
@RayenTian RayenTian added the CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) label Apr 9, 2026
@RayenTian
Contributor Author

/ok to test eeef66c

@RayenTian RayenTian marked this pull request as ready for review April 9, 2026 09:21
@RayenTian RayenTian requested review from a team as code owners April 9, 2026 09:21
@RayenTian RayenTian changed the title from "feat: enhance convert_lora_to_hf script to support exporting LoRA ada…" to "feat: support convert_lora_to_hf without merge" on Apr 9, 2026
@RayenTian
Contributor Author

/ok to test 9de1880

@RayenTian
Contributor Author

/ok to test 64a8679

@RayenTian
Contributor Author

/ok to test ed1bbdd

@RayenTian
Contributor Author

/ok to test f24be7a

yuki-97 previously approved these changes Apr 13, 2026
Contributor

@yuki-97 yuki-97 left a comment


thanks for fixing and adding the support! lgtm

@yuki-97
Contributor

yuki-97 commented Apr 13, 2026

@terrykong could you take a review as well?

@RayenTian
Contributor Author

/ok to test 5396d85

Collaborator

@terrykong terrykong left a comment


Nice work on this PR! The refactoring to extract _build_megatron_model_with_lora as a shared context manager is clean, the bug fix for adapter loading (using dist_checkpointing.load with a filtered state dict via apply_peft_adapter_filter_to_state_dict) correctly addresses the KeyError from the old _load_model_weights_from_checkpoint path, and the docs clearly present both export modes with examples. Nice work addressing the community request (#2190).

Two minor suggestions below — neither is a blocker.

Generated by Claude Code

Comment thread on examples/converters/convert_lora_to_hf.py (outdated)
terrykong previously approved these changes Apr 21, 2026
Collaborator

@terrykong terrykong left a comment


lgtm, the docstring was just a nit; btw, it looks like there is a Sphinx build failure

…pters in HuggingFace PEFT format

…options

Signed-off-by: ruit <ruit@nvidia.com>
@RayenTian RayenTian dismissed stale reviews from terrykong and yuki-97 via 2db3e7d April 21, 2026 07:09
@RayenTian RayenTian force-pushed the ruit/convert_lora_ckpt branch from 5396d85 to 2db3e7d on April 21, 2026 07:09
@RayenTian
Contributor Author

/ok to test 2db3e7d

@yuki-97 yuki-97 merged commit f1835bb into main Apr 21, 2026
29 checks passed
@yuki-97 yuki-97 deleted the ruit/convert_lora_ckpt branch April 21, 2026 08:28

Labels

CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) Documentation Improvements or additions to documentation

Development

Successfully merging this pull request may close these issues.

How save Lora Weight into normal format

3 participants