Feat (utils): replace weights with quantized ones by Giuseppe5 · Pull Request #1505 · Xilinx/brevitas

Giuseppe5 · 2026-04-04T00:08:58Z

Reason for this PR

In certain instances, such as learned round, the weight tensor has extra parameters attached to it that can complicate export.

Changes Made in this PR

We perform a destructive replacement of the original weight tensor with its quantized counterpart. This allows for easier export in many scenarios.

There is an optional flag to keep track of the original weights.

After merging, we reset the quantizers, including setting the rounding mode to round, which is the mode most commonly supported during the export process.
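As a rough illustration of the idea described above (not the Brevitas implementation), the sketch below fake-quantizes plain Python lists of weights with a fixed scale and round-to-nearest, overwrites the originals in place, and optionally keeps a copy. `ToyLayer`, `fake_quantize`, and `merge_quant_weights_sketch` are hypothetical names, and the scale and bit width are arbitrary assumptions.

```python
import math
from typing import Dict, List, Optional


class ToyLayer:
    """Hypothetical stand-in for a quantized layer: float weights plus a fixed scale."""

    def __init__(self, weight: List[float], scale: float, bit_width: int = 8):
        self.weight = weight
        self.scale = scale
        self.bit_width = bit_width


def fake_quantize(w: float, scale: float, bit_width: int) -> float:
    """Round to the nearest integer level, clamp to the signed range, rescale."""
    q_min, q_max = -(2 ** (bit_width - 1)), 2 ** (bit_width - 1) - 1
    level = math.floor(w / scale + 0.5)  # plain rounding, no learned offset
    return max(q_min, min(q_max, level)) * scale


def merge_quant_weights_sketch(
        layers: Dict[str, ToyLayer],
        keep_original: bool = False) -> Optional[Dict[str, List[float]]]:
    """Destructively replace each weight with its quantized value.

    Returns a copy of the original weights only if keep_original is True.
    """
    originals: Optional[Dict[str, List[float]]] = {} if keep_original else None
    for name, layer in layers.items():
        if originals is not None:
            originals[name] = list(layer.weight)
        layer.weight = [
            fake_quantize(w, layer.scale, layer.bit_width) for w in layer.weight]
    return originals


layers = {"fc": ToyLayer([0.26, -0.49, 1.0], scale=0.25)}
saved = merge_quant_weights_sketch(layers, keep_original=True)
```

Once the weights already sit on the quantization grid, re-quantizing them with plain rounding is a no-op, which is why resetting to standard round after the merge is safe in this toy setting.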

What is missing:

Integration in the LLM entrypoint.

Testing Summary

New tests added.

nickfraser

See comments

pablomlago · 2026-05-13T09:43:10Z

        state_dict[keys_map[old_key]] = state_dict.pop(old_key)


+class merge_quant_weights:


I do not see a reason why this should be a context manager, especially considering that you need to add a check to verify that only one forward pass is run. In my view, this should be a function that accepts a sample and runs the forward pass within the function.
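One possible shape for the function-style API suggested here, sketched with a toy model rather than real Brevitas modules. `merge_quant_weights_fn`, `ToyModel`, and its `quantized_weight` attribute are hypothetical. The only point is that the forward pass happens inside the function, so no separate "exactly one forward pass" check is needed.

```python
import math
from typing import List


class ToyModel:
    """Hypothetical model whose forward pass computes quantized weights as a side effect."""

    def __init__(self, weight: List[float]):
        self.weight = weight
        self.quantized_weight: List[float] = []
        self.forward_calls = 0

    def forward(self, x: float) -> float:
        self.forward_calls += 1
        # Toy quantizer: snap each weight to a 0.5 grid.
        self.quantized_weight = [math.floor(w * 2 + 0.5) / 2 for w in self.weight]
        return sum(q * x for q in self.quantized_weight)


def merge_quant_weights_fn(model: ToyModel, sample: float) -> None:
    """Run a single forward pass on `sample`, then overwrite the weights."""
    model.forward(sample)
    model.weight = list(model.quantized_weight)


model = ToyModel([0.3, 0.8, -0.2])
merge_quant_weights_fn(model, sample=1.0)
```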

pablomlago · 2026-05-13T09:45:52Z

+class merge_quant_weights:
+    """Context manager that merges quantized weights into model weights.
+
+    This could be useful for example with Learned Round.


From what I see, this is the main purpose of the context manager. What else is this context manager intended to cover? Also, this docstring does not reflect the fact that the scales are also updated to PARAMETER_FROM_STATS.

pablomlago · 2026-05-13T09:56:03Z

+        self._model = model
+        self._hooks: List[RemovableHandle] = []
+        self._module_tensor_id_mapping = {}
+        self.disable_quant = disable_quant


Is there a reason why some attributes are public and others private? E.g., _model and disable_quant.

pablomlago · 2026-05-13T09:56:40Z

+            for module in self._module_tensor_id_mapping:
+                self._reset_quantizer(module)
+
+    def change_scale_impl_type(self, proxy) -> None:


Suggested change

-    def change_scale_impl_type(self, proxy) -> None:
+    def change_scale_impl_type(self, proxy: WeightQuantProxyFromInjectorBase) -> None:

pablomlago · 2026-05-13T09:58:14Z

+    @staticmethod
+    def _reset_quantizer(proxy) -> None:
+        """Switch a weight quant proxy from LearnedRound back to standard Round."""
+        reinit_on_state_dict = config.REINIT_ON_STATE_DICT_LOAD


This pattern of overriding values in config and then restoring to the original values appears multiple times. Can we extract this common functionality? E.g.:

    from contextlib import contextmanager

    @contextmanager
    def override_config(**overrides):
        old = {}
        try:
            for k, v in overrides.items():
                old[k] = getattr(config, k)
                setattr(config, k, v)
            yield
        finally:
            for k, v in old.items():
                setattr(config, k, v)

and then use it like:

    with override_config(
        REINIT_ON_STATE_DICT_LOAD=False,
        IGNORE_MISSING_KEYS=True,
    ):
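To check that a helper of this kind restores the flags even when the body raises, here is a self-contained sketch. The `config` object is a `SimpleNamespace` stand-in for the real config module, and its initial flag values are assumptions made only for the demonstration.

```python
from contextlib import contextmanager
from types import SimpleNamespace

# Stand-in for the real config module (an assumption for this sketch).
config = SimpleNamespace(REINIT_ON_STATE_DICT_LOAD=True, IGNORE_MISSING_KEYS=False)


@contextmanager
def override_config(**overrides):
    """Temporarily set attributes on `config`, restoring them on exit."""
    old = {}
    try:
        for k, v in overrides.items():
            old[k] = getattr(config, k)
            setattr(config, k, v)
        yield
    finally:
        # Runs on normal exit and on exceptions alike.
        for k, v in old.items():
            setattr(config, k, v)


seen_inside = None
try:
    with override_config(REINIT_ON_STATE_DICT_LOAD=False, IGNORE_MISSING_KEYS=True):
        seen_inside = (config.REINIT_ON_STATE_DICT_LOAD, config.IGNORE_MISSING_KEYS)
        raise RuntimeError("simulated failure while loading a state dict")
except RuntimeError:
    pass

restored = (config.REINIT_ON_STATE_DICT_LOAD, config.IGNORE_MISSING_KEYS)
```

Because the restore loop sits in `finally` and only iterates over keys that were actually saved, a misspelled override name raises `AttributeError` without leaving earlier overrides applied.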

pablomlago · 2026-05-13T10:01:40Z

+    LearnedRoundImplType.HARD_SIGMOID, LearnedRoundImplType.SIGMOID, LearnedRoundImplType.IDENTITY]
+
+
+def _insert_learned_round(model, learned_round_param):


I think there is no need to create a new function. Use insert_learned_round_quantizers.

pablomlago · 2026-05-13T10:03:35Z

+
+
+@pytest.mark.parametrize("learned_round_param", LEARNED_ROUND_OPTIONS)
+def test_merge_quant_weights_reset(learned_round_param):


This test could be merged into the previous one.

pablomlago · 2026-05-13T10:03:56Z

+
+
+@pytest.mark.parametrize("learned_round_param", LEARNED_ROUND_OPTIONS)
+def test_merge_quant_weights_forward_equivalence(learned_round_param):


Probably this test can also be merged into the previous one.

pablomlago · 2026-05-13T10:04:29Z

 # Copyright (C) 2023, Advanced Micro Devices, Inc. All rights reserved.
 # SPDX-License-Identifier: BSD-3-Clause

+from typing import Dict


Please remove unused imports.

Giuseppe5 added 3 commits April 4, 2026 01:05

Feat (utils): replace weights with quantized ones

5b9965c

cleanup

2fb94d0

Fixed tests

40ef943

Giuseppe5 requested a review from pablomlago April 9, 2026 08:38

Giuseppe5 self-assigned this Apr 9, 2026

nickfraser requested changes Apr 13, 2026


Comment thread src/brevitas/nn/utils.py Outdated

Review

580abc7

Giuseppe5 added the "next release" label (PRs which should be merged for the next release) on Apr 20, 2026

pablomlago requested changes May 13, 2026

Giuseppe5 wants to merge 4 commits into Xilinx:dev from Giuseppe5:merge_ln


3 participants
