Gate weights cache on runtime option instead of compile-time macro (#19603) by hboyraz · Pull Request #19603 · pytorch/executorch

hboyraz · 2026-05-14T19:23:21Z

Summary:

Replaces the compile-time #ifdef ENABLE_XNNPACK_WEIGHTS_CACHE gate in
XNNCompiler.cpp with a runtime boolean plumbed from
XnnpackBackendOptions::resolve_weight_cache(context) through
XNNPACKBackend::init to XNNCompiler::compileModel.

This fixes a silent-disable bug: previously, runtime opt-in via
set_option(weight_cache_option_key, true) was silently a no-op unless
the build also set -c executorch.xnnpack_weights_cache=1, because the
cache pointer handed to xnn_create_runtime_v4 was hardcoded to nullptr
when the macro was undefined. Multimethod LoRA models re-packed the entire backbone for every method load, costing
hundreds of MB of resident memory.

The runtime path now keys all three cache-relevant code regions
(unpacked-data load, cache pointer handoff to xnn_create_runtime_v4, and
finalize_for_runtime) on bool use_weight_cache resolved per-init from
the BackendInitContext.

The Result<vector<string>> declaration in compileModel was reshaped to
plain vector<string> since Result<> is non-assignable, which is
required for the new runtime branch.

Reviewed By: GregoryComer

Differential Revision: D105123995

pytorch-bot · 2026-05-14T19:23:26Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19603

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Run pull request jobs on OSDC runners in shadow mode

⏳ No Failures, 138 Pending

As of commit 8601d3b with merge base 09a7cbe ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-05-14T19:23:31Z

@hboyraz has exported this pull request. If you are a Meta employee, you can view the originating Diff in D105123995.

github-actions · 2026-05-14T19:24:12Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

hboyraz · 2026-05-14T22:01:29Z

@pytorchbot label "release notes: bug fix"

pytorch-bot · 2026-05-14T22:01:33Z

Didn't find following labels among repository labels: release notes: bug fix

…ytorch#19603) Summary: Replaces the compile-time `#ifdef ENABLE_XNNPACK_WEIGHTS_CACHE` gate in XNNCompiler.cpp with a runtime boolean plumbed from `XnnpackBackendOptions::resolve_weight_cache(context)` through `XNNPACKBackend::init` to `XNNCompiler::compileModel`. This fixes a silent-disable bug: previously, runtime opt-in via `set_option(weight_cache_option_key, true)` was silently a no-op unless the build also set `-c executorch.xnnpack_weights_cache=1`, because the cache pointer handed to `xnn_create_runtime_v4` was hardcoded to nullptr when the macro was undefined. Multimethod LoRA models re-packed the entire backbone for every method load, costing hundreds of MB of resident memory. The runtime path now keys all three cache-relevant code regions (unpacked-data load, cache pointer handoff to xnn_create_runtime_v4, and finalize_for_runtime) on `bool use_weight_cache` resolved per-init from the BackendInitContext. The `Result<vector<string>>` declaration in compileModel was reshaped to plain `vector<string>` since `Result<>` is non-assignable, which is required for the new runtime branch. Reviewed By: GregoryComer Differential Revision: D105123995

hboyraz requested a review from digantdesai as a code owner May 14, 2026 19:23

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 14, 2026

meta-codesync Bot added fb-exported meta-exported labels May 14, 2026

JacobSzwejbka approved these changes May 14, 2026

View reviewed changes

GregoryComer approved these changes May 14, 2026

View reviewed changes

meta-codesync Bot changed the title ~~Gate weights cache on runtime option instead of compile-time macro~~ Gate weights cache on runtime option instead of compile-time macro (#19603) May 15, 2026

hboyraz force-pushed the export-D105123995 branch from e33c3f9 to fcbc108 Compare May 15, 2026 02:32

hboyraz force-pushed the export-D105123995 branch from fcbc108 to bbf2b17 Compare May 15, 2026 02:33

Merge branch 'main' into export-D105123995

8601d3b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gate weights cache on runtime option instead of compile-time macro (#19603)#19603

Gate weights cache on runtime option instead of compile-time macro (#19603)#19603
hboyraz wants to merge 2 commits into
pytorch:mainfrom
hboyraz:export-D105123995

hboyraz commented May 14, 2026 •

edited by meta-codesync Bot

Loading

Uh oh!

pytorch-bot Bot commented May 14, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

Uh oh!

hboyraz commented May 14, 2026

Uh oh!

pytorch-bot Bot commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hboyraz commented May 14, 2026 • edited by meta-codesync Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19603

❗ 1 Active SEVs

⏳ No Failures, 138 Pending

Uh oh!

meta-codesync Bot commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

This PR needs a release notes: label

Uh oh!

hboyraz commented May 14, 2026

Uh oh!

pytorch-bot Bot commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hboyraz commented May 14, 2026 •

edited by meta-codesync Bot

Loading

pytorch-bot Bot commented May 14, 2026 •

edited

Loading

This PR needs a `release notes:` label