
Add Parallelization across multiple devices during Solve #346

Open
mj023 wants to merge 6 commits into main from distributed

Conversation

mj023 (Collaborator) commented May 8, 2026

This PR is a continuation of #147. It uses JAX's automatic parallelization inside jitted functions to make it possible to split the state space across multiple devices and then let every device solve its part independently.
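For context, a minimal sketch of the mechanism (names and shapes are illustrative, not this PR's API): an array placed on a sharded layout flows through a jitted function, and JAX's automatic parallelization keeps each shard's computation on its own device.

```python
import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec

# Illustrative sketch only; not this PR's API.
devices = jax.devices()
mesh = Mesh(np.array(devices), ("states",))
sharding = NamedSharding(mesh, PartitionSpec("states"))

# Stand-in for a state grid; its length is a multiple of the device
# count, matching the requirement described below.
state_grid = jax.device_put(jnp.linspace(0.0, 1.0, 8 * len(devices)), sharding)

@jax.jit
def solve_step(states):
    # Elementwise work requires no communication, so each device
    # solves its shard of the state space independently.
    return jnp.tanh(states) ** 2

V_arr = solve_step(state_grid)
print(V_arr.sharding)  # the output inherits the input's sharding
```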

Distribution strategy

The grids get a new argument `distributed` that the user can use to specify which grids should be considered for the distribution across devices. If only one grid is marked for distribution, its length must be a multiple of the number of available devices; if multiple grids are marked, the product of their lengths must equal the number of available devices exactly (it might be possible to relax this requirement). The grids then need to be moved to the right devices after they have been initialized with runtime-supplied points and shocks. The resulting `VF_arr` is then automatically split across the devices on which its values were computed.
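A sketch of the two validation rules just described, written as a hypothetical standalone check (the name `validate_distributed_grids` is invented for illustration; the PR's actual check lives elsewhere, see the commit message below):

```python
import math
import jax

def validate_distributed_grids(grid_lengths: list[int]) -> None:
    """Hypothetical check implementing the two rules above."""
    n_devices = len(jax.devices())
    if len(grid_lengths) == 1:
        if grid_lengths[0] % n_devices != 0:
            raise ValueError(
                f"Distributed grid length {grid_lengths[0]} must be a "
                f"multiple of the {n_devices} available devices."
            )
    elif math.prod(grid_lengths) != n_devices:
        raise ValueError(
            f"Product of distributed grid lengths "
            f"({math.prod(grid_lengths)}) must equal the number of "
            f"available devices ({n_devices})."
        )
```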

TODO

  • Move grids to right devices after state action space creation
  • Fix AOT-Compilation
  • Test communication overhead (see the timing sketch after this list)
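As a starting point for the last item, a hedged sketch of how per-step overhead could be measured on sharded versus single-device inputs (the setup is illustrative, not part of this PR):

```python
import time
import jax

def mean_step_time(fn, x, n_runs=10):
    """Average wall-clock seconds per call of a jitted function.

    block_until_ready matters because JAX dispatches asynchronously;
    without it only the dispatch would be timed, not the computation
    (including any cross-device communication).
    """
    fn(x).block_until_ready()  # warm-up call triggers compilation
    start = time.perf_counter()
    for _ in range(n_runs):
        out = fn(x)
    out.block_until_ready()
    return (time.perf_counter() - start) / n_runs
```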

mj023 mentioned this pull request May 8, 2026
read-the-docs-community (Bot) commented May 8, 2026

github-actions (Bot) commented May 8, 2026

Benchmark comparison (main → HEAD)

Comparing 99a5e31d (main) → 7eeaf370 (HEAD)

| Benchmark | Statistic | before | after | Ratio | Alert |
|---|---|---|---|---|---|
| aca-baseline | execution time | 27.474 s | 26.771 s | 0.97 | |
| | peak GPU mem | 509 MB | 847 MB | 1.66 | |
| | compilation time | 299.51 s | 301.27 s | 1.01 | |
| | peak CPU mem | 7.65 GB | 7.53 GB | 0.98 | |
| Mahler-Yum | execution time | 4.712 s | 4.742 s | 1.01 | |
| | peak GPU mem | 529 MB | 529 MB | 1.00 | |
| | compilation time | 14.59 s | 16.96 s | 1.16 | |
| | peak CPU mem | 1.68 GB | 1.73 GB | 1.03 | |
| Precautionary Savings - Solve | execution time | 50.8 ms | 49.9 ms | 0.98 | |
| | peak GPU mem | 101 MB | 101 MB | 1.00 | |
| | compilation time | 2.71 s | 2.80 s | 1.03 | |
| | peak CPU mem | 1.13 GB | 1.13 GB | 1.00 | |
| Precautionary Savings - Simulate | execution time | 126.7 ms | 127.0 ms | 1.00 | |
| | peak GPU mem | 344 MB | 344 MB | 1.00 | |
| | compilation time | 4.90 s | 7.11 s | 1.45 | |
| | peak CPU mem | 1.31 GB | 1.32 GB | 1.01 | |
| Precautionary Savings - Solve & Simulate | execution time | 145.2 ms | 152.9 ms | 1.05 | |
| | peak GPU mem | 578 MB | 578 MB | 1.00 | |
| | compilation time | 7.02 s | 9.00 s | 1.28 | |
| | peak CPU mem | 1.28 GB | 1.31 GB | 1.03 | |
| Precautionary Savings - Solve & Simulate (irreg) | execution time | 283.3 ms | 295.9 ms | 1.04 | |
| | peak GPU mem | 2.19 GB | 2.19 GB | 1.00 | |
| | compilation time | 7.58 s | 9.86 s | 1.30 | |
| | peak CPU mem | 1.34 GB | 1.36 GB | 1.02 | |

hmgaudecker pushed a commit that referenced this pull request May 9, 2026
Squash of `distributed` (mj023). Adds a `distributed=True` flag on
`DiscreteGrid` to shard the grid across JAX devices, threads the
distribution pattern through `solve_brute._get_regime_V_shapes_and_shardings`,
and validates the device-count match at runtime via a new check in
`InternalRegime.state_action_space`.

Rebased on top of `feat/canonical-float-dtype` so the work picks up the
dtype-barrier and simulate-AOT changes. Also retargets the second caller
of the renamed shapes helper (`_reconstruct_next_regime_to_V_arr`) at
the new name.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
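To make the helper's role concrete, a hedged illustration of what a shapes-and-shardings computation could look like (this is not the code of `_get_regime_V_shapes_and_shardings`, only a sketch of the idea; `V_shape_and_sharding` and its parameters are invented for illustration):

```python
import numpy as np
import jax
from jax.sharding import Mesh, NamedSharding, PartitionSpec

def V_shape_and_sharding(grid_lengths, distributed_axis):
    """Shape of the value-function array plus a sharding that splits it
    along the distributed grid's axis and replicates the other axes."""
    shape = tuple(grid_lengths)
    mesh = Mesh(np.array(jax.devices()), ("devices",))
    spec = [None] * len(shape)
    spec[distributed_axis] = "devices"
    return shape, NamedSharding(mesh, PartitionSpec(*spec))
```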
hmgaudecker changed the base branch from main to feat/canonical-float-dtype May 9, 2026 11:42
Base automatically changed from feat/canonical-float-dtype to main May 11, 2026 07:14
Add a `distributed=True` flag on `DiscreteGrid` to shard the grid
across JAX devices, thread the distribution pattern through
`solve_brute._get_regime_V_shapes_and_shardings`, and validate the
device-count match at runtime via a new check in
`InternalRegime.state_action_space`.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
