PSLmodels
diff --git a/‎CHANGELOG.md‎
Lines changed: 6 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 6 additions & 0 deletions
diff --git a/‎docs/book/content/theory/derivations.md‎
Lines changed: 13 additions & 0 deletions b/‎docs/book/content/theory/derivations.md‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎docs/book/content/theory/equilibrium.md‎
Lines changed: 9 additions & 0 deletions b/‎docs/book/content/theory/equilibrium.md‎
Lines changed: 9 additions & 0 deletions
diff --git a/‎docs/book/content/theory/images/HH_jac_sparsity.png‎
65.5 KB b/‎docs/book/content/theory/images/HH_jac_sparsity.png‎
65.5 KB
diff --git a/‎examples/run_sparse_FOC_jac_compare.py‎
Lines changed: 254 additions & 0 deletions b/‎examples/run_sparse_FOC_jac_compare.py‎
Lines changed: 254 additions & 0 deletions
@@ -5,6 +5,12 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [0.15.14] - 2026-06-03 12:00:00
+
+### Added
+
+- Adds an optional `use_sparse_FOC_jac` `Specifications` parameter (default off, so default runs are unchanged) that accelerates the time path iteration (TPI) household solve. When True, `scipy.optimize.root` is given a sparse (banded) finite-difference Jacobian for the stacked household Euler and labor first order conditions: the sparsity pattern is auto-detected once per problem size and the solver then needs far fewer function evaluations per Jacobian build (about 20x fewer on the default S=80 cohort solve), with an automatic fallback to dense finite differences if the Jacobian is not sparse enough to benefit or if a solve fails. The result matches the dense-finite-difference solution to within the model's resource-constraint accuracy floor on every calibration tested (OG-Core standard example, OG-ETH, OG-ZAF, OG-PHL, OG-IDN), giving roughly a 1.9-2.4x TPI speedup at no accuracy cost.
+
 ## [0.15.13] - 2026-05-15 06:00:00
 
 ### Added
 
@@ -114,3 +114,16 @@ In the Cobb-Douglas unit elasticity case ($\varepsilon=1$) of the CES production
 ```
 
 Again, even if this simple case, we cannot solve for $r$ as a function of $w$ for the reasons above.
+
+
+(SecAppDerivHHjac)=
+## Sparsity of the household equation Jacobian
+
+  Holding fixed the prices and policies a type-$j$ cohort faces, its $2S$ stationarized necessary conditions {eq}`EqStnrz_eul_n`, {eq}`EqStnrz_eul_b`, and {eq}`EqStnrz_eul_bS` in the $2S$ unknowns $\{n_{j,s},\hat b_{j,s+1}\}_{s=E+1}^{E+S}$ have a banded Jacobian. From the budget constraint {eq}`EqStnrzHHBC`, stationarized consumption at age $s$ depends on only three unknowns,
+
+  ```{math}
+  :label: EqAppDerivHHjac_cons
+    \hat c_{j,s} = \frac{1}{p}\Bigl[(1+r_p)\hat b_{j,s} + \hat w\,e_{j,s}\,n_{j,s} - \widehat{tax}_{j,s} - e^{g_y}\hat b_{j,s+1}\Bigr] + X_{j,s},
+  ```
+
+  where $\widehat{tax}_{j,s}$ depends only on $(\hat b_{j,s}, n_{j,s})$ through labor and capital income (already in the active set, so it adds no further coupling), and $X_{j,s}$ collects terms fixed in the inner solve (bequests $\hat{bq}_{j,s}$, remittances $\hat{rm}_{j,s}$, government transfers $\hat{tr}_{j,s}$, UBI $\hat{ubi}_{j,s}$, the pension benefit $\theta_j$, and the $\hat c_{min,i}$ terms). The labor Euler equation {eq}`EqStnrz_eul_n` at age $s$ therefore depends on $\{\hat b_{j,s},\hat b_{j,s+1},n_{j,s}\}$ alone, and the savings Euler equation {eq}`EqStnrz_eul_b`---which links $\hat c_{j,s}$ to $\hat b_{j,s+1}$ and $\hat c_{j,s+1}$---depends on $\{\hat b_{j,s},\hat b_{j,s+1},\hat b_{j,s+2},n_{j,s},n_{j,s+1}\}$. The marginal tax rates $\tau^{mtrx}_s$ and $\tau^{mtry}_{s+1}$ are functions of own-age income (already in these sets), so they add no further coupling, and the terminal condition {eq}`EqStnrz_eul_bS` is sparser still. Each of the $2S$ equations therefore depends on at most five of the $2S$ unknowns, regardless of $S$, so the Jacobian has at most $10S$ nonzero entries rather than the $(2S)^2 = 4S^2$ of a fully coupled system. This is the per-cohort counterpart to the dense $2JS$ system noted at the start of Chapter {ref}`Chap_Eqm`: cohorts couple only through prices, which are held fixed in the inner solve. A finite-difference Jacobian can then be built from a number of evaluations set by the bandwidth---about seven at $S = 80$---rather than $2S$, by probing together unknowns that affect no common equation (Figure {numref}`FigHHjacSparsity`).
@@ -25,6 +25,15 @@ In all of the specifications of `OG-Core`, we use a two-stage fixed point algori
 
 Our approach is to choose the minimum number of macroeconomic variables in an outer loop in order to be able to solve the household's $2JS$ Euler equations in terms of only the $\bar{n}_{j,s}$ and $\bar{b}_{j,s+1}$ variables directly, holding all other variables constant. The household system of Euler equations has a provable root solution and is orders of magnitude more tractable (less nonlinear) to solve holding these outer loop variables constant.
 
+Moreover, with the outer-loop variables held fixed, each cohort's system of $2S$ Euler equations is not only less nonlinear but structurally sparse: every equation involves at most five of the $2S$ unknowns---a household's own age and its immediate neighbors. The root finder normally probes each unknown separately when building each step ($2S = 160$ evaluations of the system when $S = 80$), but with most equations depending on only a handful of unknowns, those affecting no common equation can be probed together, cutting the count to about seven at $S = 80$---a number set by how many neighbors couple, not by $S$. The parameter `use_sparse_FOC_jac` (default `False`) turns this on; the solver falls back to the standard calculation otherwise. The structure is derived in Appendix {ref}`SecAppDerivHHjac`.
+
+```{figure} ./images/HH_jac_sparsity.png
+---
+name: FigHHjacSparsity
+---
+Sparsity pattern of the household equation Jacobian, at $S = 12$. Left: the standard finite-difference solve treats every entry of the $2S\times 2S$ matrix as live ($(2S)^2 = 576$ entries). Right: the actual structure---each Euler equation depends only on a household's own age and its immediate neighbors, leaving most entries zero (92 of 576 here; 636 of 25{,}600 at the default $S = 80$).
+```
+
 The steady-state solution method for each of the cases above is associated with a solution method that has a subset of the following outer-loop variables $\Bigl\{\bar{r}_p, \bar{r}, \bar{w}, \{\bar{p}_m\}_{m=1}^{M-1}, \bar{Y}, \overline{TR}, \overline{BQ}, factor\Bigr\}$.
 
 
 
@@ -0,0 +1,254 @@
+"""
+Side-by-side comparison: a baseline run with ``use_sparse_FOC_jac`` off
+(default) and on. Reports the wall-time speedup, diffs the converged
+steady-state and TPI paths, prints the resource-constraint residual, and
+issues a NO DRIFT / DRIFT DETECTED verdict against a 0.1% threshold.
+
+With no arguments, runs OG-Core's standard example baseline (the same
+configuration as ``run_ogcore_example.py``). With a country package name
+(e.g. ``ogphl``, ``ogzaf``, ``ogidn``, ``ogeth``) as a single argument,
+runs that country's packaged baseline twice. The country package must be
+importable in the active environment; outputs land in the current working
+directory.
+
+The reform leg is skipped; this is about solver speed and correctness on
+a single run.
+
+Run from the repo root:
+
+    python examples/run_sparse_FOC_jac_compare.py          # OG-Core
+    python examples/run_sparse_FOC_jac_compare.py ogphl    # PHL
+"""
+
+# import modules
+import importlib
+import json
+import multiprocessing
+import os
+import sys
+import time
+from importlib.resources import files
+
+import numpy as np
+from distributed import Client
+
+from ogcore.execute import runner
+from ogcore.parameters import Specifications
+from ogcore.utils import safe_read_pickle
+
+
+# Default config for OG-Core mode (no country arg). Matches
+# run_ogcore_example.py.
+_alpha_T = np.zeros(50)
+_alpha_T[0:2] = 0.09
+_alpha_T[2:10] = 0.09 + 0.01
+_alpha_T[10:40] = 0.09 - 0.01
+_alpha_T[40:] = 0.09
+_alpha_G = np.zeros(7)
+_alpha_G[0:3] = 0.05 - 0.01
+_alpha_G[3:6] = 0.05 - 0.005
+_alpha_G[6:] = 0.05
+OGCORE_SPEC = {
+    "frisch": 0.41,
+    "start_year": 2021,
+    "cit_rate": [[0.21]],
+    "debt_ratio_ss": 1.0,
+    "alpha_T": _alpha_T.tolist(),
+    "alpha_G": _alpha_G.tolist(),
+    "initial_guess_r_SS": 0.04,
+}
+
+KEY_AGGREGATES = (
+    "Y",
+    "C",
+    "K",
+    "L",
+    "B",
+    "I_total",
+    "r",
+    "w",
+    "r_p",
+    "r_gov",
+    "TR",
+    "total_tax_revenue",
+    "D",
+    "BQ",
+)
+
+# "No drift" threshold: aggregate differences within 0.1% are economically
+# indistinguishable from the model's own convergence noise.
+NO_DRIFT_THRESHOLD = 1e-3
+
+
+def _max_rel_diff(a, b):
+    a = np.asarray(a, dtype=float)
+    b = np.asarray(b, dtype=float)
+    if a.shape != b.shape or a.size == 0:
+        return float("nan")
+    scale = max(float(np.max(np.abs(a))), 1e-300)
+    return float(np.max(np.abs(a - b))) / scale
+
+
+def _diff_dict(d_dense, d_sparse, var_list=None):
+    """Return [(var, rel_diff), ...] sorted by rel_diff descending."""
+    keys = (
+        var_list
+        if var_list is not None
+        else sorted(set(d_dense) & set(d_sparse))
+    )
+    out = []
+    for var in keys:
+        if var in d_dense and var in d_sparse:
+            try:
+                rel = _max_rel_diff(d_dense[var], d_sparse[var])
+            except (TypeError, ValueError):
+                continue
+            if rel == rel:
+                out.append((var, rel))
+    out.sort(key=lambda x: -x[1])
+    return out
+
+
+def _load_country_defaults(pkg):
+    """Load <pkg>_default_parameters.json, with a 2-D shim for older country
+    calibrations whose replacement_rate_adjust is still 1-D."""
+    with files(pkg).joinpath(f"{pkg}_default_parameters.json").open("r") as f:
+        defaults = json.load(f)
+    rra = defaults.get("replacement_rate_adjust")
+    if isinstance(rra, list) and rra and not isinstance(rra[0], list):
+        defaults["replacement_rate_adjust"] = [rra]
+    return defaults
+
+
+def _apply_country_calibration(pkg, p):
+    """Try the country's offline Calibration; quietly skip on error."""
+    try:
+        Cal = importlib.import_module(pkg + ".calibrate").Calibration
+        try:
+            c = Cal(p, update_from_api=False)
+        except TypeError:
+            c = Cal(p)
+        p.update_specifications(c.get_dict())
+    except Exception as e:
+        print(f"  (calibration skipped: {type(e).__name__}: {str(e)[:80]})")
+
+
+def _run_one(label, country_pkg, out_dir, num_workers, client, sparse_jac):
+    p = Specifications(
+        baseline=True,
+        num_workers=num_workers,
+        baseline_dir=out_dir,
+        output_base=out_dir,
+    )
+    if country_pkg is None:
+        p.update_specifications(OGCORE_SPEC)
+    else:
+        p.update_specifications(_load_country_defaults(country_pkg))
+        _apply_country_calibration(country_pkg, p)
+    p.update_specifications({"use_sparse_FOC_jac": bool(sparse_jac)})
+    print(f"\n[{label}] use_sparse_FOC_jac = {p.use_sparse_FOC_jac}")
+    start = time.time()
+    runner(p, time_path=True, client=client)
+    wall = time.time() - start
+    print(f"[{label}] wall time = {wall:.2f} s")
+    ss = safe_read_pickle(os.path.join(out_dir, "SS", "SS_vars.pkl"))
+    tpi = safe_read_pickle(os.path.join(out_dir, "TPI", "TPI_vars.pkl"))
+    return wall, ss, tpi
+
+
+def main(country_pkg=None):
+    num_workers = min(multiprocessing.cpu_count(), 7)
+    label = country_pkg if country_pkg else "ogcore (standard example)"
+    print(f"Workers: {num_workers}  |  model: {label}")
+    client = Client(n_workers=num_workers, threads_per_worker=1)
+
+    # Outputs land in the current working directory so they're easy to find
+    # regardless of where this script file lives.
+    root = os.path.join(
+        os.getcwd(),
+        "sparse-FOC-jac-compare",
+        country_pkg if country_pkg else "ogcore",
+    )
+    dense_dir = os.path.join(root, "dense")
+    sparse_dir = os.path.join(root, "sparse")
+
+    t_dense, ss_dense, tpi_dense = _run_one(
+        f"{label} DENSE (default)",
+        country_pkg,
+        dense_dir,
+        num_workers,
+        client,
+        False,
+    )
+    t_sparse, ss_sparse, tpi_sparse = _run_one(
+        f"{label} SPARSE (use_sparse_FOC_jac=True)",
+        country_pkg,
+        sparse_dir,
+        num_workers,
+        client,
+        True,
+    )
+
+    tpi_diffs = _diff_dict(tpi_dense, tpi_sparse, KEY_AGGREGATES)
+    ss_diffs = _diff_dict(ss_dense, ss_sparse)
+
+    tpi_worst_var, tpi_worst = tpi_diffs[0] if tpi_diffs else ("n/a", 0.0)
+    ss_worst_var, ss_worst = ss_diffs[0] if ss_diffs else ("n/a", 0.0)
+    worst = max(tpi_worst, ss_worst)
+
+    rc_d = float(
+        np.max(np.abs(tpi_dense.get("resource_constraint_error", np.zeros(1))))
+    )
+    rc_s = float(
+        np.max(
+            np.abs(tpi_sparse.get("resource_constraint_error", np.zeros(1)))
+        )
+    )
+
+    speedup = t_dense / t_sparse if t_sparse > 0 else float("inf")
+    bar = "=" * 64
+    print()
+    print(bar)
+    print(f"  MODEL:  {label}")
+    print("  SPEED")
+    print(f"    dense  :  {t_dense:7.2f} s")
+    print(f"    sparse :  {t_sparse:7.2f} s   ->   {speedup:.2f}x faster")
+    print()
+    print("  DRIFT  (max relative difference, sparse vs dense)")
+    print(
+        f"    TPI worst:  {tpi_worst_var:22s} "
+        f"{tpi_worst * 100:9.4f}%   ({tpi_worst:.2e})"
+    )
+    print(
+        f"    SS  worst:  {ss_worst_var:22s} "
+        f"{ss_worst * 100:9.4f}%   ({ss_worst:.2e})"
+    )
+    if tpi_diffs:
+        print()
+        print("    All TPI aggregates, sorted by drift:")
+        for var, rel in tpi_diffs:
+            print(f"      {var:22s} {rel * 100:9.4f}%   ({rel:.2e})")
+    print()
+    print("  ACCURACY FLOOR  (resource-constraint residual)")
+    print(f"    dense  :  {rc_d:.2e}")
+    print(f"    sparse :  {rc_s:.2e}")
+    print()
+    threshold_pct = NO_DRIFT_THRESHOLD * 100
+    if worst <= NO_DRIFT_THRESHOLD:
+        print(
+            f"  RESULT:  NO DRIFT  "
+            f"(worst {worst * 100:.4f}%  <=  {threshold_pct:g}% threshold)"
+        )
+    else:
+        print(
+            f"  RESULT:  DRIFT DETECTED  "
+            f"(worst {worst * 100:.4f}%  >   {threshold_pct:g}% threshold) "
+            f"-- investigate"
+        )
+    print(bar)
+
+    client.close()
+
+
+if __name__ == "__main__":
+    main(sys.argv[1] if len(sys.argv) > 1 else None)