lbl-camera
diff --git a/‎CLAUDE.md‎
Lines changed: 11 additions & 2 deletions b/‎CLAUDE.md‎
Lines changed: 11 additions & 2 deletions
diff --git a/‎fvgp/fvgp.py‎
Lines changed: 54 additions & 5 deletions b/‎fvgp/fvgp.py‎
Lines changed: 54 additions & 5 deletions
diff --git a/‎fvgp/gp.py‎
Lines changed: 54 additions & 5 deletions b/‎fvgp/gp.py‎
Lines changed: 54 additions & 5 deletions
@@ -39,7 +39,7 @@ Both classes are composed of internal specialist objects created at `__init__` t
 | `GPprior` | [gp_prior.py](fvgp/gp_prior.py) | Kernel and mean function (default: anisotropic Matérn with ARD). In gp2Scale mode also owns `x_data_scatter_future` (the persistent dask scatter of `x_data`) |
 | `GPlikelihood` | [gp_likelihood.py](fvgp/gp_likelihood.py) | Noise model (variances or callable) |
 | `GPkv` | [gp_kv.py](fvgp/gp_kv.py) | Owns K+V matrix state and all factorizations; dispatches solves/logdets across linalg modes |
-| `GPMarginalLikelihood` | [gp_marginal_likelihood.py](fvgp/gp_marginal_likelihood.py) | Log marginal likelihood and its gradient; delegates factorization to `GPkv` |
+| `GPMarginalLikelihood` | [gp_marginal_likelihood.py](fvgp/gp_marginal_likelihood.py) | Log marginal likelihood and its gradient; delegates factorization to `GPkv`. Maintains `_warm_start_KVinvY` for iterative training solves when `args["sparse_krylov_warm_start"]=True`. |
 | `GPposterior` | [gp_posterior.py](fvgp/gp_posterior.py) | Posterior mean/covariance; information-theoretic quantities |
 | `GPtraining` | [gp_training.py](fvgp/gp_training.py) | Hyperparameter optimization (scipy, hgdl async, MCMC, Adam) |
 
@@ -64,7 +64,7 @@ Gotchas:
 ### Key supporting modules
 
 - **[gp_lin_alg.py](fvgp/gp_lin_alg.py)** — CPU/GPU linear algebra primitives; Cholesky, LU, sparse solvers; defines `NonPositiveDefiniteError`
-- **[gp_kv.py](fvgp/gp_kv.py)** — `GPkv` manages all K+V state across linalg modes: `"Chol"`, `"CholInv"`, `"Inv"`, `"sparseLU"`, `"sparseCG"`, `"sparseMINRES"`, and preconditioned variants. The mode is set at init and determines which factorization is updated when data or hyperparameters change. Custom solvers can be injected as a 3-tuple of callables.
+- **[gp_kv.py](fvgp/gp_kv.py)** — `GPkv` manages all K+V state across linalg modes: `"Chol"`, `"CholInv"`, `"Inv"`, `"sparseLU"`, `"sparseCG"`, `"sparseMINRES"`, and preconditioned variants. The mode is set at init and determines which factorization is updated when data or hyperparameters change. Custom solvers can be injected as a 3-tuple of callables. For `sparseMINRESpre`/`sparseCGpre`, `GPkv` caches the preconditioner across `update_KV` / `compute_new_*` calls and rebuilds when `Preconditioner_reuse_counter` ≥ `args["sparse_preconditioner_refresh_interval"] - 1` or when the shape/`sparse_preconditioner_*` args fingerprint changes. `set_KV` always force-refreshes. Aliases like `"sparseCGpre_amg"` are resolved at `__init__` into the canonical mode plus `args["sparse_preconditioner_type"]`.
 - **[kernels.py](fvgp/kernels.py)** — 15+ built-in kernels including Matérn, squared exponential, Wendland (compactly supported)
 - **[gp_mcmc.py](fvgp/gp_mcmc.py)** — Adaptive Metropolis–Hastings sampler used for Bayesian hyperparameter inference
 - **[gp_actor.py](fvgp/gp_actor.py)** — `AsyncOptimizer` wraps `_MCMCActor` and `_AdamActor` for non-blocking background training; used by `GPtraining` for async MCMC and Adam modes
@@ -92,6 +92,15 @@ client.run(lambda: None)  # flush pending releases
 
 The `test_gp2Scale` test uses exactly this pattern between linalg-mode iterations.
 
+### Iterative-solver acceleration (sparseCG / sparseMINRES / *pre modes)
+
+For `sparseCG`, `sparseMINRES`, `sparseCGpre`, and `sparseMINRESpre`, the user can opt into two orthogonal accelerators via `args` on the `GP` constructor:
+
+- **Preconditioner caching** (`sparseCGpre`/`sparseMINRESpre` only): `args["sparse_preconditioner_refresh_interval"] = N` reuses a single preconditioner for up to N consecutive `update_KV` / `compute_new_*` calls before rebuilding. Default `N=1` rebuilds on every call (same as no caching). `args["sparse_preconditioner_type"]` selects the kernel — `"ilu"` (default), `"ic"`/`"incomplete_cholesky"`, `"block_jacobi"`, `"schwarz"`/`"additive_schwarz"`, `"amg"` (requires pyamg). Mode aliases `"sparseCGpre_<type>"` / `"sparseMINRESpre_<type>"` set the type as a shortcut. Cache is invalidated automatically when `KV.shape` or any `sparse_preconditioner_*` arg changes.
+- **Warm-start** (all iterative modes): `args["sparse_krylov_warm_start"] = True` makes `GPMarginalLikelihood` pass the previous training iteration's `KVinvY` as `x0` to the next iterative solve. Cuts iteration counts substantially when successive hyperparameter trials are close. Stored in `marginal_likelihood._warm_start_KVinvY`; reset to `None` on pickling.
+
+Both default off so existing behavior is preserved.
+
 ### Customization API
 
 Kernels, mean functions, and noise models are all plain Python callables with standardized signatures. Users pass them as arguments to `GP`/`fvGP` constructors. The full hyperparameter vector is shared across kernel, mean, and noise callables, but each callable must only read its reserved index range. Kernel gradients can be user-supplied or computed via finite differences.
 
@@ -180,8 +180,13 @@ class fvGP(GP):
         * ``"sparseCG"`` — sparse conjugate-gradient iterative solver.
         * ``"sparseMINRES"`` — sparse MINRES iterative solver.
         * ``"sparseSolve"`` — direct sparse solve via scipy.
-        * ``"sparseCGpre"`` — conjugate-gradient with an incomplete-LU preconditioner.
-        * ``"sparseMINRESpre"`` — MINRES with an incomplete-LU preconditioner.
+        * ``"sparseCGpre"`` — preconditioned conjugate-gradient. The preconditioner type
+          is selected by ``args["sparse_preconditioner_type"]`` (default ``"ilu"``;
+          also ``"ic"``/``"incomplete_cholesky"``, ``"block_jacobi"``,
+          ``"schwarz"``/``"additive_schwarz"``, or ``"amg"`` (requires pyamg)).
+        * ``"sparseMINRESpre"`` — preconditioned MINRES; same preconditioner choices.
+        * ``"sparseCGpre_<type>"`` / ``"sparseMINRESpre_<type>"`` — shortcut that sets
+          ``args["sparse_preconditioner_type"]`` to ``<type>`` (e.g. ``"sparseCGpre_amg"``).
 
         **Custom solver (any GP):**
 
@@ -207,18 +212,62 @@ class fvGP(GP):
     args: dict, optional
         Advanced options. Recognized keys are:
 
+        Stochastic-Lanczos logdet (sparse modes):
+
         - "random_logdet_lanczos_degree" : int; default = 20
         - "random_logdet_error_rtol" : float; default = 0.01
         - "random_logdet_verbose" : True/False; default = False
         - "random_logdet_print_info" : True/False; default = False
-        - "sparse_minres_tol" : float
-        - "sparse_cg_tol" : float
         - "random_logdet_lanczos_compute_device" : str; default = "cpu"/"gpu"
+
+        Sparse iterative solver tolerances and iteration limits:
+
+        - "sparse_cg_tol" : float; default = 1e-5
+        - "sparse_minres_tol" : float; default = 1e-5
+        - "sparse_cg_maxiter" : int; default = None (use scipy default)
+        - "sparse_minres_maxiter" : int; default = None (use scipy default)
+        - "sparse_krylov_maxiter" : int; default = None (applies to both if the
+          solver-specific key is not set)
+        - "sparse_block_krylov" : True/False; default = False — use a block CG
+          variant when there are multiple RHS columns
+        - "sparse_krylov_mode" : "single"/"block"; equivalent toggle
+        - "sparse_krylov_block_size" : int — RHS block size for block CG
+
+        Iterative-solver acceleration (``sparseCG``/``sparseMINRES`` and the
+        ``*pre`` variants):
+
+        - "sparse_krylov_warm_start" : True/False; default = False — feed the
+          previous training iteration's ``KVinvY`` as ``x0`` to the next solve
+        - "sparse_preconditioner_type" : str; default = "ilu". One of "ilu",
+          "ic"/"ichol"/"incomplete_cholesky", "block_jacobi", "schwarz"/
+          "additive_schwarz", "amg" (requires pyamg)
+        - "sparse_preconditioner_refresh_interval" : int; default = 1 —
+          reuse the cached preconditioner for up to N consecutive solves
+          before rebuilding. ``set_KV`` always force-refreshes.
+        - "sparse_preconditioner_block_size" : int — block size for block_jacobi
+          and additive_schwarz partitions
+        - "sparse_preconditioner_schwarz_overlap" : int — overlap layers for
+          additive Schwarz
+        - "sparse_preconditioner_drop_tol" / "sparse_preconditioner_fill_factor"
+          — forwarded to scipy ``spilu`` for "ilu"
+        - "sparse_preconditioner_amg_*" — forwarded to pyamg
+          (``max_levels``, ``max_coarse``, ``strength``, ``cycle``, etc.)
+        - "sparse_preconditioner_shift" / "_growth" / "_attempts" — diagonal
+          shift retry knobs for "ic" / "block_jacobi" / "additive_schwarz" when
+          a local Cholesky encounters a non-PD block
+
+        Cholesky compute-device routing:
+
         - "Chol_factor_compute_device" : str; default = "cpu"/"gpu"
         - "update_Chol_factor_compute_device": str; default = "cpu"/"gpu"
         - "Chol_solve_compute_device" : str; default = "cpu"/"gpu"
         - "Chol_logdet_compute_device" : str; default = "cpu"/"gpu"
-        - "GPU_engine" : str; default = "torch"/"cupy"
+
+        GPU backend:
+
+        - "GPU_engine" : "torch"/"cupy"; default = first available
+        - "GPU_device" : str; e.g. "cuda:1" or "mps"
+        - "GPU_device_index" : int — explicit CUDA device index
 
         All other keys will be stored and are available as part of the object instance and
         in kernel, mean, and noise functions.
 
@@ -180,8 +180,13 @@ class GP:
         * ``"sparseCG"`` — sparse conjugate-gradient iterative solver.
         * ``"sparseMINRES"`` — sparse MINRES iterative solver.
         * ``"sparseSolve"`` — direct sparse solve via scipy.
-        * ``"sparseCGpre"`` — conjugate-gradient with an incomplete-LU preconditioner.
-        * ``"sparseMINRESpre"`` — MINRES with an incomplete-LU preconditioner.
+        * ``"sparseCGpre"`` — preconditioned conjugate-gradient. The preconditioner type
+          is selected by ``args["sparse_preconditioner_type"]`` (default ``"ilu"``;
+          also ``"ic"``/``"incomplete_cholesky"``, ``"block_jacobi"``,
+          ``"schwarz"``/``"additive_schwarz"``, or ``"amg"`` (requires pyamg)).
+        * ``"sparseMINRESpre"`` — preconditioned MINRES; same preconditioner choices.
+        * ``"sparseCGpre_<type>"`` / ``"sparseMINRESpre_<type>"`` — shortcut that sets
+          ``args["sparse_preconditioner_type"]`` to ``<type>`` (e.g. ``"sparseCGpre_amg"``).
 
         **Custom solver (any GP):**
 
@@ -207,18 +212,62 @@ class GP:
     args: dict, optional
         Advanced options. Recognized keys are:
 
+        Stochastic-Lanczos logdet (sparse modes):
+
         - "random_logdet_lanczos_degree" : int; default = 20
         - "random_logdet_error_rtol" : float; default = 0.01
         - "random_logdet_verbose" : True/False; default = False
         - "random_logdet_print_info" : True/False; default = False
-        - "sparse_minres_tol" : float
-        - "sparse_cg_tol" : float
         - "random_logdet_lanczos_compute_device" : str; default = "cpu"/"gpu"
+
+        Sparse iterative solver tolerances and iteration limits:
+
+        - "sparse_cg_tol" : float; default = 1e-5
+        - "sparse_minres_tol" : float; default = 1e-5
+        - "sparse_cg_maxiter" : int; default = None (use scipy default)
+        - "sparse_minres_maxiter" : int; default = None (use scipy default)
+        - "sparse_krylov_maxiter" : int; default = None (applies to both if the
+          solver-specific key is not set)
+        - "sparse_block_krylov" : True/False; default = False — use a block CG
+          variant when there are multiple RHS columns
+        - "sparse_krylov_mode" : "single"/"block"; equivalent toggle
+        - "sparse_krylov_block_size" : int — RHS block size for block CG
+
+        Iterative-solver acceleration (``sparseCG``/``sparseMINRES`` and the
+        ``*pre`` variants):
+
+        - "sparse_krylov_warm_start" : True/False; default = False — feed the
+          previous training iteration's ``KVinvY`` as ``x0`` to the next solve
+        - "sparse_preconditioner_type" : str; default = "ilu". One of "ilu",
+          "ic"/"ichol"/"incomplete_cholesky", "block_jacobi", "schwarz"/
+          "additive_schwarz", "amg" (requires pyamg)
+        - "sparse_preconditioner_refresh_interval" : int; default = 1 —
+          reuse the cached preconditioner for up to N consecutive solves
+          before rebuilding. ``set_KV`` always force-refreshes.
+        - "sparse_preconditioner_block_size" : int — block size for block_jacobi
+          and additive_schwarz partitions
+        - "sparse_preconditioner_schwarz_overlap" : int — overlap layers for
+          additive Schwarz
+        - "sparse_preconditioner_drop_tol" / "sparse_preconditioner_fill_factor"
+          — forwarded to scipy ``spilu`` for "ilu"
+        - "sparse_preconditioner_amg_*" — forwarded to pyamg
+          (``max_levels``, ``max_coarse``, ``strength``, ``cycle``, etc.)
+        - "sparse_preconditioner_shift" / "_growth" / "_attempts" — diagonal
+          shift retry knobs for "ic" / "block_jacobi" / "additive_schwarz" when
+          a local Cholesky encounters a non-PD block
+
+        Cholesky compute-device routing:
+
         - "Chol_factor_compute_device" : str; default = "cpu"/"gpu"
         - "update_Chol_factor_compute_device": str; default = "cpu"/"gpu"
         - "Chol_solve_compute_device" : str; default = "cpu"/"gpu"
         - "Chol_logdet_compute_device" : str; default = "cpu"/"gpu"
-        - "GPU_engine" : str; default = "torch"/"cupy"
+
+        GPU backend:
+
+        - "GPU_engine" : "torch"/"cupy"; default = first available
+        - "GPU_device" : str; e.g. "cuda:1" or "mps"
+        - "GPU_device_index" : int — explicit CUDA device index
 
         All other keys will be stored and are available as part of the object instance and
         in kernel, mean, and noise functions.