PolicyEngine
diff --git a/‎.github/workflows/site-snapshot.yml‎
Lines changed: 126 additions & 0 deletions b/‎.github/workflows/site-snapshot.yml‎
Lines changed: 126 additions & 0 deletions
diff --git a/‎AGENTS.md‎
Lines changed: 82 additions & 0 deletions b/‎AGENTS.md‎
Lines changed: 82 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 14 additions & 2 deletions b/‎README.md‎
Lines changed: 14 additions & 2 deletions
@@ -0,0 +1,126 @@
+name: Site Snapshot
+
+on:
+  pull_request:
+  push:
+    branches:
+      - main
+  workflow_dispatch:
+
+permissions:
+  contents: read
+
+jobs:
+  site-snapshot:
+    runs-on: ubuntu-latest
+    defaults:
+      run:
+        working-directory: microplex-us
+    steps:
+      - name: Check out microplex-us
+        uses: actions/checkout@v4
+        with:
+          path: microplex-us
+
+      - name: Check out core microplex
+        uses: actions/checkout@v4
+        with:
+          repository: CosilicoAI/microplex
+          ref: 71f270edecac3ef748411deb3beb77109c56a721
+          path: microplex
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: "3.13"
+
+      - name: Set up uv
+        uses: astral-sh/setup-uv@v6
+
+      - name: Verify snapshot tooling
+        run: |
+          uv run --extra dev --with pydantic --with-editable ../microplex pytest -q \
+            tests/test_package_imports.py \
+            tests/pipelines/test_check_site_snapshot.py \
+            tests/pipelines/test_imputation_ablation.py \
+            tests/pipelines/test_site_snapshot.py \
+            tests/pipelines/test_version_benchmark.py
+
+      - name: Check generated site snapshot
+        run: |
+          snapshot_path="$(uv run python - <<'PY'
+          import json
+          import tempfile
+          from pathlib import Path
+
+          from microplex_us.pipelines.site_snapshot import write_us_microplex_site_snapshot
+
+          root = Path(tempfile.mkdtemp()).resolve()
+          artifact_dir = root / "run-1"
+          artifact_dir.mkdir()
+          for filename in (
+              "seed_data.parquet",
+              "synthetic_data.parquet",
+              "calibrated_data.parquet",
+              "targets.json",
+          ):
+              (artifact_dir / filename).write_text("{}" if filename == "targets.json" else "")
+
+          (artifact_dir / "manifest.json").write_text(
+              json.dumps(
+                  {
+                      "created_at": "2026-03-29T00:00:00+00:00",
+                      "config": {"n_synthetic": 2000},
+                      "artifacts": {
+                          "seed_data": "seed_data.parquet",
+                          "synthetic_data": "synthetic_data.parquet",
+                          "calibrated_data": "calibrated_data.parquet",
+                          "targets": "targets.json",
+                          "policyengine_harness": "policyengine_harness.json",
+                      },
+                      "synthesis": {
+                          "scaffold_source": "cps_asec_2023",
+                          "state_program_support_proxies": {
+                              "available": ["ssi"],
+                              "missing": ["snap"],
+                          },
+                      },
+                      "calibration": {
+                          "n_loaded_targets": 100,
+                          "n_supported_targets": 90,
+                          "converged": False,
+                          "weight_collapse_suspected": False,
+                      },
+                      "policyengine_harness": {
+                          "candidate_mean_abs_relative_error": 0.9,
+                          "baseline_mean_abs_relative_error": 1.1,
+                          "mean_abs_relative_error_delta": -0.2,
+                      },
+                  }
+              )
+          )
+          (artifact_dir / "policyengine_harness.json").write_text(
+              json.dumps(
+                  {
+                      "summary": {
+                          "candidate_mean_abs_relative_error": 0.9,
+                          "baseline_mean_abs_relative_error": 1.1,
+                          "mean_abs_relative_error_delta": -0.2,
+                          "candidate_composite_parity_loss": 0.8,
+                          "baseline_composite_parity_loss": 1.2,
+                          "target_win_rate": 0.2,
+                          "slice_win_rate": 0.5,
+                          "supported_target_rate": 0.9,
+                          "tag_summaries": {},
+                          "parity_scorecard": {},
+                          "attribute_cell_summaries": {},
+                      }
+                  }
+              )
+          )
+          snapshot_path = root / "snapshots" / "site_snapshot_us.json"
+          write_us_microplex_site_snapshot(artifact_dir, snapshot_path)
+          print(snapshot_path)
+          PY
+          )"
+          uv run microplex-us-check-site-snapshot "$snapshot_path"
@@ -0,0 +1,82 @@
+# AGENTS.md
+
+This repo is the US country pack for `microplex`. Keep it thin where possible and push shared abstractions upstream into core.
+
+## Default posture
+
+- Prefer spec-driven behavior over ad hoc logic in large pipeline files.
+- If a seam is useful for both UK and US, move it to `microplex` instead of polishing a US-only local helper.
+- Keep PolicyEngine-US execution details local unless there is a clean shared protocol.
+
+## Current architectural intent
+
+- `microplex-us` owns:
+  - US source manifests and raw source adapters
+  - PolicyEngine-US execution/materialization
+  - US-specific target providers and benchmark harnesses
+  - US-local pipeline orchestration
+- `microplex` core owns:
+  - targets specs/providers/protocols
+  - reweighting bundles and solver
+  - benchmark metrics/comparisons/suites
+  - shared result-based benchmark builders
+
+## Current mission notes
+
+- For US, the canonical mission metric is the PE-native broad loss frontier, not composite parity.
+- When evaluating progress, prefer:
+  - matched-size `Microplex@N` vs `PE@N`
+  - full `enhanced_cps_2024` only as a stretch reference
+- Recent direct-objective testing showed that changing only the post-export weight objective moves loss very little on the same fixed candidate.
+- Bias effort toward:
+  - better candidate records
+  - fuller support coverage
+  - budgeted selection on larger candidates
+- Bias away from:
+  - repeated small-candidate donor-backend A/Bs
+  - more entropy tuning without evidence that the candidate population itself improved
+
+## Review checklist
+
+When reviewing recent changes here, check:
+
+1. Is this still duplicating something that should now live in core?
+2. Is the US harness using shared core benchmarking helpers instead of rebuilding them inline?
+3. Are any benchmark claims relying on non-common-target comparisons?
+4. Is the work using PE-native broad loss when it claims mission progress?
+5. Does PE-US materialization handle dependency chains and partial failures safely?
+6. Is this baking in fixed tax-unit structure more deeply than necessary?
+
+## Be careful around
+
+- `src/microplex_us/policyengine/us.py`
+  - Large file with execution/materialization logic and remaining monolith risk.
+- `src/microplex_us/policyengine/harness.py`
+  - Should keep delegating more suite/result logic to core.
+- `src/microplex_us/pipelines/local_reweighting.py`
+  - Should remain a thin adapter over core bundle/reweighting surfaces.
+
+## Standard commands
+
+- Ruff: `uv run ruff check src tests`
+- Focused comparison/harness tests: `uv run pytest -q tests/policyengine/test_comparison.py tests/policyengine/test_harness.py`
+- Local reweighting tests: `uv run pytest -q tests/pipelines/test_local_reweighting.py`
+
+## Claude/Codex review shortcut
+
+For a quick review, read:
+
+1. [`/Users/maxghenis/CosilicoAI/microplex-us/AGENTS.md`](/Users/maxghenis/CosilicoAI/microplex-us/AGENTS.md)
+2. [`/Users/maxghenis/CosilicoAI/microplex-us/_WORKSPACE.md`](/Users/maxghenis/CosilicoAI/microplex-us/_WORKSPACE.md)
+3. [`/Users/maxghenis/CosilicoAI/microplex-us/_BUILD_LOG.md`](/Users/maxghenis/CosilicoAI/microplex-us/_BUILD_LOG.md)
+
+Then inspect changed files and return findings first.
+
+## Review handoff
+
+To avoid rebuilding long prompts in chat:
+
+1. Treat [`/Users/maxghenis/CosilicoAI/microplex-us/reviews/PENDING_CLAUDE_REVIEW.md`](/Users/maxghenis/CosilicoAI/microplex-us/reviews/PENDING_CLAUDE_REVIEW.md) as the current review request.
+2. Read that file after the standard repo context files above.
+3. Write the full review to a dated file under [`/Users/maxghenis/CosilicoAI/microplex-us/reviews/`](/Users/maxghenis/CosilicoAI/microplex-us/reviews/).
+4. Append only a concise summary to [`/Users/maxghenis/CosilicoAI/microplex-us/_BUILD_LOG.md`](/Users/maxghenis/CosilicoAI/microplex-us/_BUILD_LOG.md).
@@ -8,12 +8,19 @@ built on top of the generic `microplex` engine.
 - [Docs index](./docs/README.md)
 - [Architecture](./docs/architecture.md)
 - [Source semantics](./docs/source-semantics.md)
+- [Imputation conditioning contract](./docs/imputation-conditioning-contract.md)
 - [Benchmarking](./docs/benchmarking.md)
+- [Methodology ledger](./docs/methodology-ledger.md)
+- [PolicyEngine oracle compatibility path](./docs/policyengine-oracle-compatibility.md)
+- [PE construction parity](./docs/pe-construction-parity.md)
+- [Superseding `policyengine-us-data`](./docs/superseding-policyengine-us-data.md)
 
 ## Current focus
 
-`microplex-us` is being built as a library-first replacement path for
-`policyengine-us-data`:
+`microplex-us` is being built as a library-first US runtime with
+`policyengine-us` as the shared measurement operator and
+`policyengine-us-data` as the incumbent comparator, not as the thing we are
+trying to clone wholesale:
 
 - canonical source and target metadata
 - PE-US-compatible export
@@ -22,3 +29,8 @@ built on top of the generic `microplex` engine.
 
 The architecture is still evolving, so the docs are deliberately technical and
 operational rather than paper-like.
+
+Method-level decomposable-family bakeoffs now live in the sibling eval repo:
+`/Users/maxghenis/CosilicoAI/microplex-evals`. `microplex-us` should keep the
+runtime helpers and pipeline-adjacent diagnostics, not the long-lived eval
+orchestration and artifact curation.