microplex-us is the US-specific country package built on top of the generic
microplex engine.
microplex: generic engine pieces
- source descriptors and observation frames
- fusion planning
- synthesis and calibration
- canonical target spec and provider protocol
- generic geography and entity abstractions
microplex-us: US-specific implementations
- CPS, PUF, and other source providers
- PE-US target import and compilation
- PE-US export and evaluation
- US experiment, registry, and artifact layers
Main entrypoint:
microplex_us.pipelines.USMicroplexPipeline
Current broad flow:
- Load one or more SourceProviders into ObservationFrames.
- Build a FusionPlan from the source descriptors.
- Choose a public structured scaffold source.
- Prepare canonical seed data from the scaffold.
- Integrate donor-only variables from other sources using source and variable capability metadata, with donor-block-specific automatic condition selection, declared condition-entity policy, and native-entity projection when entity IDs are available.
- Synthesize a new population.
- Build PolicyEngine-style entity tables.
- Materialize PE-derived features needed by targets.
- Calibrate against PE-US DB targets.
- Export a PE-ingestable H5 and evaluate against the full active target set.
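The donor-integration and calibration steps above can be illustrated with a toy, standard-library sketch. It is not the microplex-us implementation: the function names, the record layout, the hot-deck matching rule, and the single-factor calibration are all assumptions chosen to keep the example small. The real pipeline selects conditions automatically and calibrates against many PE-US targets at once.

```python
import random


def impute_from_donor(scaffold, donor, condition_vars, target_var, rng):
    """Hot-deck imputation (hypothetical helper): copy target_var from a
    random donor record that matches the scaffold record on condition_vars."""
    # Index donor values by their conditioning cell.
    cells = {}
    for rec in donor:
        key = tuple(rec[v] for v in condition_vars)
        cells.setdefault(key, []).append(rec[target_var])
    all_values = [rec[target_var] for rec in donor]

    fused = []
    for rec in scaffold:
        key = tuple(rec[v] for v in condition_vars)
        pool = cells.get(key, all_values)  # fall back to an unconditional draw
        fused.append({**rec, target_var: rng.choice(pool)})
    return fused


def calibrate_to_total(records, var, target_total):
    """Scale every weight by one factor so the weighted sum of var
    matches target_total (a deliberately simple stand-in for calibration)."""
    current = sum(r["weight"] * r[var] for r in records)
    if current == 0:
        raise ValueError("cannot calibrate: weighted total is zero")
    factor = target_total / current
    return [{**r, "weight": r["weight"] * factor} for r in records]


# Scaffold source: has weights and demographics but no dividends.
scaffold = [
    {"age_band": "young", "weight": 100.0},
    {"age_band": "old", "weight": 200.0},
]
# Donor source: observes the donor-only variable.
donor = [
    {"age_band": "young", "dividends": 10.0},
    {"age_band": "old", "dividends": 50.0},
]

fused = impute_from_donor(scaffold, donor, ["age_band"], "dividends", random.Random(0))
calibrated = calibrate_to_total(fused, "dividends", target_total=22_000.0)
calibrated_total = sum(r["weight"] * r["dividends"] for r in calibrated)
```

Each donor cell here holds a single value, so the imputation is deterministic; with larger donor pools the seeded random generator keeps runs reproducible.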
Important files:
src/microplex_us/pipelines/us.py
src/microplex_us/policyengine/us.py
src/microplex_us/policyengine/comparison.py
src/microplex_us/pipelines/artifacts.py
src/microplex_us/pipelines/index_db.py
- The package is library-first. The core build, artifact saving, experiment running, and frontier tracking all live in importable APIs.
- PolicyEngine evaluation uses the real policyengine-us-data targets DB as truth targets.
- Saved runs persist:
  - artifact bundle
  - policyengine_harness.json
  - run_registry.jsonl
  - run_index.duckdb
- Broad PE-US parity is not stable yet.
- The current US path is still scaffold-plus-donors rather than a fully symmetric multientity latent-population model.
- Held-out target evaluation is not the default loop yet.
- Local-area production replacement is still future work.
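As a rough illustration of the append-only run registry mentioned above (run_registry.jsonl), the sketch below writes one JSON object per line and reads the file back to pick the best run. The record fields (`run_id`, `loss`) and the helper names are hypothetical; the actual schema written by microplex-us is not documented here.

```python
import json
import tempfile
from pathlib import Path


def append_run(registry_path: Path, record: dict) -> None:
    """Append one run record as a single JSON line."""
    with registry_path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record, sort_keys=True) + "\n")


def load_runs(registry_path: Path) -> list:
    """Read every non-empty line back as a dict, oldest first."""
    with registry_path.open("r", encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]


with tempfile.TemporaryDirectory() as tmp:
    registry = Path(tmp) / "run_registry.jsonl"
    # Hypothetical fields: a run identifier and a scalar loss to minimise.
    append_run(registry, {"run_id": "demo-001", "loss": 0.42})
    append_run(registry, {"run_id": "demo-002", "loss": 0.37})
    runs = load_runs(registry)
    best = min(runs, key=lambda r: r["loss"])
```

Because each line is a complete JSON document, a run can be recorded with a single append and earlier entries never need rewriting.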
The intended long-run shape is:
- canonical source metadata
- canonical variable semantics
- multientity fusion
- derived-variable materialization after atomic modeling
- target compilation as a generic feature/filter/aggregation problem
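One way to read "target compilation as a generic feature/filter/aggregation problem" is sketched below. The `TargetSpec` fields and the `compile_target` helper are illustrative assumptions, not the canonical target spec that microplex defines; the example only shows how a target can reduce to a weighted aggregate of a feature over filtered rows.

```python
from dataclasses import dataclass
from typing import Callable, Mapping, Sequence

Row = Mapping[str, float]


@dataclass(frozen=True)
class TargetSpec:
    """Hypothetical target description: what to measure, on whom, and how."""
    name: str
    feature: Callable[[Row], float]   # value to aggregate per row
    row_filter: Callable[[Row], bool]  # which rows are in scope
    aggregation: str                   # "sum" or "count"


def compile_target(spec: TargetSpec, rows: Sequence[Row], weight_key: str = "weight") -> float:
    """Evaluate the weighted aggregate described by spec over rows."""
    selected = [r for r in rows if spec.row_filter(r)]
    if spec.aggregation == "sum":
        return sum(r[weight_key] * spec.feature(r) for r in selected)
    if spec.aggregation == "count":
        return sum(r[weight_key] for r in selected)
    raise ValueError(f"unsupported aggregation: {spec.aggregation}")


rows = [
    {"age": 30, "wages": 50_000.0, "weight": 2.0},
    {"age": 70, "wages": 0.0, "weight": 3.0},
    {"age": 45, "wages": 80_000.0, "weight": 1.0},
]

wages_under_65 = TargetSpec(
    name="wages_under_65",
    feature=lambda r: r["wages"],
    row_filter=lambda r: r["age"] < 65,
    aggregation="sum",
)
seniors = TargetSpec(
    name="population_65_plus",
    feature=lambda r: 1.0,
    row_filter=lambda r: r["age"] >= 65,
    aggregation="count",
)

wages_total = compile_target(wages_under_65, rows)
senior_count = compile_target(seniors, rows)
```

Under this framing, adding a new target means adding data (a spec), not new aggregation code.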
The current implementation is already moving in that direction:
- canonical target spec
- source capability registry
- variable semantic registry
- donor block specs with declared match strategies
- donor block specs with declared condition-entity policy
- variable semantics with declared projection aggregation for group-level donor fits
- automatic donor condition selection from source overlap plus data signal
- native-entity donor execution for tax-unit-native blocks when IDs are present
- full-target PE-US harness
But it is still an actively evolving system, not a finished paper architecture.
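The "declared projection aggregation for group-level donor fits" item above can be pictured as follows: person-level variables are rolled up to a group entity such as a tax unit, using one aggregation rule per variable. This is a minimal sketch under assumed names; the `PROJECTION` mapping, `project_to_group`, and the choice of `sum` for wages and `max` for age are hypothetical and do not reflect the package's variable semantic registry.

```python
from collections import defaultdict

# Hypothetical declaration: variable name -> aggregation over group members.
PROJECTION = {"wages": sum, "age": max}


def project_to_group(persons, group_id_key, projection):
    """Roll person-level rows up to one row per group entity."""
    groups = defaultdict(list)
    for person in persons:
        groups[person[group_id_key]].append(person)

    projected = []
    for group_id, members in groups.items():  # insertion order is preserved
        row = {group_id_key: group_id}
        for var, aggregate in projection.items():
            row[var] = aggregate(m[var] for m in members)
        projected.append(row)
    return projected


persons = [
    {"tax_unit_id": 1, "age": 40, "wages": 30_000.0},
    {"tax_unit_id": 1, "age": 38, "wages": 20_000.0},
    {"tax_unit_id": 2, "age": 70, "wages": 0.0},
]

tax_units = project_to_group(persons, "tax_unit_id", PROJECTION)
```

A donor model could then be fitted on `tax_units` rather than on individual persons when the donor block is native to the tax-unit entity.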