You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix h5 files by saving calibration geography artifact, and model fit resume function (#708)
* Save geography artifacts and add calibration resume/checkpoint support
Calibration now persists geography_assignment.npz alongside weights so
that downstream publish and worker steps use the exact same geography
instead of regenerating it randomly. Adds --resume-from and
--checkpoint-output flags to unified_calibration for continuing fits
from a saved checkpoint or warm-starting from weights. Also gitignores
*.csv.gz to prevent accidental commits of cached ORG data.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Run ruff format
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Add changelog fragment for PR 708
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Stub l0 module in test so patch works without l0-python installed
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Add self-employment and SSN card type count targets to calibration config
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Distinguish ITIN holders from SSN holders in CPS data
Fix calibration crash on string constraint variables (ssn_card_type) by
falling back from float32 cast when values are non-numeric.
Impute ITIN status for undocumented (code-0) persons: select tax units
with code-0 earners via weighted random sampling targeting 4.4M ITIN
returns (IRS NTA), then mark all code-0 members of those units. Updates
has_tin = (ssn_card_type != 0) | has_itin_number so ITIN holders
correctly qualify for ODC ($500 credit).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Fix PR 708 checkpoint and ID regressions
* Handle string ID fields in PUF cloning
* Fold taxpayer ID imputation into calibration resume PR
* Fix PUF subsample logging format
* fixes
* Expand PR 708 changelog summary
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Max Ghenis <mghenis@gmail.com>
Save calibration geography as a pipeline artifact, add ``--resume-from`` and checkpoint support for long-running calibration fits, and fix resume/artifact handling in the remote calibration pipeline. This also adds conservative CPS taxpayer-ID outputs (``has_tin``, ``has_valid_ssn``, and a temporary ``has_itin`` compatibility alias), plus string-valued constraint handling needed for ID-target calibration.
0 commit comments