You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix stale calibration targets by deriving time_period from dataset (#505)
* Fix stale calibration targets by deriving time_period from dataset
- Remove hardcoded CBO_YEAR and TREASURY_YEAR constants
- Add --dataset CLI argument to etl_national_targets.py
- Derive time_period from sim.default_calculation_period
- Default to HuggingFace production dataset
The dataset itself is now the single source of truth for the
calibration year, preventing future drift when updating to new
base years.
Closes#503
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Use income_tax_positive for CBO calibration in loss.py
The CBO income_tax parameter represents positive-only receipts (refundable
credit payments in excess of liability are classified as outlays, not
negative receipts). Using income_tax_positive matches this definition.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Add --dataset argument to all database ETL scripts
All ETL scripts now derive their target year from the dataset's
default_calculation_period instead of hardcoding years. This ensures
all calibration targets stay synchronized when updating to a new
base year annually.
Updated scripts:
- create_initial_strata.py
- etl_age.py
- etl_irs_soi.py (with configurable --lag for IRS data delay)
- etl_medicaid.py
- etl_snap.py
- etl_state_income_tax.py
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Add 119th Congress district code support for 2024 ACS data
- Update parse_ucgid to recognize both 5001800US (118th) and 5001900US (119th Congress)
- Expand Puerto Rico and territory filters to handle both Congress code formats
- Update TERRITORY_UCGIDS and NON_VOTING_GEO_IDS with 119th Congress codes
This ensures consistent redistricting alignment: 2024 ACS data uses 119th Congress
codes natively, and IRS SOI data is converted via the 116th→119th mapping matrix.
Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>
* Remove seed-related changes to reduce PR scope
Revert deterministic hash-based medicaid/SSI seed logic in cps.py,
update Makefile seed to 3526.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Upgrade policyengine-us to 1.550.1 in uv.lock
Needed for income_tax_positive variable used in loss.py.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Cherry-pick ACA PTC targets from PR #508 and update changelog
Adds aca_ptc ingestion from IRS SOI data (code 85530) to etl_irs_soi.py
and updates DATABASE_GUIDE.md to reflect stratum_group_id 119.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Split local area publish into build+stage and promote phases
Prevents silent no-op promotes by detecting when HF commits don't
change HEAD. Adds separate promote workflow for manual gate before
pushing staging files to production. Also bumps calibration epochs
from 200 to 250.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
description: Fix stale calibration targets by deriving time_period from dataset across all ETL scripts, using income_tax_positive for CBO calibration, and adding 119th Congress district code support for consistent redistricting alignment
4
+
- date: 2026-02-07
5
+
type: added
6
+
description: Add ACA Premium Tax Credit targets from IRS SOI data (cherry-picked from PR #508)
0 commit comments