Commit 23ce32c
Clarify SIPP is public-use; only IRS-PUF is access-restricted (#809)

John Sabelhaus corrected a licensing overclaim in the 2026-04-21
meeting: the SIPP vintage we consume (Census public-use SIPP) has no
per-user license, data-use agreement, or registration requirement. Of
the six upstream sources the pipeline ingests (CPS, ACS, SCF, ORG,
SIPP, IRS-PUF), only IRS-PUF has a genuine access restriction. The
HuggingFace mirror of pu2023.csv is a caching convenience, not an
access-restriction workaround.

This matters for TRACE / reproducibility writeups: overstating which
inputs are restricted distorts the institutional-certification story.

Fixes #808.
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 2db4cc7 commit 23ce32c
2 files changed
Lines changed: 20 additions & 0 deletions
File tree
- changelog.d (1 addition: line 1)
- policyengine_us_data/datasets/sipp (19 additions: lines 42-60)