Appendix

Appendix A: Implementation Code

A.1 Quantile Regression Forest Implementation

The following code configures the Quantile Regression Forest (QRF) used for variable imputation:

from quantile_forest import RandomForestQuantileRegressor

qrf = RandomForestQuantileRegressor(
    n_estimators=100,
    min_samples_leaf=1,
    random_state=0
)
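
The appendix shows only the model configuration, so how the predicted quantiles become imputed values is not stated here. As a minimal sketch of one common approach (an assumption, not necessarily the authors' method), a value can be drawn by picking a quantile level uniformly at random and reading off the corresponding empirical quantile of a pool of donor values. The `leaf_values` list and both helper functions are hypothetical and use only the standard library; with the `quantile_forest` package itself, quantile levels would instead be passed through the `quantiles` argument of the fitted model's `predict` method.

```python
import random

def empirical_quantile(values, q):
    """Linearly interpolated empirical quantile of `values` at level q in [0, 1]."""
    ordered = sorted(values)
    position = q * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])

def impute_from_pool(pool, rng):
    """Draw one imputed value by sampling a quantile level uniformly at random."""
    return empirical_quantile(pool, rng.random())

# Hypothetical donor values standing in for a conditional distribution.
leaf_values = [0.0, 0.0, 1200.0, 3500.0, 9000.0]
rng = random.Random(0)

median = empirical_quantile(leaf_values, 0.5)
draws = [impute_from_pool(leaf_values, rng) for _ in range(1000)]
```

Sampling a quantile level rather than always taking the median keeps the spread of the imputed variable closer to that of the donor data.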

A.2 PyTorch Optimization for Reweighting

The reweighting step uses PyTorch's Adam optimizer to adjust log-transformed weights, minimizing the mean squared relative error between achieved and target totals:

import torch

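# Assumed inputs (not defined in this excerpt):
#   original_weights: positive 1-D tensor, one weight per record
#   loss_matrix: (records x targets) tensor of per-record contributions
#   targets: 1-D tensor of nonzero target totals

# Initialize with log of original weights; optimizing in log space
# keeps the exponentiated weights positive
log_weights = torch.log(original_weights)
log_weights.requires_grad = True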

# Adam optimizer
optimizer = torch.optim.Adam([log_weights], lr=0.1)

# Optimization loop
for iteration in range(5000):
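    # Recover the weights from their logs
    weights = torch.exp(log_weights)
    # Weighted total achieved for each target
    achieved = weights @ loss_matrix
    # Mean squared relative error across targets
    relative_errors = (achieved - targets) / targets
    loss = torch.mean(relative_errors ** 2)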
    
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
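
To make the objective concrete, the sketch below reproduces the same loss on a toy problem in plain Python. It is an illustration under stated assumptions rather than the production code: it swaps Adam for plain gradient descent with a hand-derived gradient, and the three records, two targets, and the `reweight` helper are all hypothetical.

```python
import math

def reweight(original_weights, loss_matrix, targets, lr=0.1, steps=5000):
    """Minimize mean squared relative error by gradient descent on log-weights."""
    log_w = [math.log(w) for w in original_weights]
    n, m = len(original_weights), len(targets)
    for _ in range(steps):
        w = [math.exp(v) for v in log_w]
        achieved = [sum(w[i] * loss_matrix[i][j] for i in range(n)) for j in range(m)]
        rel = [(achieved[j] - targets[j]) / targets[j] for j in range(m)]
        # Chain rule: d(loss)/d(log_w[i]) = w[i] * (2/m) * sum_j rel[j] * M[i][j] / targets[j]
        grad = [
            w[i] * (2.0 / m) * sum(rel[j] * loss_matrix[i][j] / targets[j] for j in range(m))
            for i in range(n)
        ]
        log_w = [v - lr * g for v, g in zip(log_w, grad)]
    return [math.exp(v) for v in log_w]

# Hypothetical toy data: three records, two targets (head count, total income).
original_weights = [1.0, 1.0, 1.0]
loss_matrix = [[1.0, 10.0], [1.0, 20.0], [1.0, 30.0]]
targets = [6.0, 130.0]

weights = reweight(original_weights, loss_matrix, targets)
achieved = [sum(w * row[j] for w, row in zip(weights, loss_matrix)) for j in range(len(targets))]
print(weights, achieved)
```

Because each update acts on the log-weights, every weight remains strictly positive without an explicit constraint, and dividing by the targets puts large and small totals on the same scale in the loss.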

Appendix B: Tables

Table A1: Complete List of Imputed Variables

Variables Imputed from IRS Public Use File (67 variables)

Income Variables:

  • employment_income
  • partnership_s_corp_income
  • social_security
  • taxable_pension_income
  • tax_exempt_pension_income
  • long_term_capital_gains
  • short_term_capital_gains
  • taxable_ira_distributions
  • self_employment_income
  • qualified_dividend_income
  • non_qualified_dividend_income
  • rental_income
  • taxable_unemployment_compensation
  • taxable_interest_income
  • tax_exempt_interest_income
  • estate_income
  • miscellaneous_income
  • farm_income
  • alimony_income
  • farm_rent_income
  • non_sch_d_capital_gains
  • long_term_capital_gains_on_collectibles
  • unrecaptured_section_1250_gain
  • salt_refund_income

Deductions and Adjustments:

  • interest_deduction
  • unreimbursed_business_employee_expenses
  • pre_tax_contributions
  • charitable_cash_donations
  • self_employed_pension_contribution_ald
  • domestic_production_ald
  • self_employed_health_insurance_ald
  • charitable_non_cash_donations
  • alimony_expense
  • health_savings_account_ald
  • student_loan_interest
  • investment_income_elected_form_4952
  • early_withdrawal_penalty
  • educator_expense
  • deductible_mortgage_interest

Tax Credits:

  • cdcc_relevant_expenses
  • foreign_tax_credit
  • american_opportunity_credit
  • general_business_credit
  • energy_efficient_home_improvement_credit
  • amt_foreign_tax_credit
  • excess_withheld_payroll_tax
  • savers_credit
  • prior_year_minimum_tax_credit
  • other_credits

Qualified Business Income Variables:

  • w2_wages_from_qualified_business
  • unadjusted_basis_qualified_property
  • business_is_sstb
  • qualified_reit_and_ptp_income
  • qualified_bdc_income
  • farm_operations_income
  • estate_income_would_be_qualified
  • farm_operations_income_would_be_qualified
  • farm_rent_income_would_be_qualified
  • partnership_s_corp_income_would_be_qualified
  • rental_income_would_be_qualified
  • self_employment_income_would_be_qualified

Other Tax Variables:

  • traditional_ira_contributions
  • qualified_tuition_expenses
  • casualty_loss
  • unreported_payroll_tax
  • recapture_of_investment_credit

Variables Imputed from Survey of Income and Program Participation (1 variable)

  • tip_income

Variables Imputed from Survey of Consumer Finances (3 variables)

  • networth
  • auto_loan_balance
  • auto_loan_interest

Variables Imputed from American Community Survey (2 variables)

  • rent
  • real_estate_taxes