Skip to content

Commit b312938

Browse files
TomeHirataclaude
andcommitted
Address PR review comments and update for latest codebase
- Add "(RCTs, also known as A/B tests)" clarification per reviewer feedback - Add DoWhy and EconML comparison in Statement of Need section - Add imperfect compliance paper reference for LDTE estimators - Update CAR paper to ICML'25 proceedings format with updated author list - Update multi-task learning paper author list (add Shunsuke Uto) - Fix Fisher citation key to match bibliography 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent f0c4af6 commit b312938

2 files changed

Lines changed: 37 additions & 11 deletions

File tree

paper.bib

Lines changed: 32 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -48,22 +48,48 @@ @article{scikit-learn
4848
year={2011}
4949
}
5050

51-
@misc{byambadalai2025efficientestimationdistributionaltreatment,
51+
@inproceedings{byambadalai2025efficientestimationdistributionaltreatment,
5252
title={On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization},
53-
author={Undral Byambadalai and Tatsushi Oka and Shota Yasui},
53+
author={Undral Byambadalai and Tomu Hirata and Tatsushi Oka and Shota Yasui},
54+
booktitle={Proceedings of the 42nd International Conference on Machine Learning},
5455
year={2025},
55-
eprint={2506.05945},
56-
archivePrefix={arXiv},
57-
primaryClass={econ.EM},
56+
series={ICML'25},
5857
url={https://arxiv.org/abs/2506.05945}
5958
}
6059

6160
@misc{hirata2025efficientscalableestimationdistributional,
6261
title={Efficient and Scalable Estimation of Distributional Treatment Effects with Multi-Task Neural Networks},
63-
author={Tomu Hirata and Undral Byambadalai and Tatsushi Oka and Shota Yasui},
62+
author={Tomu Hirata and Undral Byambadalai and Tatsushi Oka and Shota Yasui and Shunsuke Uto},
6463
year={2025},
6564
eprint={2507.07738},
6665
archivePrefix={arXiv},
6766
primaryClass={econ.EM},
6867
url={https://arxiv.org/abs/2507.07738}
6968
}
69+
70+
@misc{byambadalai2025imperfectcompliance,
71+
title={Beyond the Average: Distributional Causal Inference under Imperfect Compliance},
72+
author={Undral Byambadalai and Tomu Hirata and Tatsushi Oka and Shota Yasui},
73+
year={2025},
74+
eprint={2509.15594},
75+
archivePrefix={arXiv},
76+
primaryClass={econ.EM},
77+
url={https://arxiv.org/abs/2509.15594}
78+
}
79+
80+
@article{dowhy,
81+
title={DoWhy: An End-to-End Library for Causal Inference},
82+
author={Sharma, Amit and Kiciman, Emre},
83+
journal={arXiv preprint arXiv:2011.04216},
84+
year={2020},
85+
url={https://arxiv.org/abs/2011.04216}
86+
}
87+
88+
@inproceedings{econml,
89+
title={Causal Inference and Machine Learning in Practice with {EconML} and {CausalML}: Industrial Use Cases at {Microsoft}, {TripAdvisor}, {Uber}},
90+
author={Battocchi, Keith and Dillon, Eleanor and Hei, Maggie and Lewis, Greg and Ling, Miruna and Rao, Jing and Shyr, Dennis and Syrgkanis, Vasilis},
91+
booktitle={Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery \& Data Mining},
92+
pages={4072--4073},
93+
year={2021},
94+
organization={ACM}
95+
}

paper.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -38,9 +38,9 @@ bibliography: paper.bib
3838

3939
# Statement of Need
4040

41-
Randomized experiments have been fundamental to scientific inquiry since the pioneering work of @Fisher:1935, providing the gold standard for causal inference. While most experimental analyses focus on average treatment effects (ATEs), many research questions require understanding how treatments affect the entire distribution of outcomes, not just the mean. Distributional treatment effects (DTEs) capture these richer patterns, revealing heterogeneous impacts across different outcome levels that averages can mask.
41+
Randomized experiments (RCTs, also known as A/B tests) have been fundamental to scientific inquiry since the pioneering work of @fisher1935design, providing the gold standard for causal inference. While most experimental analyses focus on average treatment effects (ATEs), many research questions require understanding how treatments affect the entire distribution of outcomes, not just the mean. Distributional treatment effects (DTEs) capture these richer patterns, revealing heterogeneous impacts across different outcome levels that averages can mask.
4242

43-
Despite the growing importance of distributional analysis in fields ranging from economics to medicine, the Python ecosystem lacks comprehensive tools for DTE estimation. While SciPy provides basic empirical cumulative distribution functions, it offers no specialized functionality for treatment effect estimation, variance reduction, or confidence interval construction in experimental settings. Existing R packages like `RDDtools` focus on regression discontinuity rather than randomized experiments, and lack modern machine learning integration.
43+
Despite the growing importance of distributional analysis in fields ranging from economics to medicine, the Python ecosystem lacks comprehensive tools for DTE estimation. While SciPy provides basic empirical cumulative distribution functions, it offers no specialized functionality for treatment effect estimation, variance reduction, or confidence interval construction in experimental settings. Existing Python packages for causal inference, such as DoWhy [@dowhy] and EconML [@econml], focus primarily on average treatment effects and conditional average treatment effects (CATE), with EconML incorporating machine learning for heterogeneous treatment effect estimation. However, these packages do not address distributional treatment effects, which capture how treatments affect the entire outcome distribution rather than just the mean. Existing R packages like `RDDtools` focus on regression discontinuity rather than randomized experiments, and lack modern machine learning integration.
4444

4545
`dte_adj` addresses this gap by providing a comprehensive Python framework for distributional treatment effect analysis. The package implements state-of-the-art variance reduction techniques using machine learning models for regression adjustment [@byambadalai2024estimatingdistributionaltreatmenteffects], enabling more precise DTE estimates with smaller sample sizes. It supports multiple experimental designs including covariate-adaptive randomization [@byambadalai2025efficientestimationdistributionaltreatment] and local treatment effects, with a scikit-learn [@scikit-learn] compatible API that integrates seamlessly into existing machine learning workflows. This makes advanced distributional analysis accessible to the broader Python research community, supporting more nuanced causal inference in experimental studies.
4646

@@ -60,9 +60,9 @@ The package implements multiple estimator classes following a hierarchical desig
6060
- `SimpleStratifiedDistributionEstimator`: Handles stratified block randomization designs
6161
- `AdjustedStratifiedDistributionEstimator`: Combines stratification with ML-based variance reduction
6262

63-
**Local Distribution Estimators:**
64-
- `SimpleLocalDistributionEstimator`: Estimates local distributional treatment effects (LDTE)
65-
- `AdjustedLocalDistributionEstimator`: LDTE estimation with ML adjustment for improved precision
63+
**Local Distribution Estimators (for Imperfect Compliance):**
64+
- `SimpleLocalDistributionEstimator`: Estimates local distributional treatment effects (LDTE) for settings with imperfect compliance
65+
- `AdjustedLocalDistributionEstimator`: LDTE estimation with ML adjustment for improved precision [@byambadalai2025imperfectcompliance]
6666

6767
## Core Methods
6868

0 commit comments

Comments
 (0)