Optimal Transport for Counterfactual Estimation: A Method for Causal Inference

Many problems ask a question that can be formulated as a causal question: what would have happened if... ? For example, would the person have had surgery if he or she had been Black ? To address this kind of questions, calculating an average treatment effect (ATE) is often uninformative, because one would like to know how much impact a variable (such as skin color) has on a specific individual, characterized by certain covariates. Trying to calculate a conditional ATE (CATE) seems more appropriate. In causal inference, the propensity score approach assumes that the treatment is influenced by x , a collection of covariates. Here, we will have the dual view: doing an intervention, or changing the treatment (even just hypothetically, in a thought experiment, for example by asking what would have happened if a person had been Black) can have an impact on the values of x . We will see here that optimal transport allows us to change certain characteristics that are influenced by the variable we are trying to quantify the effect of. We propose here a mutatis mutandis version of the CATE, which will be done simply in dimension one by saying that the CATE must be computed relative to a level of probability, associated to the proportion of x (a single covariate) in the control population, and by looking for the equivalent quantile in the test population. In higher dimension, it will be necessary to go through transport, and an application will be proposed on the impact of some variables on the probability of having an unnatural birth (the fact that the mother smokes, or that the mother is Black).

[1]  N. C. Camgoz,et al.  D’ya Like DAGs? A Survey on Structure Learning and Causal Discovery , 2021, ACM Comput. Surv..

[2]  Arthur Charpentier Quantifying fairness and discrimination in predictive models , 2022, 2212.09868.

[3]  Carlos Matrán,et al.  Distribution and quantile functions, ranks and signs in dimension d: A measure transportation approach , 2021, The Annals of Statistics.

[4]  Robert P. Lieli,et al.  Counterfactual Treatment Effects: Estimation and Inference , 2020, Journal of Business & Economic Statistics.

[5]  Robert P. Lieli,et al.  Estimation of Conditional Average Treatment Effects With High-Dimensional Data , 2019, Journal of Business & Economic Statistics.

[6]  S. Athey,et al.  Estimating Treatment Effects with Causal Forests: An Application , 2019, Observational Studies.

[7]  Sören R. Künzel,et al.  Metalearners for estimating heterogeneous treatment effects using machine learning , 2017, Proceedings of the National Academy of Sciences.

[8]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[9]  Mélanie Frappier,et al.  The Book of Why: The New Science of Cause and Effect , 2018, Science.

[10]  Sanjog Misra,et al.  Heterogeneous Treatment Effects and Optimal Targeting Policy Evaluation , 2018 .

[11]  Trevor Hastie,et al.  Some methods for heterogeneous treatment effect estimation in high dimensions , 2017, Statistics in medicine.

[12]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[13]  Kari Lock Morgan,et al.  Balancing Covariates via Propensity Score Weighting , 2014, 1609.07494.

[14]  Jonathan M. V. Davis,et al.  Using Causal Forests to Predict Treatment Heterogeneity: An Application to Summer Jobs , 2017 .

[15]  K. Imai Quantitative social science : an introduction , 2017 .

[16]  A. Galichon,et al.  Optimal Transport Methods in Economics , 2016 .

[17]  E. Mach,et al.  The Science of Mechanics: A Critical and Historical Exposition of Its Principles , 2015 .

[18]  Filippo Santambrogio,et al.  Optimal Transport for Applied Mathematicians , 2015 .

[19]  Robert P. Lieli,et al.  Estimating Conditional Average Treatment Effects , 2014 .

[20]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[21]  Judea Pearl,et al.  Causal Inference , 2010 .

[22]  Arthur Cayley,et al.  The Collected Mathematical Papers: On Monge's “Mémoire sur la théorie des déblais et des remblais” , 2009 .

[23]  C. Villani Optimal Transport: Old and New , 2008 .

[24]  Nicholas J. Higham,et al.  Functions of matrices - theory and computation , 2008 .

[25]  Gary King,et al.  Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference , 2007, Political Analysis.

[26]  M. Hernán,et al.  The birth weight "paradox" uncovered? , 2006, American journal of epidemiology.

[27]  L. Kantorovich On the Translocation of Masses , 2006 .

[28]  C. Villani Topics in Optimal Transportation , 2003 .

[29]  A. Wilcox,et al.  On the importance--and the unimportance--of birthweight. , 2001, International journal of epidemiology.

[30]  R. McCann Exact solutions to the transportation problem on the line , 1999, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[31]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator , 1998 .

[32]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[33]  A. Wilcox Birth weight and perinatal mortality: the effect of maternal smoking. , 1993, American journal of epidemiology.

[34]  R. Brualdi Combinatorial Matrix Classes , 2006 .

[35]  Y. Brenier Polar Factorization and Monotone Rearrangement of Vector-Valued Functions , 1991 .

[36]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[37]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[38]  D. Rubin Matched Sampling for Causal Effects: Matching to Remove Bias in Observational Studies , 1973 .

[39]  J. McKinsey,et al.  The Problem of Counterfactual Conditionals. , 1947 .

[40]  Roderick M. Chisholm,et al.  The Contrary-to-Fact Conditional. , 1947 .