Robust Nonparametric Testing for Causal Inference in Observational Studies

We consider the decision problem of making causal conclusions from observational data. Typically, using standard matched pairs techniques, there is a source of uncertainty that is not usually quantified, namely the uncertainty due to the choice of the experimenter: two different reasonable experimenters can easily have opposite results. In this work we present an alternative to the standard nonparametric hypothesis tests, where our tests are robust to the choice of experimenter. In particular, these tests provide the maximum and minimum P -value associated with the set of reasonable assignments of matched pairs. We create robust versions of the sign test, the Wilcoxon signed rank test, the Kolmogorov-Smirnov test, and the Wilcoxon rank sum test (also called the Mann-Whitney U test). keywords: causal inference, observational studies, nonparametric hypothesis test, matched pairs design, discrete optimization, integer programming.

[1]  Paul R. Rosenbaum,et al.  Optimal Matching for Observational Studies , 1989 .

[2]  Dylan S. Small,et al.  Stronger instruments via integer programming in an observational study of late preterm birth outcomes , 2013, 1304.4066.

[3]  Hadi Fanaee-T,et al.  Event labeling combining ensemble detectors and background knowledge , 2014, Progress in Artificial Intelligence.

[4]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[5]  Paul R. Rosenbaum,et al.  Matching for Balance, Pairing for Heterogeneity in an Observational Study of the Effectiveness of For-Profit and Not-For-Profit High Schools in Chile , 2014, 1404.3584.

[6]  Alexander G. Nikolaev,et al.  An optimization approach for making causal inferences , 2013 .

[7]  G. Nemhauser,et al.  Integer Programming , 2020 .

[8]  Md. Noor-E-Alam,et al.  Integer linear programming models for grid-based light post location problem , 2012, Eur. J. Oper. Res..

[9]  Paul R. Rosenbaum,et al.  Optimal Matching of an Optimally Chosen Subset in Observational Studies , 2012 .

[10]  Brian W. Kernighan,et al.  AMPL: A Modeling Language for Mathematical Programming , 1993 .

[11]  Gary King,et al.  Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference , 2007, Political Analysis.

[12]  Der-San Chen,et al.  Applied Integer Programming: Modeling and Solution , 2010 .

[13]  Christopher Winship,et al.  Counterfactuals and Causal Inference: Methods and Principles for Social Research , 2007 .

[14]  J. Zubizarreta Journal of the American Statistical Association Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure after Surgery Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure after Surgery , 2022 .

[15]  B. Hansen,et al.  Optimal Full Matching and Related Designs via Network Flows , 2006 .

[16]  D. Ayres-de- Campos,et al.  SisPorto 2.0: a program for automated analysis of cardiotocograms. , 2000, The Journal of maternal-fetal medicine.

[17]  S. Kruger Design Of Observational Studies , 2016 .

[18]  Luka Neralić Introduction to Mathematical Programming 1 , 2003 .

[19]  B. Hansen Full Matching in an Observational Study of Coaching for the SAT , 2004 .

[20]  Doreen Pfeifer,et al.  Statistics and Data Analysis , 1997 .

[21]  P. Holland Statistics and Causal Inference , 1985 .

[22]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[23]  西崎 一郎,et al.  数理計画法入門 = Introduction to mathematical programming , 2014 .