A ROBUST AND EFFICIENT APPROACH TO CAUSAL INFERENCE BASED ON SPARSE SUFFICIENT DIMENSION REDUCTION

A fundamental assumption in causal inference with observational data is that treatment assignment is ignorable given the measured confounding variables. This assumption of no missing confounders is more plausible when a large number of baseline covariates are included in the analysis, because we often have no prior knowledge of which variables are important confounders. Estimation of treatment effects with a large number of covariates has therefore received considerable attention in recent years. Most existing methods require specifying parametric models involving the outcome, treatment and confounding variables, and employ a variable selection procedure to identify confounders. However, selecting a proper set of confounders depends on correct specification of these working models, and the bias due to model misspecification and incorrect selection of confounding variables can yield misleading results. We propose a robust and efficient approach for inference about the average treatment effect via a flexible modeling strategy that incorporates penalized variable selection. Specifically, we consider an estimator based on the efficient influence function, which involves a propensity score and an outcome regression. We then propose a new sparse sufficient dimension reduction method to estimate these two functions without making restrictive parametric modeling assumptions. The proposed estimator of the average treatment effect is asymptotically normal and semiparametrically efficient, and it does not require variable selection consistency. The proposed methods are illustrated via simulation studies and a biomedical application.
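
As a rough illustration of an estimator built from the efficient influence function, the sketch below computes the standard augmented inverse probability weighted (AIPW) form of the average treatment effect from a propensity score and an outcome regression. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `aipw_ate` and its arguments are hypothetical, and the nuisance estimates (`e`, `m1`, `m0`) are taken as given, whereas the abstract proposes estimating them by sparse sufficient dimension reduction, which is not shown here.

```python
import math
import statistics


def aipw_ate(y, a, e, m1, m0):
    """Hypothetical helper: AIPW estimate of the average treatment effect.

    y  : observed outcomes
    a  : binary treatment indicators (1 = treated, 0 = control)
    e  : estimated propensity scores P(A = 1 | X), assumed strictly in (0, 1)
    m1 : estimated outcome regression E[Y | A = 1, X]
    m0 : estimated outcome regression E[Y | A = 0, X]

    All nuisance estimates are assumed to be supplied by the caller.
    """
    # Per-subject contribution: regression contrast plus
    # inverse-probability-weighted residual corrections.
    phi = [
        (m1_i - m0_i)
        + a_i * (y_i - m1_i) / e_i
        - (1 - a_i) * (y_i - m0_i) / (1 - e_i)
        for y_i, a_i, e_i, m1_i, m0_i in zip(y, a, e, m1, m0)
    ]
    n = len(phi)
    ate = sum(phi) / n
    # Plug-in standard error from the sample variance of the contributions.
    se = math.sqrt(statistics.variance(phi) / n) if n > 1 else float("nan")
    return ate, se


# Toy inputs (made up purely for illustration, not data from the paper).
ate, se = aipw_ate(
    y=[4.0, 1.0],
    a=[1, 0],
    e=[0.5, 0.5],
    m1=[3.0, 3.0],
    m0=[1.0, 1.0],
)
```

A Wald-type interval such as `ate ± 1.96 * se` relies on the asymptotic normality stated in the abstract; whether it is valid in a given application depends on how the nuisance functions were estimated.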
