A data-adaptive strategy for inverse weighted estimation of causal effects

In most nonrandomized observational studies, differences between treatment groups may arise not only due to the treatment but also because of the effect of confounders. Therefore, causal inference regarding the treatment effect is not as straightforward as in a randomized trial. To adjust for confounding due to measured covariates, the average treatment effect is often estimated by using propensity scores. Typically, propensity scores are estimated by logistic regression. More recent suggestions have been to employ nonparametric classification algorithms from machine learning. In this article, we propose a weighted estimator combining parametric and nonparametric models. Some theoretical results regarding consistency of the procedure are given. Simulation studies are used to assess the performance of the newly proposed methods relative to existing methods, and a data analysis example from the Surveillance, Epidemiology and End Results database is presented.

[1]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[2]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[3]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[4]  D. McCaffrey,et al.  Propensity score estimation with boosted regression for evaluating causal effects in observational studies. , 2004, Psychological methods.

[5]  Yuhong Yang Adaptive Regression by Mixing , 2001 .

[6]  Jens Hainmueller,et al.  Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies , 2012, Political Analysis.

[7]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[8]  J B Birch,et al.  A semiparametric approach to analysing dose-response data. , 2000, Statistics in medicine.

[9]  Ralf Bender,et al.  Generating survival times to simulate Cox proportional hazards models , 2005, Statistics in medicine.

[10]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.

[11]  Elizabeth A Stuart,et al.  Improving propensity score weighting using machine learning , 2010, Statistics in medicine.

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[14]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[15]  Mark J van der Laan,et al.  Super Learning: An Application to the Prediction of HIV-1 Drug Resistance , 2007, Statistical applications in genetics and molecular biology.

[16]  S. Schneeweiss,et al.  Evaluating uses of data mining techniques in propensity score estimation: a simulation study , 2008, Pharmacoepidemiology and drug safety.

[17]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[18]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[19]  Mark J. van der Laan,et al.  A semiparametric model selection criterion with applications to the marginal structural model , 2006, Comput. Stat. Data Anal..

[20]  Elizabeth A Stuart,et al.  Propensity score techniques and the assessment of measured covariate balance to test causal associations in psychological research. , 2010, Psychological methods.

[21]  L. Stefanski,et al.  The Calculus of M-Estimation , 2002 .

[22]  Kosuke Imai,et al.  Causal Inference With General Treatment Regimes , 2004 .

[23]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[24]  Sharon-Lise T Normand,et al.  On the use of discrete choice models for causal inference , 2005, Statistics in medicine.

[25]  Jeffrey B. Birch,et al.  Model robust regression: combining parametric, nonparametric, and semiparametric methods , 2001 .

[26]  D. Pregibon Resistant fits for some commonly used logistic models with medical application. , 1982, Biometrics.

[27]  G. Ridgeway The State of Boosting ∗ , 1999 .

[28]  R. Kronmal,et al.  Assessing the sensitivity of regression results to unmeasured confounders in observational studies. , 1998, Biometrics.

[29]  Bin Yu,et al.  Boosting with early stopping: Convergence and consistency , 2005, math/0508276.

[30]  Yuhong Yang,et al.  Combining Linear Regression Models , 2005, Journal of the American Statistical Association.

[31]  Debashis Ghosh,et al.  Combining Multiple Biomarker Models in Logistic Regression , 2008, Biometrics.

[32]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[33]  Luc Devroye,et al.  Consistency of Random Forests and Other Averaging Classifiers , 2008, J. Mach. Learn. Res..

[34]  K. Imai,et al.  Covariate balancing propensity score , 2014 .

[35]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[36]  Jean-Philippe Vert,et al.  Consistency of Random Forests , 2014, 1405.2881.

[37]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[38]  Lane F Burgette,et al.  A tutorial on propensity score estimation for multiple treatments using generalized boosted models , 2013, Statistics in medicine.

[39]  M. J. Laan,et al.  Targeted Learning: Causal Inference for Observational and Experimental Data , 2011 .

[40]  Jagbir Singh,et al.  A Semiparametric Approach to Hazard Estimation with Randomly Censored Observations , 1997 .

[41]  D. Heitjan,et al.  Sensitivity of the hazard ratio to nonignorable treatment assignment in an observational study , 2007, Statistics in medicine.

[42]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[43]  Sabine Mercier,et al.  H-Tuple Approach to Evaluate Statistical Significance of Biological Sequence Comparison with Gaps , 2007, Statistical applications in genetics and molecular biology.

[44]  Ingram Olkin,et al.  A Semiparametric Approach to Density Estimation , 1987 .

[45]  Nandita Mitra,et al.  Radiation therapy is associated with improved survival in the adjuvant and definitive treatment of intrahepatic cholangiocarcinoma. , 2008, International journal of radiation oncology, biology, physics.