Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach

Individualized treatment rules (ITRs) are considered a promising recipe to deliver better policy interventions. One key ingredient in optimal ITR estimation problems is to estimate the average treatment effect conditional on a subject's covariate information, which is often challenging in observational studies due to the universal concern of unmeasured confounding. Instrumental variables (IVs) are widely-used tools to infer the treatment effect when there is unmeasured confounding between the treatment and outcome. In this work, we propose a general framework of approaching the optimal ITR estimation problem when a valid IV is allowed to only partially identify the treatment effect. We introduce a novel notion of optimality called "IV-optimality". A treatment rule is said to be IV-optimal if it minimizes the maximum risk with respect to the putative IV and the set of IV identification assumptions. We derive a bound on the risk of an IV-optimal rule that illuminates when an IV-optimal rule has favorable generalization performance. We propose a classification-based statistical learning method that estimates such an IV-optimal rule, design computationally-efficient algorithms, and prove theoretical guarantees. We contrast our proposed method to the popular outcome weighted learning (OWL) approach via extensive simulations, and apply our method to study which mothers would benefit from traveling to deliver their premature babies at hospitals with high level neonatal intensive care units.

[1]  Toru Kitagawa,et al.  The identification region of the potential outcome distributions under instrument independence , 2009, Journal of Econometrics.

[2]  Marie Davidian,et al.  Using decision lists to construct interpretable and parsimonious treatment regimes , 2015, Biometrics.

[3]  Yang Ning,et al.  Efficient augmentation and relaxation learning for individualized treatment rules using observational data , 2019, J. Mach. Learn. Res..

[4]  Erica E M Moodie,et al.  Demystifying Optimal Dynamic Treatment Regimes , 2007, Biometrics.

[5]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[6]  Marie Davidian,et al.  Interpretable Dynamic Treatment Regimes , 2016, Journal of the American Statistical Association.

[7]  Cecilia Elena Rouse,et al.  Democratization or Diversion? The Effect of Community Colleges on Educational Attainment , 1995 .

[8]  W. Barfield,et al.  Comparison of state risk-appropriate neonatal care policies with the 2012 AAP policy statement , 2018, Journal of Perinatology.

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[11]  James J. Heckman,et al.  Identification of Causal Effects Using Instrumental Variables: Comment , 1996 .

[12]  Mark J. van der Laan,et al.  Cross-Validated Targeted Minimum-Loss-Based Estimation , 2011 .

[13]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[14]  Ying Liu,et al.  Learning Optimal Individualized Treatment Rules from Electronic Health Record Data , 2016, 2016 IEEE International Conference on Healthcare Informatics (ICHI).

[15]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[16]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[17]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[18]  I. König,et al.  What is precision medicine? , 2017, European Respiratory Journal.

[19]  M. Baiocchi,et al.  Instrumental variable methods for causal inference , 2014, Statistics in medicine.

[20]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[21]  Martin Huber,et al.  Sharp IV Bounds on Average Treatment Effects on the Treated and Other Populations Under Endogeneity and Noncompliance , 2017 .

[22]  Marco Carone,et al.  Optimal Individualized Decision Rules Using Instrumental Variable Methods , 2020, Journal of the American Statistical Association.

[23]  James M. Robins,et al.  Analysis of the Binary Instrumental Variable Model , 2010 .

[24]  Dylan S. Small,et al.  Quantitative Evaluation of the Trade-Off of Strengthened Instruments and Sample Size in Observational Studies , 2018, Journal of the American Statistical Association.

[25]  Zahra Siddique,et al.  Partially Identified Treatment Effects Under Imperfect Compliance: The Case of Domestic Violence , 2013, SSRN Electronic Journal.

[26]  James M. Robins,et al.  Partial Identification of the Average Treatment Effect Using Instrumental Variables: Review of Methods for Binary Instruments, Treatments, and Outcomes , 2018, Journal of the American Statistical Association.

[27]  I. Johnstone,et al.  Minimax estimation via wavelet shrinkage , 1998 .

[28]  Vladimir Vapnik,et al.  Principles of Risk Minimization for Learning Theory , 1991, NIPS.

[29]  Nathan Kallus,et al.  Confounding-Robust Policy Improvement , 2018, NeurIPS.

[30]  J. Robins Estimation of the time-dependent accelerated failure time model in the presence of confounding factors , 1992 .

[31]  R. Hogg,et al.  On adaptive estimation , 1984 .

[32]  W. Newey,et al.  Double machine learning for treatment and causal parameters , 2016 .

[33]  R. Rochat,et al.  Perinatal regionalization for very low-birth-weight and very preterm infants: a meta-analysis. , 2010, JAMA.

[34]  Xiaojie Mao,et al.  Interval Estimation of Individual-Level Causal Effects Under Unobserved Confounding , 2018, AISTATS.

[35]  T. Cai,et al.  Minimax and Adaptive Inference in Nonparametric Function Estimation , 2012, 1203.4911.

[36]  J. Lafferty,et al.  Rodeo: Sparse, greedy nonparametric regression , 2008, 0803.1709.

[37]  Dylan S. Small,et al.  The Differential Impact of Delivery Hospital on the Outcomes of Premature Infants , 2012, Pediatrics.

[38]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[39]  Anastasios A. Tsiatis,et al.  Dynamic Treatment Regimes , 2019 .

[40]  Bibhas Chakraborty,et al.  Q‐learning for estimating optimal dynamic treatment rules from observational data , 2012, The Canadian journal of statistics = Revue canadienne de statistique.

[41]  M. Kosorok,et al.  Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer , 2011, Biometrics.

[42]  M. Kosorok,et al.  Reinforcement learning design for cancer clinical trials , 2009, Statistics in medicine.

[43]  Erwan Scornet,et al.  Minimax optimal rates for Mondrian trees and forests , 2018, The Annals of Statistics.

[44]  Bo Zhang,et al.  Selecting and Ranking Individualized Treatment Rules With Unmeasured Confounding , 2020, Journal of the American Statistical Association.

[45]  J M Robins,et al.  Marginal Mean Models for Dynamic Regimes , 2001, Journal of the American Statistical Association.

[46]  E. T. Tchetgen Tchetgen,et al.  A Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under Endogeneity , 2019, Journal of the American Statistical Association.

[47]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[48]  C. Manski Partial Identification of Probability Distributions , 2003 .

[49]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[50]  C. Manski,et al.  Monotone Instrumental Variables with an Application to the Returns to Schooling , 1998 .

[51]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[52]  Eric Tchetgen Tchetgen,et al.  Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables , 2016, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[53]  Michael I. Jordan,et al.  Convexity, Classification, and Risk Bounds , 2006 .

[54]  Eric B. Laber,et al.  Tree-based methods for individualized treatment regimes. , 2015, Biometrika.

[55]  James M. Robins,et al.  MINIMAX ESTIMATION OF A FUNCTIONAL ON A STRUCTURED , 2016 .

[56]  R. Dechter,et al.  Heuristics, Probability and Causality. A Tribute to Judea Pearl , 2010 .

[57]  J. Pearl,et al.  Bounds on Treatment Effects from Studies with Imperfect Compliance , 1997 .

[58]  Marco Loog,et al.  Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results , 2019, ArXiv.