Estimating Optimal Treatment Rules with an Instrumental Variable: A Semi-Supervised Learning Approach

Individualized treatment rules (ITRs) are regarded as a promising recipe to deliver better policy interventions. One key ingredient in optimal ITR estimation problems is to estimate average treatment effect conditional on a subject's covariate information, which is often challenging in observational studies due to the universal concern of unmeasured confounding. Instrumental variables (IVs) are widely-used tools to infer treatment effect when there is unmeasured confounding between the treatment and outcome. In this work, we propose a general framework to approach the ITR estimation problem with a valid IV. Just as Zhang et al. (2012) and Zhao et al. (2012) cast the ITR estimation problem in non-IV settings into a supervised classification problem, we recast the ITR estimation problem with a valid IV into a semi-supervised classification problem consisting of a supervised and an unsupervised part. The unsupervised part stems naturally from the partial identification nature of an IV in terms of identifying the treatment effect. We define a new notion of optimality called "IV-optimality". A treatment rule is said to be IV-optimal if it minimizes the maximum risk with respect to the putative IV and the set of IV identification assumptions. We propose a statistical learning method that estimates such an IV-optimal rule, design computationally-efficient algorithms, and prove theoretical guarantees. We apply our method to studying which moms would better deliver their premature babies at hospitals with high-level neonatal intensive care units (NICUs).

[1]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[2]  T. Cai,et al.  Minimax and Adaptive Inference in Nonparametric Function Estimation , 2012, 1203.4911.

[3]  Erwan Scornet,et al.  Minimax optimal rates for Mondrian trees and forests , 2018, The Annals of Statistics.

[4]  Michael I. Jordan,et al.  Convexity, Classification, and Risk Bounds , 2006 .

[5]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[6]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[7]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[8]  Marco Loog,et al.  Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results , 2019, ArXiv.

[9]  James M. Robins,et al.  Analysis of the Binary Instrumental Variable Model , 2010 .

[10]  Zahra Siddique,et al.  Partially Identified Treatment Effects Under Imperfect Compliance: The Case of Domestic Violence , 2013, SSRN Electronic Journal.

[11]  James M. Robins,et al.  Partial Identification of the Average Treatment Effect Using Instrumental Variables: Review of Methods for Binary Instruments, Treatments, and Outcomes , 2018, Journal of the American Statistical Association.

[12]  I. Johnstone,et al.  Minimax estimation via wavelet shrinkage , 1998 .

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  J M Robins,et al.  Marginal Mean Models for Dynamic Regimes , 2001, Journal of the American Statistical Association.

[15]  Eric Tchetgen Tchetgen,et al.  Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables , 2016, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[16]  Yang Ning,et al.  Efficient augmentation and relaxation learning for individualized treatment rules using observational data , 2019, J. Mach. Learn. Res..

[17]  M. Kosorok,et al.  Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer , 2011, Biometrics.

[18]  W. Newey,et al.  Double machine learning for treatment and causal parameters , 2016 .

[19]  Eric B. Laber,et al.  Tree-based methods for individualized treatment regimes. , 2015, Biometrika.

[20]  Marie Davidian,et al.  Using decision lists to construct interpretable and parsimonious treatment regimes , 2015, Biometrics.

[21]  I. König,et al.  What is precision medicine? , 2017, European Respiratory Journal.

[22]  Nathan Kallus,et al.  Confounding-Robust Policy Improvement , 2018, NeurIPS.

[23]  James M. Robins,et al.  MINIMAX ESTIMATION OF A FUNCTIONAL ON A STRUCTURED , 2016 .

[24]  R. Dechter,et al.  Heuristics, Probability and Causality. A Tribute to Judea Pearl , 2010 .

[25]  J. Pearl,et al.  Bounds on Treatment Effects from Studies with Imperfect Compliance , 1997 .

[26]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[27]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[28]  Dylan S. Small,et al.  The Differential Impact of Delivery Hospital on the Outcomes of Premature Infants , 2012, Pediatrics.

[29]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[30]  M. Baiocchi,et al.  Instrumental variable methods for causal inference , 2014, Statistics in medicine.

[31]  R. Hogg,et al.  On adaptive estimation , 1984 .

[32]  J. Lafferty,et al.  Rodeo: Sparse, greedy nonparametric regression , 2008, 0803.1709.

[33]  Marie Davidian,et al.  Interpretable Dynamic Treatment Regimes , 2016, Journal of the American Statistical Association.

[34]  Mark J. van der Laan,et al.  Cross-Validated Targeted Minimum-Loss-Based Estimation , 2011 .

[35]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[36]  J. Robins Estimation of the time-dependent accelerated failure time model in the presence of confounding factors , 1992 .

[37]  E. T. Tchetgen Tchetgen,et al.  A Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under Endogeneity , 2019, Journal of the American Statistical Association.

[38]  Vladimir Vapnik,et al.  Principles of Risk Minimization for Learning Theory , 1991, NIPS.

[39]  Anastasios A. Tsiatis,et al.  Dynamic Treatment Regimes , 2019 .