Deep representation learning for individualized treatment effect estimation using electronic health records

Utilizing clinical observational data to estimate individualized treatment effects (ITE) is a challenging task, as confounding inevitably exists in clinical data. Most of the existing models for ITE estimation tackle this problem by creating unbiased estimators of the treatment effects. Although valuable, learning a balanced representation is sometimes directly opposed to the objective of learning an effective and discriminative model for ITE estimation. We propose a novel hybrid model bridging multi-task deep learning and K-nearest neighbors (KNN) for ITE estimation. In detail, the proposed model firstly adopts multi-task deep learning to extract both outcome-predictive and treatment-specific latent representations from Electronic Health Records (EHR), by jointly performing the outcome prediction and treatment category classification. Thereafter, we estimate counterfactual outcomes by KNN based on the learned hidden representations. We validate the proposed model on a widely used semi-simulated dataset, i.e. IHDP, and a real-world clinical dataset consisting of 736 heart failure (HF) patients. The performance of our model remains robust and reaches 1.7 and 0.23 in terms of Precision in the estimation of heterogeneous effect (PEHE) and average treatment effect (ATE), respectively, on IHDP dataset, and 0.703 and 0.796 in terms of accuracy and F1 score respectively, on HF dataset. The results demonstrate that the proposed model achieves competitive performance over state-of-the-art models. In addition, the results reveal several findings which are consistent with existing medical domain knowledge, and discover certain suggestive hypotheses that could be validated through further investigations in the clinical domain.

[1]  Jonathon Shlens,et al.  Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[2]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[3]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[4]  Uri Shalit,et al.  Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.

[5]  Max Welling,et al.  Causal Effect Inference with Deep Latent-Variable Models , 2017, NIPS 2017.

[6]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[7]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: A General Method for Estimating Sampling Variances for Standard Estimators for Average Causal Effects , 2015 .

[8]  R. Prentice Use of the logistic model in retrospective studies. , 1976, Biometrics.

[9]  Patrick P. K. Chan,et al.  Convolutional Neural Networks based Click-Through Rate Prediction with Multiple Feature Sequences , 2018, IJCAI.

[10]  Jimeng Sun,et al.  Using recurrent neural network models for early detection of heart failure onset , 2016, J. Am. Medical Informatics Assoc..

[11]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[12]  Kenney Ng,et al.  Personalized Predictive Modeling and Risk Factor Identification using Patient Similarity , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[13]  Bo Li,et al.  Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing , 2017, KDD.

[14]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[15]  Juerg Schwitter,et al.  ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure 2012 , 2010, European journal of heart failure.

[16]  J H Ellenberg,et al.  Selection bias in observational and experimental studies. , 1994, Statistics in medicine.

[17]  A. Deaton,et al.  Understanding and Misunderstanding Randomized Controlled Trials , 2016, Social science & medicine.

[18]  M. Höfler,et al.  Causal inference based on counterfactuals , 2005, BMC medical research methodology.

[19]  Martin Bland,et al.  An Introduction to Medical Statistics , 1987 .

[20]  Ping Zhang,et al.  Risk Prediction with Electronic Health Records: A Deep Learning Approach , 2016, SDM.

[21]  David Sontag,et al.  Multi-task Prediction of Disease Onsets from Longitudinal Laboratory Tests , 2016, MLHC.

[22]  M. Drazner,et al.  2013 ACCF/AHA guideline for the management of heart failure: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines. , 2013, Journal of the American College of Cardiology.

[23]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[24]  Mihaela van der Schaar,et al.  GANITE: Estimation of Individualized Treatment Effects using Generative Adversarial Nets , 2018, ICLR.

[25]  Donald Steinwachs,et al.  Estimating Causal Effects in Observational Studies using Electronic Health Data: Challenges and (Some) Solutions , 2013, EGEMS.

[26]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[27]  Ralph B D'Agostino,et al.  Estimating treatment effects using observational data. , 2007, JAMA.

[28]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[29]  Melissa Nichols,et al.  Comparison of Propensity Score Methods and Covariate Adjustment: Evaluation in 4 Cardiovascular Studies. , 2016, Journal of the American College of Cardiology.

[30]  Constantin F. Aliferis,et al.  Predicting dire outcomes of patients with community acquired pneumonia , 2005, J. Biomed. Informatics.

[31]  Fei Wang,et al.  Measuring Patient Similarities via a Deep Architecture with Medical Concept Embedding , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[32]  Wojciech Samek,et al.  Methods for interpreting and understanding deep neural networks , 2017, Digit. Signal Process..

[33]  Martin Wattenberg,et al.  How to Use t-SNE Effectively , 2016 .

[34]  Changhee Lee,et al.  Estimation of Individual Treatment Effect in Latent Confounder Models via Adversarial Learning , 2018, ArXiv.

[35]  Anthonius de Boer,et al.  Systematic differences in treatment effect estimates between propensity score methods and logistic regression. , 2008, International journal of epidemiology.

[36]  Johannes Gehrke,et al.  Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[37]  P. Austin An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies , 2011, Multivariate behavioral research.

[38]  Mihaela van der Schaar,et al.  Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes , 2017, NIPS.

[39]  Eileen Munro,et al.  The limitations of randomized controlled trials in predicting effectiveness. , 2010, Journal of evaluation in clinical practice.

[40]  Fei Wang,et al.  A Framework for Mining Signatures from Event Sequences and Its Applications in Healthcare Data , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[42]  Xiao-Hua Zhou,et al.  Generalized propensity score for estimating the average treatment effect of multiple treatments , 2012, Statistics in medicine.

[43]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[44]  Andy Podgurski,et al.  Improving Health Care Outcomes through Personalized Comparisons of Treatment Effectiveness Based on Electronic Health Records , 2011, The Journal of law, medicine & ethics : a journal of the American Society of Law, Medicine & Ethics.

[45]  Jennifer G. Dy,et al.  Informative Subspace Learning for Counterfactual Inference , 2017, AAAI.

[46]  U. Rajendra Acharya,et al.  Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals , 2017, Comput. Biol. Medicine.

[47]  Marie Davidian,et al.  Doubly robust estimation of causal effects. , 2011, American journal of epidemiology.

[48]  Helmut Baumgartner,et al.  ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure 2012 , 2012, European journal of heart failure.

[49]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[50]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[51]  Yue Zhang,et al.  Event Factuality Identification via Generative Adversarial Networks with Auxiliary Classification , 2018, IJCAI.

[52]  Uri Shalit,et al.  Learning Representations for Counterfactual Inference , 2016, ICML.

[53]  Issa J Dahabreh,et al.  Using group data to treat individuals: understanding heterogeneous treatment effects in the age of precision medicine and patient-centred evidence. , 2016, International journal of epidemiology.

[54]  Bryan Lim,et al.  Forecasting Treatment Responses Over Time Using Recurrent Marginal Structural Networks , 2018, NeurIPS.

[55]  Mihaela van der Schaar,et al.  Deep-Treat: Learning Optimal Personalized Treatments From Observational Data Using Neural Networks , 2018, AAAI.

[56]  R. Coronel,et al.  Defining heart failure. , 2001, Cardiovascular research.

[57]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[58]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[59]  Gerasimos S Filippatos,et al.  2017 ACC/AHA/HFSA Focused Update of the 2013 ACCF/AHA Guideline for the Management of Heart Failure: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines and the Heart Failure Society of America. , 2017, Journal of the American College of Cardiology.

[60]  Mihaela van der Schaar,et al.  Deep Counterfactual Networks with Propensity-Dropout , 2017, ArXiv.

[61]  Sascha O. Becker,et al.  Estimation of Average Treatment Effects Based on Propensity Scores , 2002 .

[62]  J. Concato,et al.  Randomized, controlled trials, observational studies, and the hierarchy of research designs. , 2000, The New England journal of medicine.

[63]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[64]  Huilong Duan,et al.  A Regularized Deep Learning Approach for Clinical Risk Prediction of Acute Coronary Syndrome Using Electronic Health Records , 2018, IEEE Transactions on Biomedical Engineering.