Heterogeneous Treatment Effect Estimation using machine learning for Healthcare application: tutorial and benchmark

Background: Developing new drugs for target diseases is a time-consuming and expensive task, drug repurposing has become a popular topic in the drug development field. As much health claim data become available, many studies have been conducted on the data. The real-world data is noisy, sparse and has many confounding factors. In addition, many studies have shown that drugs effects are heterogeneous among the population. Lots of advanced machine learning models about estimating heterogeneous treatment effects (HTE) have emerged in recent years, and have been applied to in econometrics and machine learning communities. These studies acknowledge medicine and drug development as the main application area, but there has been limited translational research from the HTE methodology to drug development. Aim: We aim to introduce the HTE methodology to the healthcare area and provide feasibility consideration when translating the methodology with benchmark experiments on healthcare administrative claim data. Also, we want to use benchmark experiments to show how to interpret and evaluate the model when it is applied to healthcare research. Scope: For readers in statistics, machine learning, and economics communities, we wish to show them some real-world constraints when applying the HTE methodologies, with an example of large-scale claim data and a long-term progressive disease. For readers in the healthcare and drug development field, we aim to provide them with the feasibility of HTE estimation tools in researching drugs or therapies’ effects. Contribution: By introducing the recent HTE techniques to a broad readership in biomedical informatics communities, we expect to promote the wide adoption of causal inference using machine learning. We also expect to provide the feasibility of HTE for personalized drug effectiveness.

[1]  W. Newey,et al.  Double machine learning for treatment and causal parameters , 2016 .

[2]  Mihaela van der Schaar,et al.  Nonparametric Estimation of Heterogeneous Treatment Effects: From Theory to Learning Algorithms , 2021, AISTATS.

[3]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[4]  Mihaela van der Schaar,et al.  Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes , 2017, NIPS.

[5]  James M Robins,et al.  Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. , 2016, American journal of epidemiology.

[6]  Aidong Zhang,et al.  Representation Learning for Treatment Effect Estimation from Observational Data , 2018, NeurIPS.

[7]  Tyler J. VanderWeele,et al.  On the definition of a confounder , 2013, Annals of statistics.

[8]  Rajeev Dehejia,et al.  Propensity Score-Matching Methods for Nonexperimental Causal Studies , 2002, Review of Economics and Statistics.

[9]  Mihaela van der Schaar,et al.  Validating Causal Inference Models via Influence Functions , 2019, ICML.

[10]  M. Schaar,et al.  Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks , 2020, NeurIPS.

[11]  Hao Wang,et al.  Montelukast rescues primary neurons against Aβ1–42-induced toxicity through inhibiting CysLT1R-mediated NF-κB signaling , 2014, Neurochemistry International.

[12]  Mihaela van der Schaar,et al.  Deep-Treat: Learning Optimal Personalized Treatments From Observational Data Using Neural Networks , 2018, AAAI.

[13]  Mihaela van der Schaar,et al.  Limits of Estimating Heterogeneous Treatment Effects: Guidelines for Practical Algorithm Design , 2018, ICML.

[14]  H. Zhang,et al.  Exploring the feasibility of using real-world data from a large clinical data research network to simulate clinical trials of Alzheimer’s disease , 2020, npj Digital Medicine.

[15]  Ioana Bica,et al.  From Real‐World Patient Data to Individualized Treatment Effects Using Machine Learning: Current and Future Methods to Address Underlying Challenges , 2020, Clinical pharmacology and therapeutics.

[16]  Ping Zhang,et al.  A deep learning framework for drug repurposing via emulating clinical trials on real-world patient data , 2021, Nat. Mach. Intell..

[17]  David M. Blei,et al.  Adapting Neural Networks for the Estimation of Treatment Effects , 2019, NeurIPS.

[18]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[19]  Peter C Austin,et al.  A critical appraisal of propensity‐score matching in the medical literature between 1996 and 2003 , 2008, Statistics in medicine.

[20]  Joshua J. Armstrong,et al.  A call for comparative effectiveness research to learn whether routine clinical care decisions can protect from dementia and cognitive decline , 2016, Alzheimer's Research & Therapy.

[21]  P. Scheltens,et al.  The “rights” of precision drug development for Alzheimer’s disease , 2019, Alzheimer's Research & Therapy.

[22]  P. Schulz,et al.  Risk Factors for Progression to Alzheimer's Disease in African Americans in a Large National Cohort , 2020, medRxiv.

[23]  Ayman E El-Sahar,et al.  Montelukast attenuates rotenone‐induced microglial activation/p38 MAPK expression in rats: Possible role of its antioxidant, anti‐inflammatory and antiapoptotic effects , 2018, Toxicology and applied pharmacology.

[24]  Sandra E Black,et al.  Clinical, imaging, and pathological heterogeneity of the Alzheimer's disease syndrome , 2013, Alzheimer's Research & Therapy.

[25]  A. Kesselheim,et al.  Evaluation of Aducanumab for Alzheimer Disease: Scientific Evidence and Regulatory Review Involving Efficacy, Safety, and Futility. , 2021, JAMA.

[26]  Marco Caliendo,et al.  Some Practical Guidance for the Implementation of Propensity Score Matching , 2005, SSRN Electronic Journal.

[27]  B. Lamb,et al.  Inflammation as a central mechanism in Alzheimer's disease , 2018, Alzheimer's & dementia.

[28]  D. Jacob CATE meets ML - Conditional Average Treatment Effect and Machine Learning , 2021, 2104.09935.

[29]  L. Aigner,et al.  The leukotriene signaling pathway: a druggable target in Alzheimer's disease. , 2019, Drug discovery today.

[30]  Yun Fu,et al.  Matching on Balanced Nonlinear Representations for Treatment Effects Estimation , 2017, NIPS.

[31]  M. Glymour,et al.  Evaluating Public Health Interventions: 5. Causal Inference in Public Health Research-Do Sex, Race, and Biological Factors Cause Health Outcomes? , 2017, American journal of public health.

[32]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[33]  A. Bozek,et al.  Improved activity and mental function related to proper antiasthmatic treatment in elderly patients with Alzheimer's disease. , 2011, Allergy and asthma proceedings.

[34]  E. Stuart,et al.  Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies , 2015, Statistics in medicine.

[35]  M. Parascandola,et al.  Causation in epidemiology , 2001, Journal of epidemiology and community health.

[36]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[37]  K. Kendler,et al.  Applying Causal Inference Methods in Psychiatric Epidemiology: A Review. , 2019, JAMA psychiatry.

[38]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[39]  Uri Shalit,et al.  Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.

[40]  Marie Davidian,et al.  Doubly robust estimation of causal effects. , 2011, American journal of epidemiology.

[41]  D. Reda,et al.  Generalizability of heterogeneous treatment effects based on causal forests applied to two randomized clinical trials of intensive glycemic control. , 2021, Annals of epidemiology.

[42]  Edward H Kennedy,et al.  An introduction to g methods. , 2016, International journal of epidemiology.

[43]  Mélanie Frappier,et al.  The Book of Why: The New Science of Cause and Effect , 2018, Science.

[44]  Jason Roy,et al.  A SEMIPARAMETRIC MODELING APPROACH USING BAYESIAN ADDITIVE REGRESSION TREES WITH AN APPLICATION TO EVALUATE HETEROGENEOUS TREATMENT EFFECTS. , 2018, The annals of applied statistics.

[45]  Prescription database analyses indicates that the asthma medicine montelukast might protect against dementia: a hypothesis to be verified , 2017, Immunity & Ageing.

[46]  E. Newcombe,et al.  Inflammation: the link between comorbidities, genetics, and Alzheimer’s disease , 2018, Journal of Neuroinflammation.

[47]  Hector Zenil,et al.  Causal deconvolution by algorithmic generative models , 2019, Nature Machine Intelligence.

[48]  D. Chen,et al.  Statistical Causal Inferences and Their Applications in Public Health Research , 2016 .

[49]  S. Haeberlein,et al.  Clinical Development of Aducanumab, an Anti-Aβ Human Monoclonal Antibody Being Investigated for the Treatment of Early Alzheimer's Disease. , 2017, The journal of prevention of Alzheimer's disease.

[50]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[51]  P. Su,et al.  Heterogeneous Treatment Effects on Cardiovascular Diseases With Dipeptidyl Peptidase‐4 Inhibitors Versus Sulfonylureas in Type 2 Diabetes Patients , 2020, Clinical pharmacology and therapeutics.

[52]  Negar Hassanpour,et al.  Learning Disentangled Representations for CounterFactual Regression , 2020, ICLR.

[53]  G. Imbens,et al.  Matching on the Estimated Propensity Score , 2009 .

[54]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[55]  Michal Ozery-Flato,et al.  Framework for identifying drug repurposing candidates from observational healthcare data , 2020, JAMIA open.

[56]  J. Robins,et al.  Estimating causal effects from epidemiological data , 2006, Journal of Epidemiology and Community Health.

[57]  J. McGough,et al.  Estimating the size of treatment effects: moving beyond p values. , 2009, Psychiatry (Edgmont (Pa. : Township)).

[58]  Jiang Bian,et al.  Causal inference and counterfactual prediction in machine learning for actionable healthcare , 2020, Nature Machine Intelligence.

[59]  Sören R. Künzel,et al.  Metalearners for estimating heterogeneous treatment effects using machine learning , 2017, Proceedings of the National Academy of Sciences.

[60]  C. Plata-salamán,et al.  Inflammation and Alzheimer’s disease , 2000, Neurobiology of Aging.

[61]  Lawrence Carin,et al.  Counterfactual Representation Learning with Balancing Weights , 2021, AISTATS.

[62]  Aaron N. Winn,et al.  Using Group-based Trajectory Models and Propensity Score Weighting to Detect Heterogeneous Treatment Effects: The Case Study of Generic Hormonal Therapy for Women With Breast Cancer , 2019, Medical care.

[63]  P. Austin An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies , 2011, Multivariate behavioral research.

[64]  Russ B. Altman,et al.  Examining the Use of Real‐World Evidence in the Regulatory Process , 2019, Clinical pharmacology and therapeutics.

[65]  Uri Shalit,et al.  Learning Representations for Counterfactual Inference , 2016, ICML.

[66]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.