When deep learning meets causal inference: a computational framework for drug repurposing from real-world data

Drug repurposing is an effective strategy to identify new uses for existing drugs, providing the quickest possible transition from bench to bedside. Existing methods for drug repurposing that mainly focus on pre-clinical information may exist translational issues when applied to human beings. Real world data (RWD), such as electronic health records and insurance claims, provide information on large cohorts of users for many drugs. Here we present an efficient and easily-customized framework for generating and testing multiple candidates for drug repurposing using a retrospective analysis of RWDs. Building upon well-established causal inference and deep learning methods, our framework emulates randomized clinical trials for drugs present in a large-scale medical claims database. We demonstrate our framework in a case study of coronary artery disease (CAD) by evaluating the effect of 55 repurposing drug candidates on various disease outcomes. We achieve 6 drug candidates that significantly improve the CAD outcomes but not have been indicated for treating CAD, paving the way for drug repurposing.

[1]  R. Tagliaferri,et al.  Discovery of drug mode of action and drug repositioning from transcriptional responses , 2010, Proceedings of the National Academy of Sciences.

[2]  Alexander A. Morgan,et al.  Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data , 2011, Science Translational Medicine.

[3]  R. D'Agostino Adjustment Methods: Propensity Score Methods for Bias Reduction in the Comparison of a Treatment to a Non‐Randomized Control Group , 2005 .

[4]  Oakland J. Peters,et al.  Predicting new indications for approved drugs using a proteochemometric method. , 2012, Journal of medicinal chemistry.

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  James M Robins,et al.  Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. , 2016, American journal of epidemiology.

[7]  Beau Dabbs,et al.  Summary and discussion of : “ Controlling the False Discovery Rate : A Practical and Powerful Approach to Multiple Testing , 2014 .

[8]  M. Kessentini,et al.  A Systematic Literature Review , 2016 .

[9]  G J Boerma,et al.  Effects of lipid lowering by pravastatin on progression and regression of coronary artery disease in symptomatic men with normal to moderately elevated serum cholesterol levels. The Regression Growth Evaluation Statin Study (REGRESS). , 1995, Circulation.

[10]  L. Cardon,et al.  Use of genome-wide association studies for drug repositioning , 2012, Nature Biotechnology.

[11]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[12]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[13]  Wei Xu,et al.  Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) , 2014, ICLR.

[14]  P. Sanseau,et al.  Drug repurposing: progress, challenges and recommendations , 2018, Nature Reviews Drug Discovery.

[15]  O. Faergeman,et al.  The effects of simvastatin on the incidence of heart failure in patients with coronary heart disease. , 1997, Journal of cardiac failure.

[16]  Vincent Mor,et al.  Principles for modeling propensity scores in medical research: a systematic literature review , 2004, Pharmacoepidemiology and drug safety.

[17]  Hsin-Chieh Yeh,et al.  Effect of the 2011 vs 2003 duty hour regulation-compliant models on sleep duration, trainee education, and continuity of patient care among internal medicine house staff: a randomized trial. , 2013, JAMA internal medicine.

[18]  P. Austin An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies , 2011, Multivariate behavioral research.

[19]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[20]  G. Plotnick,et al.  Beneficial effects of metoprolol in heart failure associated with coronary artery disease: a randomized trial. , 1994, Journal of the American College of Cardiology.

[21]  Ying Li,et al.  Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality , 2014, J. Am. Medical Informatics Assoc..

[22]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[23]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[25]  Philippe Sanseau,et al.  The role of translational bioinformatics in drug discovery. , 2011, Drug discovery today.

[26]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[27]  S. Nekolla,et al.  Effect of the Angiotensin Receptor Blocker Valsartan on Coronary Microvascular Flow Reserve in Moderately Hypertensive Patients with Stable Coronary Artery Disease , 2007, Microcirculation.

[28]  A. Mantel‐Teeuwisse,et al.  Drug repositioning and repurposing: terminology and definitions in literature. , 2015, Drug discovery today.

[29]  R. Califf,et al.  Real-World Evidence - What Is It and What Can It Tell Us? , 2016, The New England journal of medicine.

[30]  P. Mitchell,et al.  Fenofibrate - a potential systemic treatment for diabetic retinopathy? , 2012, American journal of ophthalmology.

[31]  Robert Tibshirani,et al.  Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy , 1986 .

[32]  Heng Luo,et al.  DPDR-CPI, a server that predicts Drug Positioning and Drug Repositioning via Chemical-Protein Interactome , 2016, Scientific Reports.

[33]  Alain Abran,et al.  A systematic literature review: Opinion mining studies from mobile app store user reviews , 2017, J. Syst. Softw..

[34]  Ralph B D'Agostino,et al.  Estimating treatment effects using observational data. , 2007, JAMA.

[35]  Albert-László Barabási,et al.  Network-based approach to prediction and population-based validation of in silico drug repurposing , 2018, Nature Communications.