Applications of artificial intelligence in drug development using real-world data

The US Food and Drug Administration (FDA) has been actively promoting the use of real-world data (RWD) in drug development. RWD can generate important real-world evidence reflecting the real-world clinical environment where the treatments are used. Meanwhile, artificial intelligence (AI), especially machine- and deep-learning (ML/DL) methods, have been increasingly used across many stages of the drug development process. Advancements in AI have also provided new strategies to analyze large, multidimensional RWD. Thus, we conducted a rapid review of articles from the past 20 years, to provide an overview of the drug development studies that use both AI and RWD. We found that the most popular applications were adverse event detection, trial recruitment, and drug repurposing. Here, we also discuss current research gaps and future opportunities.

[1]  J. Aronson,et al.  What is a clinical trial? , 2004, British journal of clinical pharmacology.

[2]  J Benschop,et al.  Comparison between generalized linear modelling and additive Bayesian network; identification of factors associated with the incidence of antibodies against Leptospira interrogans sv Pomona in meat workers in New Zealand. , 2017, Acta tropica.

[3]  T. J. Moore,et al.  Estimated Costs of Pivotal Trials for Novel Therapeutic Agents Approved by the US Food and Drug Administration, 2015-2016 , 2018, JAMA internal medicine.

[4]  Padraig Corcoran,et al.  Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach , 2019, JMIR medical informatics.

[5]  Shyam Visweswaran,et al.  Accrual to Clinical Trials (ACT): A Clinical and Translational Science Award Consortium Network , 2018, JAMIA open.

[6]  James M Robins,et al.  Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. , 2016, American journal of epidemiology.

[7]  Yonghui Wu,et al.  MADEx: A System for Detecting Medications, Adverse Drug Events, and Their Relations from Clinical Notes , 2019, Drug Safety.

[8]  R. Stewart,et al.  A Machine-Learning-Based Drug Repurposing Approach Using Baseline Regularization. , 2018, Methods in molecular biology.

[9]  D. Roden,et al.  Biobanks and Electronic Medical Records: Enabling Cost-Effective Research , 2014, Science Translational Medicine.

[10]  Philippe Sanseau,et al.  In silico prediction of novel therapeutic targets using gene–disease association data , 2017, Journal of Translational Medicine.

[11]  Sunghwan Sohn,et al.  Clinical concept extraction: A methodology review , 2020, J. Biomed. Informatics.

[12]  Sophia Ananiadou,et al.  Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods , 2019, J. Am. Medical Informatics Assoc..

[13]  Henrik Boström,et al.  Predictive modeling of structured electronic health records for adverse drug event detection , 2015, BMC Medical Informatics and Decision Making.

[14]  Yong Fan,et al.  Early Prediction Of Alzheimer’s Disease Dementia Based On Baseline Hippocampal MRI and 1-Year Follow-Up Cognitive Measures Using Deep Recurrent Neural Networks , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[15]  Yu-Chuan Li,et al.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers , 2015, MedInfo.

[16]  Peter J. Embi,et al.  Development of an Electronic Health Record-based Clinical Trial Alert System to Enhance Recruitment at the Point of Care , 2005, AMIA.

[17]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[18]  Elias Bareinboim,et al.  Transportability of Causal and Statistical Relations: A Formal Approach , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[19]  R. Califf,et al.  Real-World Evidence - What Is It and What Can It Tell Us? , 2016, The New England journal of medicine.

[20]  Ying Li,et al.  Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality , 2014, J. Am. Medical Informatics Assoc..

[21]  Hegler Tissot,et al.  Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial , 2019, IEEE Journal of Biomedical and Health Informatics.

[22]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[23]  Dimitra Pappa,et al.  Harnessing social media data for pharmacovigilance: a review of current state of the art, challenges and future directions , 2019, International Journal of Data Science and Analytics.

[24]  Justin Starren,et al.  Natural Language Processing for EHR-Based Pharmacovigilance: A Structured Review , 2017, Drug Safety.

[25]  49 – Drug development , 2015 .

[26]  Hamid Alinejad-Rokny,et al.  A data-driven, knowledge-based approach to biomarker discovery: application to circulating microRNA markers of colorectal cancer prognosis , 2018, npj Systems Biology and Applications.

[27]  F. Lewis,et al.  Revealing the complexity of health determinants in resource-poor settings. , 2012, American journal of epidemiology.

[28]  HemaSree Gns,et al.  An update on Drug Repurposing: Re-written saga of the drug's fate. , 2019, Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie.

[29]  Robert Bradford,et al.  Recruiting for a pragmatic trial using the electronic health record and patient portal: successes and lessons learned , 2018, J. Am. Medical Informatics Assoc..

[30]  Xi Yang,et al.  A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings , 2019, 2019 IEEE International Conference on Healthcare Informatics (ICHI).

[31]  Kazem Rahimi,et al.  BEHRT: Transformer for Electronic Health Records , 2019, Scientific Reports.

[32]  Judea Pearl,et al.  The seven tools of causal inference, with reflections on machine learning , 2019, Commun. ACM.

[33]  Elias Bareinboim,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[34]  Jiang Bian,et al.  Implementing a hash-based privacy-preserving record linkage tool in the OneFlorida clinical research network , 2019, JAMIA open.

[35]  W. Chung,et al.  Defining a comprehensive verotype using electronic health records for personalized medicine. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[36]  Pernille Warrer,et al.  Using text-mining techniques in electronic patient records to identify ADRs from medicine use. , 2012, British Journal of Clinical Pharmacology.

[37]  Dipak Kalra,et al.  Real world big data for clinical research and drug development. , 2017, Drug discovery today.

[38]  Fei Wang,et al.  Patient Subtyping via Time-Aware LSTM Networks , 2017, KDD.

[39]  Philip M. Kim,et al.  A systematic approach to identify novel cancer drug targets using machine learning, inhibitor design and high-throughput screening , 2014, Genome Medicine.

[40]  Bin Li,et al.  Applications of machine learning in drug discovery and development , 2019, Nature Reviews Drug Discovery.

[41]  Xi Yang,et al.  Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition , 2018, AMIA.

[42]  T. Davenport,et al.  The potential for artificial intelligence in healthcare , 2019, Future Healthcare Journal.

[43]  Yonghui Wu,et al.  Clinical concept extraction using transformers , 2020, J. Am. Medical Informatics Assoc..

[44]  J. Afseth,et al.  A review of the impact of utilising electronic medical records for clinical research recruitment , 2019, Clinical trials.

[45]  Francis S. Collins,et al.  PCORnet: turning a dream into reality , 2014, J. Am. Medical Informatics Assoc..

[46]  C. Christophi,et al.  Preliminary evidence of the association between monochlorinated bisphenol A exposure and type II diabetes mellitus: A pilot study , 2015, Journal of environmental science and health. Part A, Toxic/hazardous substances & environmental engineering.

[47]  Xiaoqian Jiang,et al.  Electronic Health Records for Drug Repurposing: Current Status, Challenges, and Future Directions , 2020, Clinical pharmacology and therapeutics.

[48]  Haipeng Shen,et al.  Artificial intelligence in healthcare: past, present and future , 2017, Stroke and Vascular Neurology.

[49]  N. Holford,et al.  Clinical Trial Simulation: A Review , 2010, Clinical pharmacology and therapeutics.

[50]  Graeme Hirst,et al.  Utility of social media and crowd-intelligence data for pharmacovigilance: a scoping review , 2018, BMC Medical Informatics and Decision Making.

[51]  W. Aronow Faculty Opinions recommendation of Continued statin prescriptions after adverse reactions and patient outcomes: A cohort study. , 2017 .

[52]  E. Sausville,et al.  CHAPTER 28 – Drug Discovery , 2007 .

[53]  Jiang Bian,et al.  Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review , 2020, J. Am. Medical Informatics Assoc..