论文信息 - Clinical Relation Extraction Toward Drug Safety Surveillance Using Electronic Health Record Narratives: Classical Learning Versus Deep Learning - 字舞流文

Clinical Relation Extraction Toward Drug Safety Surveillance Using Electronic Health Record Narratives: Classical Learning Versus Deep Learning

Background Medication and adverse drug event (ADE) information extracted from electronic health record (EHR) notes can be a rich resource for drug safety surveillance. Existing observational studies have mainly relied on structured EHR data to obtain ADE information; however, ADEs are often buried in the EHR narratives and not recorded in structured data. Objective To unlock ADE-related information from EHR narratives, there is a need to extract relevant entities and identify relations among them. In this study, we focus on relation identification. This study aimed to evaluate natural language processing and machine learning approaches using the expert-annotated medical entities and relations in the context of drug safety surveillance, and investigate how different learning approaches perform under different configurations. Methods We have manually annotated 791 EHR notes with 9 named entities (eg, medication, indication, severity, and ADEs) and 7 different types of relations (eg, medication-dosage, medication-ADE, and severity-ADE). Then, we explored 3 supervised machine learning systems for relation identification: (1) a support vector machines (SVM) system, (2) an end-to-end deep neural network system, and (3) a supervised descriptive rule induction baseline system. For the neural network system, we exploited the state-of-the-art recurrent neural network (RNN) and attention models. We report the performance by macro-averaged precision, recall, and F1-score across the relation types. Results Our results show that the SVM model achieved the best average F1-score of 89.1% on test data, outperforming the long short-term memory (LSTM) model with attention (F1-score of 65.72%) as well as the rule induction baseline system (F1-score of 7.47%) by a large margin. The bidirectional LSTM model with attention achieved the best performance among different RNN models. With the inclusion of additional features in the LSTM model, its performance can be boosted to an average F1-score of 77.35%. Conclusions It shows that classical learning models (SVM) remains advantageous over deep learning models (RNN variants) for clinical relation identification, especially for long-distance intersentential relations. However, RNNs demonstrate a great potential of significant improvement if more training data become available. Our work is an important step toward mining EHRs to improve the efficacy of drug safety surveillance. Most importantly, the annotated data used in this study will be made publicly available, which will further promote drug safety research in the community.

Hong Yu | Tsendsuren Munkhdalai | Feifan Liu | Tsendsuren Munkhdalai | Hong Yu | Feifan Liu

[1] Slav Petrov,et al. Globally Normalized Transition-Based Neural Networks , 2016, ACL.

[2] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3] D. Bates,et al. Incidence and preventability of adverse drug events among older persons in the ambulatory setting. , 2003, JAMA.

[4] Rong Xu,et al. Comparing a knowledge-driven approach to a supervised machine learning approach in large-scale extraction of drug-side effect relationships from free-text biomedical literature , 2015, BMC Bioinformatics.

[5] Zhiyuan Liu,et al. Relation Classification via Multi-Level Attention CNNs , 2016, ACL.

[6] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[7] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[8] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.

[9] Zhi Jin,et al. Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths , 2015, EMNLP.

[10] Shuying Shen,et al. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[11] Zina M. Ibrahim,et al. Identification of Adverse Drug Events from Free Text Electronic Patient Records and Information in a Large Mental Health Case Register , 2015, PloS one.

[12] Juan M Banda,et al. A curated and standardized adverse drug event resource to accelerate drug safety research , 2016, Scientific Data.

[13] S D Small,et al. Incidence of adverse drug events and potential adverse drug events. Implications for prevention. ADE Prevention Study Group. , 1995, JAMA.

[14] Makoto Miwa,et al. End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures , 2016, ACL.

[15] Keun Ho Ryu,et al. Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations , 2015, Journal of Cheminformatics.

[16] David W Bates,et al. Participation in an ambulatory e‐pharmacovigilance system , 2010, Pharmacoepidemiology and drug safety.

[17] Dong Wang,et al. Relation Classification: CNN or RNN? , 2016, NLPCC/ICCPOL.

[18] Hong Yu,et al. Automatically Detecting Acute Myocardial Infarction Events from EHR Text: A Preliminary Study , 2014, AMIA.

[19] N. Shah,et al. Pharmacovigilance Using Clinical Notes , 2013, Clinical pharmacology and therapeutics.

[20] Richard Platt,et al. Early adverse drug event signal detection within population‐based health networks using sequential methods: key methodologic considerations , 2009, Pharmacoepidemiology and drug safety.

[21] Jian Yang,et al. Towards Internet-Age Pharmacovigilance: Extracting Adverse Drug Reactions from User Posts in Health-Related Social Networks , 2010, BioNLP@ACL.

[22] Charlene R. Weir,et al. Critical Gaps in the World's Largest Electronic Medical Record: Ad Hoc Nursing Narratives and Invisible Adverse Drug Events , 2003, AMIA.

[23] Hong Yu,et al. Bidirectional RNN for Medical Event Detection in Electronic Health Records , 2016, NAACL.

[24] Subashan Perera,et al. Research Paper: A Systematic Review of the Performance Characteristics of Clinical Event Monitor Signals Used to Detect Adverse Drug Events in the Hospital Setting , 2007, J. Am. Medical Informatics Assoc..

[25] N. Laird,et al. Incidence of Adverse Drug Events and Potential Adverse Drug Events: Implications for Prevention , 1995 .

[26] Kazuhiko Ohe,et al. Extraction of Adverse Drug Effects from Clinical Records , 2010, MedInfo.

[27] S. Edlavitch,et al. Adverse drug event reporting. Improving the low US reporting rates. , 1988, Archives of internal medicine.

[28] A. Burgun,et al. Adverse Drug Reaction Identification and Extraction in Social Media: A Scoping Review , 2015, Journal of medical Internet research.

[29] Phil Blunsom,et al. Reasoning about Entailment with Neural Attention , 2015, ICLR.

[30] Sunil Kumar Sahu,et al. Relation extraction from clinical texts using domain invariant convolutional neural network , 2016, BioNLP@ACL.

[31] D E Knapp,et al. Discovery of adverse drug reactions. A comparison of selected phase IV studies with spontaneous reporting methods. , 1983, JAMA.

[32] Geoffrey E. Hinton,et al. Grammar as a Foreign Language , 2014, NIPS.

[33] Nanyun Peng,et al. Cross-Sentence N-ary Relation Extraction with Graph LSTMs , 2017, TACL.

[34] John F Hurdle,et al. High rates of adverse drug events in a highly computerized hospital. , 2005, Archives of internal medicine.

[35] Özlem Uzuner,et al. Extracting medication information from clinical text , 2010, J. Am. Medical Informatics Assoc..

[36] Robin E Ferner,et al. Internet accounts of serious adverse drug reactions: a study of experiences of Stevens-Johnson syndrome and toxic epidermal necrolysis. , 2012, Drug safety.

[37] Brett R South,et al. Looking for a needle in the haystack? A case for detecting adverse drug events (ADE) in clinical notes. , 2007, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[38] Maria Kvist,et al. Identifying adverse drug event information in clinical notes with distributional semantic representations of context , 2015, J. Biomed. Informatics.

[39] S D Small,et al. The costs of adverse drug events in hospitalized patients. Adverse Drug Events Prevention Study Group. , 1998, JAMA.

[40] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[41] Michel Dumontier,et al. Discovering associations between adverse drug events using pattern structures and ontologies , 2017, Journal of Biomedical Semantics.

[42] Anita Burgun,et al. Filtering Entities to Optimize Identification of Adverse Drug Reaction From Social Media: How Can the Number of Words Between Entities in the Messages Help? , 2017, JMIR public health and surveillance.

[43] D. Bates,et al. The Costs of Adverse Drug Events in Hospitalized Patients , 1997 .

[44] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[45] David W. Bates,et al. Research Paper: Using Computerized Data to Identify Adverse Drug Events in Outpatients , 2001, J. Am. Medical Informatics Assoc..

[46] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[47] Geoffrey I. Webb,et al. Supervised Descriptive Rule Induction , 2010, Encyclopedia of Machine Learning.

[48] Barbara Evans,et al. A policy framework for public health uses of electronic health data , 2012, Pharmacoepidemiology and drug safety.

[49] D. Classen,et al. Adverse drug events in hospitalized patients. Excess length of stay, extra costs, and attributable mortality. , 1997, JAMA.

[50] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[51] Tobias Glasmachers,et al. Limits of End-to-End Learning , 2017, ACML.

[52] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[53] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[54] Rainu Kaushal,et al. Technology Evaluation: Return on Investment for a Computerized Physician Order Entry System , 2006, J. Am. Medical Informatics Assoc..

[55] Hong Yu,et al. Automatically Recognizing Medication and Adverse Event Information From Food and Drug Administration’s Adverse Event Reporting System Narratives , 2014, JMIR medical informatics.

[56] Koldo Gojenola,et al. Learning to extract adverse drug reaction events from electronic health records in Spanish , 2016, Expert Syst. Appl..

[57] Guan Wang,et al. A method for systematic discovery of adverse drug events from clinical notes , 2015, J. Am. Medical Informatics Assoc..

[58] P. Corey,et al. Incidence of Adverse Drug Reactions in Hospitalized Patients , 2012 .

[59] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[60] Luca Toldo,et al. Extraction of potential adverse drug events from medical case reports , 2012, Journal of biomedical semantics.

[61] R S Evans,et al. Description of a computerized adverse drug event monitor using a hospital information system. , 1992, Hospital pharmacy.

[62] B Begaud,et al. False-positives in spontaneous reporting: should we worry about them? , 1994, British journal of clinical pharmacology.

[63] Jinfeng Yang,et al. Clinical Relation Extraction with Deep Learning , 2016 .

[64] J J Rybacki,et al. Adverse drug events in hospitalized patients. , 1997, JAMA.

[65] Shyam Visweswaran,et al. Detecting Adverse Drug Events in Discharge Summaries Using Variations on the Simple Bayes Model , 2003, AMIA.

[66] D. Levine,et al. Physician knowledge, attitudes, and behavior related to reporting adverse drug events. , 1988, Archives of internal medicine.

[67] Hua Xu,et al. Comparative analysis of pharmacovigilance methods in the detection of adverse drug reactions using electronic medical records , 2013, J. Am. Medical Informatics Assoc..

[68] Erik M. van Mulligen,et al. Knowledge-based extraction of adverse drug events from biomedical text , 2014, BMC Bioinformatics.

[69] Fei Li,et al. A neural joint model for entity and relation extraction from biomedical text , 2017, BMC Bioinformatics.

[70] Sepp Hochreiter,et al. The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[71] Dong Wang,et al. Relation Classification via Recurrent Neural Network , 2015, ArXiv.