Identification of Adverse Drug Event–Related Japanese Articles: Natural Language Processing Analysis

Background Medical articles covering adverse drug events (ADEs) are systematically reported by pharmaceutical companies for drug safety information purposes. Although policies governing reporting to regulatory bodies vary among countries and regions, all medical article reporting may be categorized as precision or recall based. Recall-based reporting, which is implemented in Japan, requires the reporting of any possible ADE. Therefore, recall-based reporting can introduce numerous false negatives or substantial amounts of noise, a problem that is difficult to address using limited manual labor. Objective Our aim was to develop an automated system that could identify ADE-related medical articles, support recall-based reporting, and alleviate manual labor in Japanese pharmaceutical companies. Methods Using medical articles as input, our system based on natural language processing applies document-level classification to extract articles containing ADEs (replacing manual labor in the first screening) and sentence-level classification to extract sentences within those articles that imply ADEs (thus supporting experts in the second screening). We used 509 Japanese medical articles annotated by a medical engineer to evaluate the performance of the proposed system. Results Document-level classification yielded an F1 of 0.903. Sentence-level classification yielded an F1 of 0.413. These were averages of fivefold cross-validations. Conclusions A simple automated system may alleviate the manual labor involved in screening drug safety–related medical articles in pharmaceutical companies. After improving the accuracy of the sentence-level classification by considering a wider context, we intend to apply this system toward real-world postmarketing surveillance.

[1]  Eiji Aramaki,et al.  J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage , 2018, LREC.

[2]  Eric R. LaRose,et al.  Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure , 2017, JMIR medical informatics.

[3]  Juliane Fluck,et al.  Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports , 2012, J. Biomed. Informatics.

[4]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[5]  Yonatan Belinkov,et al.  Analysis Methods in Neural Language Processing: A Survey , 2018, TACL.

[6]  A. Hartzema,et al.  Adverse Drug Events: Identification and Attribution , 1987, Drug intelligence & clinical pharmacy.

[7]  Chirag Jain,et al.  A novel method for drug-adverse event extraction using machine learning , 2019, Informatics in Medicine Unlocked.

[8]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[9]  Taku Kudo,et al.  MeCab : Yet Another Part-of-Speech and Morphological Analyzer , 2005 .

[10]  Michele Filannino,et al.  2018 N2c2 Shared Task on Adverse Drug Events and Medication Extraction in Electronic Health Records , 2020, J. Am. Medical Informatics Assoc..

[11]  Kazuhiko Ohe,et al.  Orthographic Disambiguation Incorporating Transliterated Probability , 2008, IJCNLP.

[12]  Shoko Wakamiya,et al.  Extraction and Standardization of Patient Complaints from Electronic Medication Histories for Pharmacovigilance: Natural Language Processing Analysis in Japanese , 2018, JMIR medical informatics.

[13]  Investigational new drug safety reporting requirements for human drug and biological products and safety reporting requirements for bioavailability and bioequivalence studies in humans. Final rule. , 2010, Federal register.

[14]  Erik M. van Mulligen,et al.  Knowledge-based extraction of adverse drug events from biomedical text , 2014, BMC Bioinformatics.

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  B. S. Nilsson,et al.  Pharmacovigilance in the pharmaceutical industry. , 1998, British journal of clinical pharmacology.

[17]  M. Pirmohamed,et al.  Which drugs cause preventable admissions to hospital? A systematic review. , 2007, British journal of clinical pharmacology.

[18]  C. Dolea,et al.  World Health Organization , 1949, International Organization.

[19]  Jing Zhao,et al.  Detecting adverse drug events with multiple representations of clinical measurements , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[20]  Luca Toldo,et al.  Extraction of potential adverse drug events from medical case reports , 2012, Journal of biomedical semantics.

[21]  Fei Li,et al.  Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning , 2018, JMIR medical informatics.

[22]  Henrik Boström,et al.  Predictive modeling of structured electronic health records for adverse drug event detection , 2015, BMC Medical Informatics and Decision Making.

[23]  Maria Kvist,et al.  Identifying adverse drug event information in clinical notes with distributional semantic representations of context , 2015, J. Biomed. Informatics.

[24]  Cédric Bousquet,et al.  Mining Patients' Narratives in Social Media for Pharmacovigilance: Adverse Effects and Misuse of Methylphenidate , 2018, Front. Pharmacol..

[25]  Eiji Aramaki,et al.  MedEx/J: A One-Scan Simple and Fast NLP Tool for Japanese Clinical Texts , 2017, MedInfo.