Role of text mining in early identification of potential drug safety issues.

Drugs are an important part of modern medicine, designed to treat, control, and prevent disease. Besides their therapeutic effects, however, drugs may also cause adverse effects that range from cosmetic problems to severe morbidity and mortality. To identify these potential safety issues early, each drug must be monitored throughout its life cycle: during development, through the successive phases of clinical trials, and after market approval. A major aim of pharmacovigilance is to identify potential drug-event associations that are novel in nature, severity, and/or frequency. The current state-of-the-art approach to signal detection relies on automated procedures that analyze vast quantities of data for clinical knowledge. A variety of resources are available for this task, and many of them are textual, requiring text analytics and natural language processing to derive high-quality information. This chapter focuses on the use of text mining techniques to identify potential drug safety issues from textual sources such as biomedical literature, consumer posts in social media, and narrative electronic medical records.


[60]  P. Neuvonen,et al.  Drug-related deaths in a university central hospital , 2002, European Journal of Clinical Pharmacology.

[61]  P. Bork,et al.  Network Neighbors of Drug Targets Contribute to Drug Side-Effect Similarity , 2011, PloS one.

[62]  Graham Wilcock,et al.  Unstructured Information Management Architecture (UIMA) , 2009 .

[63]  Joshua C. Denny,et al.  The KnowledgeMap Project: Development of a Concept-Based Medical School Curriculum Database , 2003, AMIA.

[64]  Russ B. Altman,et al.  Discovery and Explanation of Drug-Drug Interactions via Text Mining , 2011, Pacific Symposium on Biocomputing.

[65]  David Madigan,et al.  Disproportionality methods for pharmacovigilance in longitudinal observational databases , 2013, Statistical methods in medical research.

[66]  A. Fliri,et al.  Analysis of drug-induced effect patterns to link structure and side effects of medicines , 2005, Nature chemical biology.

[67]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[68]  J. Austin,et al.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. , 2002, Radiology.

[69]  Allen C. Browne,et al.  dTagger: A POS Tagger , 2006, AMIA.

[70]  C. Friedman,et al.  A drug-adverse event extraction algorithm to support pharmacovigilance knowledge mining from PubMed citations. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[71]  Graciela Gonzalez,et al.  BANNER: An Executable Survey of Advances in Biomedical Named Entity Recognition , 2007, Pacific Symposium on Biocomputing.

[72]  Yanqing Ji,et al.  A Potential Causal Association Mining Algorithm for Screening Adverse Drug Reactions in Postmarketing Surveillance , 2011, IEEE Transactions on Information Technology in Biomedicine.

[73]  Chitta Baral,et al.  Discovering drug–drug interactions: a text-mining and reasoning approach based on properties of drug metabolism , 2010, Bioinform..

[74]  Hua Xu,et al.  Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs , 2012, J. Am. Medical Informatics Assoc..

[75]  Lyle H. Ungar,et al.  Identifying potential adverse effects using the web: A new approach to medical hypothesis generation , 2011, J. Biomed. Informatics.

[76]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[77]  Bin Chen,et al.  Gaining Insight into Off-Target Mediated Effects of Drug Candidates with a Comprehensive Systems Chemical Biology Analysis , 2009, J. Chem. Inf. Model..

[78]  Siddhartha R. Dalal,et al.  Using information mining of the medical literature to improve drug safety , 2011, J. Am. Medical Informatics Assoc..

[79]  Stephanie Chung,et al.  Postmarketing surveillance of potentially fatal reactions to oncology drugs: potential utility of two signal-detection algorithms , 2004, European Journal of Clinical Pharmacology.

[80]  Thomas C. Rindflesch,et al.  MedPost: a part-of-speech tagger for bioMedical text , 2004, Bioinform..

[81]  Isabel Segura-Bedmar,et al.  The 1st DDIExtraction-2011 challenge task: Extraction of Drug-Drug Interactions from biomedical texts , 2011 .

[82]  C. Friedman,et al.  Detection of Pharmacovigilance‐Related Adverse Events Using Electronic Health Records and Automated Methods , 2012, Clinical pharmacology and therapeutics.

[83]  Manfred Hauben,et al.  Reports of hyperkalemia after publication of RALES—a pharmacovigilance study , 2006, Pharmacoepidemiology and drug safety.

[84]  Xiaoyan Wang,et al.  Active computerized pharmacovigilance using natural language processing, statistics, and electronic health records: a feasibility study. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[85]  T. J. Moore,et al.  Serious adverse drug events reported to the Food and Drug Administration, 1998-2005. , 2007, Archives of internal medicine.

[86]  Xiaoyan Wang,et al.  Selecting information in electronic health records for knowledge acquisition , 2010, J. Biomed. Informatics.

[87]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[88]  Christian von Mering,et al.  STITCH: interaction networks of chemicals and proteins , 2007, Nucleic Acids Res..

[89]  Munir Pirmohamed,et al.  Fortnightly review: Adverse drug reactions , 1998 .

[90]  M Lindquist,et al.  The WHO Programme for International Drug Monitoring, its database, and the technical support of the Uppsala Monitoring Center. , 2001, The Journal of rheumatology.

[91]  C Helma,et al.  Prediction of Adverse Drug Reactions Using Decision Tree Modeling , 2010, Clinical pharmacology and therapeutics.

[92]  Christopher C. Yang,et al.  Social media mining for drug safety signal detection , 2012, SHB '12.

[93]  Nigam H. Shah,et al.  Using Temporal Patterns in Medical Records to Discern Adverse Drug Events from Indications , 2012, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[94]  Eneko Agirre,et al.  Exploiting domain information for Word Sense Disambiguation of medical documents , 2011, J. Am. Medical Informatics Assoc..

[95]  Luca Toldo,et al.  Extraction of potential adverse drug events from medical case reports , 2012, Journal of biomedical semantics.

[96]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[97]  A Valencia,et al.  An Overview of BioCreative II.5 , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[98]  Donald R. Miller,et al.  Differential associations of beta‐blockers with hemorrhagic events for chronic heart failure patients on warfarin , 2006, Pharmacoepidemiology and drug safety.

[99]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[100]  Randolph A. Miller,et al.  Identifying QT prolongation from ECG impressions using a general-purpose Natural Language Processor , 2009, Int. J. Medical Informatics.

[101]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[102]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[103]  Jie Chen,et al.  Mining Unexpected Temporal Associations: Applications in Detecting Adverse Drug Reactions , 2008, IEEE Transactions on Information Technology in Biomedicine.

[104]  César de Pablo-Sánchez,et al.  Using a shallow linguistic kernel for drug-drug interaction extraction , 2011, J. Biomed. Informatics.

[105]  ChengXiang Zhai,et al.  An empirical study of tokenization strategies for biomedical information retrieval , 2007, Information Retrieval.

[106]  Maria Kvist,et al.  Exploration of Adverse Drug Reactions in Semantic Vector Space Models of Clinical Text , 2012, ICML 2012.

[107]  Paloma Martínez,et al.  A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents , 2011, BMC Bioinformatics.

[108]  N. Shah,et al.  Pharmacovigilance Using Clinical Notes , 2013, Clinical pharmacology and therapeutics.

[109]  W. DuMouchel,et al.  Unlocking Clinical Data from Narrative Reports: A Study of Natural Language Processing , 1995, Annals of Internal Medicine.

[110]  Jian Yang,et al.  Towards Internet-Age Pharmacovigilance: Extracting Adverse Drug Reactions from User Posts in Health-Related Social Networks , 2010, BioNLP@ACL.

[111]  Sunghwan Sohn,et al.  Drug side effect extraction from clinical narratives of psychiatry and psychology patients , 2011, J. Am. Medical Informatics Assoc..

[112]  Naomi L Kruhlak,et al.  Identification of structure-activity relationships for adverse effects of pharmaceuticals in humans: Part C: use of QSAR and an expert system for the estimation of the mechanism of action of drug-induced hepatobiliary and urinary tract toxicities. , 2009, Regulatory toxicology and pharmacology : RTP.

[113]  William R. Hersh,et al.  A survey of current work in biomedical text mining , 2005, Briefings Bioinform..

[114]  Azadeh Nikfarjam,et al.  Pattern mining for extraction of mentions of Adverse Drug Reactions from user comments. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[115]  Manfred Hauben,et al.  Signal detection in pharmacovigilance: empirical evaluation of data mining tools , 2005, Pharmacoepidemiology and drug safety.

[116]  Jules J Berman,et al.  Implementation and evaluation of a negation tagger in a pipeline-based system for information extract from pathology reports. , 2004, Studies in health technology and informatics.

[117]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[118]  C Michael Stein,et al.  Postmarketing Surveillance for Drug Safety: Surely We Can Do Better , 2004, Clinical pharmacology and therapeutics.

[119]  Hans Paulussen,et al.  DILEMMA-2: A Lemmatizer-Tagger For Medical Abstracts , 1992, Applied Natural Language Processing Conference.

[120]  Xiaoyan Wang,et al.  Characterizing environmental and phenotypic associations using information theory and electronic health records , 2009, BMC Bioinformatics.

[121]  Abdul Mateen Rajput,et al.  Automatic detection of adverse events to predict drug label changes using text and data mining techniques , 2013, Pharmacoepidemiology and drug safety.