A pipeline to extract drug-adverse event pairs from multiple data sources

Background: Pharmacovigilance aims to uncover and understand harmful side-effects of drugs, termed adverse events (AEs). Although the current process of pharmacovigilance is very systematic, the increasing amount of information available in specialized health-related websites as well as the exponential growth in medical literature presents a unique opportunity to supplement traditional adverse event gathering mechanisms with new-age ones.

Method: We present a semi-automated pipeline to extract associations between drugs and side effects from traditional structured adverse event databases, enhanced by potential drug-adverse event pairs mined from user-comments from health-related websites and MEDLINE abstracts. The pipeline was tested using a set of 12 drugs representative of two previous studies of adverse event extraction from health-related websites and MEDLINE abstracts.

Results: Testing the pipeline shows that mining non-traditional sources helps substantiate the adverse event databases. The non-traditional sources not only contain the known AEs, but also suggest some unreported AEs for drugs which can then be analyzed further.

Conclusion: A semi-automated pipeline to extract the AE pairs from adverse event databases as well as potential AE pairs from non-traditional sources such as text from MEDLINE abstracts and user-comments from health-related websites is presented.
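
As an illustration of the comparison described in the Results, the following minimal sketch separates candidate drug-AE pairs mined from non-traditional sources into those already present in a structured database and those that are not. It is not the authors' implementation: the function name `split_candidates`, the source labels, and the placeholder drug and AE names are all assumptions made for this example, and pairs are matched only after simple lowercasing.

```python
from collections import defaultdict


def split_candidates(known_pairs, candidates_by_source):
    """Split mined (drug, AE) pairs into corroborated and unreported sets.

    known_pairs: iterable of (drug, ae) tuples from a structured AE database.
    candidates_by_source: dict mapping a source label (hypothetical, e.g.
        "medline" or "user_comments") to an iterable of (drug, ae) tuples.
    Returns two dicts mapping each normalized pair to the sorted list of
    sources that mentioned it.
    """
    # Normalize case so that "Nausea" and "nausea" count as the same AE.
    known = {(drug.lower(), ae.lower()) for drug, ae in known_pairs}

    # Record which sources support each candidate pair.
    support = defaultdict(set)
    for source, pairs in candidates_by_source.items():
        for drug, ae in pairs:
            support[(drug.lower(), ae.lower())].add(source)

    corroborated = {p: sorted(s) for p, s in support.items() if p in known}
    unreported = {p: sorted(s) for p, s in support.items() if p not in known}
    return corroborated, unreported


# Placeholder data only; these are not results from the study.
known_pairs = [("drug_a", "nausea"), ("drug_a", "headache")]
candidates_by_source = {
    "medline": [("Drug_A", "Nausea"), ("drug_a", "rash")],
    "user_comments": [("drug_a", "nausea"), ("drug_a", "insomnia")],
}
corroborated, unreported = split_candidates(known_pairs, candidates_by_source)
```

Pairs in `unreported` are only candidates for further analysis, not confirmed adverse events; a real pipeline would also need to map drug and AE mentions to a controlled vocabulary before comparing them.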
