Evaluation of Linked, Open Data Sources for Mining Adverse Drug Reaction Signals

Linked Data is an emerging paradigm of publishing data in the Internet, accompanied with semantic annotations in a machine understandable fashion. The Internet provides vast data, useful in identifying Public Health trends, e.g. concerning the use of drugs, or the spread of diseases. Current practice of exploiting such data includes their combination from different sources, in order to reinforce their exploitation potential, based on unstructured data management practices and the Linked Data paradigm. In this paper, we present the design, the challenges and an evaluation of a Linked Data model to be used in the context of a platform exploiting social media and bibliographic data sources (namely, Twitter and PubMed), focusing on the application of Adverse Drug Reaction (ADR) signal identification. More specifically, we present the challenges of exploiting Bio2RDF as a Linked Open Data source in this respect, focusing on collecting, updating and normalizing data with the ultimate goal of identifying ADR signals, and evaluate the presented model against three reference evaluation datasets.

[1]  Yiannis Kompatsiaris,et al.  GalenOWL: Ontology-based drug recommendations discovery , 2012, J. Biomed. Semant..

[2]  Oktie Hassanzadeh,et al.  Extending the “Web of Drug Identity” with Knowledge Extracted from United States Product Labels , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[3]  Hyeon-Eui Kim,et al.  Deep mining heterogeneous networks of biomedical linked data to predict novel drug‐target associations , 2017, Bioinform..

[4]  Vassilis Koutkias,et al.  A Public Health Surveillance Platform Exploiting Free-Text Sources via Natural Language Processing and Linked Data: Application in Adverse Drug Reaction Signal Detection Using PubMed and Twitter , 2016, KR4HC/ProHealth@HEC.

[5]  R. Altman,et al.  Pharmacogenomics Knowledge for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.

[6]  Carole A. Goble,et al.  API-centric Linked Data integration: The Open PHACTS Discovery Platform case study , 2014, J. Web Semant..

[7]  M. Schuemie,et al.  Defining a Reference Set to Support Methodological Research in Drug Safety , 2013, Drug Safety.

[8]  Christophe G. Lambert,et al.  Bridging Islands of Information to Establish an Integrated Knowledge Base of Drugs and Health Outcomes of Interest , 2014, Drug Safety.

[9]  Damian Szklarczyk,et al.  STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data , 2015, Nucleic Acids Res..

[10]  Yen S. Low,et al.  Text Mining for Adverse Drug Events: the Promise, Challenges, and State of the Art , 2014, Drug Safety.

[11]  Milos Jovanovik,et al.  Consolidating drug data on a global scale using Linked Data , 2017, J. Biomed. Semant..

[12]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[13]  Michel Dumontier,et al.  Bio2RDF Release 2: Improved Coverage, Interoperability and Provenance of Life Science Linked Data , 2013, ESWC.

[14]  Paloma Martínez,et al.  DINTO: Using OWL Ontologies and SWRL Rules to Infer Drug-Drug Interactions and Their Mechanisms , 2015, J. Chem. Inf. Model..

[15]  Vassilis Koutkias,et al.  Large-scale adverse effects related to treatment evidence standardization (LAERTES): an open scalable system for linking pharmacovigilance evidence sources with clinical data , 2017, J. Biomed. Semant..

[16]  Olivier Bodenreider,et al.  A time-indexed reference standard of adverse drug reactions , 2014, Scientific Data.

[17]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[18]  Z. Bankowski,et al.  Council for International Organizations of Medical Sciences , 1991 .

[19]  Bin Chen,et al.  Assessing Drug Target Association Using Semantic Linked Data , 2012, PLoS Comput. Biol..

[20]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[21]  Cui Tao,et al.  Exploring the Pharmacogenomics Knowledge Base (PharmGKB) for Repositioning Breast Cancer Drugs by Leveraging Web Ontology Language (OWL) and Cheminformatics Approaches , 2013, Pacific Symposium on Biocomputing.

[22]  Takahiro Kawamura,et al.  IMAGE-BASED LITERAL NODE MATCHING FOR LINKED DATA INTEGRATION , 2014 .

[23]  Martijn J. Schuemie,et al.  A Reference Standard for Evaluation of Methods for Drug Safety Signal Detection Using Electronic Healthcare Record Databases , 2012, Drug Safety.

[24]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[25]  Peter J. Denning The locality principle , 2005, Commun. ACM.

[26]  J. Bajorath,et al.  Learning from 'big data': compounds and targets. , 2014, Drug discovery today.

[27]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[28]  Asunción Gómez-Pérez,et al.  The NeOn Methodology for Ontology Engineering , 2012, Ontology Engineering in a Networked World.

[29]  Yiannis Kompatsiaris,et al.  Panacea, a semantic-enabled drug recommendations discovery framework , 2014, VDOS+DO@ICBO.

[30]  Chris T. A. Evelo,et al.  Applying linked data approaches to pharmacology: Architectural decisions and implementation , 2014, Semantic Web.

[31]  Adrien Coulet,et al.  Learning from biomedical linked data to suggest valid pharmacogenes , 2017, Journal of Biomedical Semantics.

[32]  Benjamin M. Good,et al.  WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata , 2017, bioRxiv.

[33]  José Leomar Todesco,et al.  Knowledge Engineering: Survey of Methodologies, Techniques and Tools , 2014 .

[34]  Vít Novácek,et al.  Using Drug Similarities for Discovery of Possible Adverse Reactions , 2016, AMIA.

[35]  Sirarat Sarntivijai,et al.  Use of Biomedical Ontologies for Integration of Biological Knowledge for Learning and Prediction of Adverse Drug Reactions , 2017, Gene regulation and systems biology.

[36]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[37]  R. Doyle The American terrorist. , 2001, Scientific American.

[38]  Janet Sultana,et al.  Clinical and economic burden of adverse drug reactions , 2013, Journal of pharmacology & pharmacotherapeutics.

[39]  Egon L. Willighagen,et al.  Linked open drug data for pharmaceutical research and development , 2011, J. Cheminformatics.

[40]  Martin Necaský,et al.  Drug Encyclopedia - Linked Data Application for Physicians , 2015, International Semantic Web Conference.

[41]  Mark A. Musen,et al.  BioPortal as a dataset of linked biomedical ontologies and terminologies in RDF , 2013, Semantic Web.

[42]  Benjamin M. Good,et al.  Wikidata: A platform for data integration and dissemination for the life sciences and beyond , 2015, bioRxiv.