OpenPVSignal: Advancing Information Search, Sharing and Reuse on Pharmacovigilance Signals via FAIR Principles and Semantic Web Technologies

Signal detection and management is a key activity in pharmacovigilance (PV). When a new PV signal is identified, the respective information is publicly communicated in the form of periodic newsletters or reports by organizations that monitor and investigate PV-related information (such as the World Health Organization and national PV centers). However, this type of communication does not allow for systematic access, discovery and explicit data interlinking and, therefore, does not facilitate automated data sharing and reuse. In this paper, we present OpenPVSignal, a novel ontology aiming to support the semantic enrichment and rigorous communication of PV signal information in a systematic way, focusing on two key aspects: (a) publishing signal information according to the FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles, and (b) exploiting automatic reasoning capabilities upon the interlinked PV signal report data. OpenPVSignal is developed as a reusable, extendable and machine-understandable model based on Semantic Web standards/recommendations. In particular, it can be used to model PV signal report data focusing on: (a) heterogeneous data interlinking, (b) semantic and syntactic interoperability, (c) provenance tracking and (d) knowledge expressiveness. OpenPVSignal is built upon widely-accepted semantic models, namely, the provenance ontology (PROV-O), the Micropublications semantic model, the Web Annotation Data Model (WADM), the Ontology of Adverse Events (OAE) and the Time ontology. To this end, we describe the design of OpenPVSignal and demonstrate its applicability as well as the reasoning capabilities enabled by its use. We also provide an evaluation of the model against the FAIR data principles. The applicability of OpenPVSignal is demonstrated by using PV signal information published in: (a) the World Health Organization's Pharmaceuticals Newsletter, (b) the Netherlands Pharmacovigilance Centre Lareb Web site and (c) the U.S. Food and Drug Administration (FDA) Drug Safety Communications, also available on the FDA Web site.

[1]  Patrick B. Ryan,et al.  Accuracy of an automated knowledge base for identifying drug adverse reactions , 2017, J. Biomed. Informatics.

[2]  Christophe G. Lambert,et al.  Bridging Islands of Information to Establish an Integrated Knowledge Base of Drugs and Health Outcomes of Interest , 2014, Drug Safety.

[3]  Oktie Hassanzadeh,et al.  Extending the “Web of Drug Identity” with Knowledge Extracted from United States Product Labels , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[4]  Christopher G. Chute,et al.  An Ontological Representation of Adverse Drug Events , 2011, ICBO.

[5]  Vassilis Koutkias,et al.  Large-scale adverse effects related to treatment evidence standardization (LAERTES): an open scalable system for linking pharmacovigilance evidence sources with clinical data , 2017, Journal of Biomedical Semantics.

[6]  Ian Horrocks,et al.  Handbook of Knowledge Representation Edited Description Logics 3.1 Introduction , 2022 .

[7]  Asunción Gómez-Pérez,et al.  The NeOn Methodology for Ontology Engineering , 2012, Ontology Engineering in a Networked World.

[8]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[9]  Sydney Nsw,et al.  Re: Version 2 of the National Safety and Quality Health Service Standards , 2015 .

[10]  Carole A. Goble,et al.  Micropublications: a semantic model for claims, evidence, arguments and annotations in biomedical communications , 2013, Journal of Biomedical Semantics.

[11]  James A. Hendler,et al.  The Semantic Web: A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities , 2001 .

[12]  Michel Dumontier,et al.  Bio2RDF Release 2: Improved Coverage, Interoperability and Provenance of Life Science Linked Data , 2013, ESWC.

[13]  Valentin Grouès,et al.  BioKB - Text mining and semantic technologies for the biomedical content discovery , 2017, SWAT4LS.

[14]  Carol Ezzell The $13-Billion Man , 2001 .

[15]  Marie-Christine Jaulent,et al.  Computational Approaches for Pharmacovigilance Signal Detection: Toward Integrated and Semantically-Enriched Frameworks , 2015, Drug Safety.

[16]  Christopher G. Chute,et al.  ADEpedia 2.0: Integration of Normalized Adverse Drug Events (ADEs) Knowledge from the UMLS , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[17]  Vassilis Koutkias,et al.  Evaluation of Linked, Open Data Sources for Mining Adverse Drug Reaction Signals , 2017, INSCI.

[18]  J. Bajorath,et al.  Learning from 'big data': compounds and targets. , 2014, Drug discovery today.

[19]  Dexter Hadley,et al.  Systematic integration of biomedical knowledge prioritizes drugs for repurposing , 2017, bioRxiv.

[20]  Cui Tao,et al.  OAE: The Ontology of Adverse Events , 2014, J. Biomed. Semant..

[21]  Alan Ruttenberg,et al.  The Logic of Surveillance Guidelines: An Analysis of Vaccine Adverse Event Reports from an Ontological Perspective , 2014, PloS one.

[22]  Sirarat Sarntivijai,et al.  Use of Biomedical Ontologies for Integration of Biological Knowledge for Learning and Prediction of Adverse Drug Reactions , 2017, Gene regulation and systems biology.

[23]  Robert Stevens,et al.  Post-coordination: Making things up as you go along , 2013 .

[24]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[25]  Sören Auer,et al.  The emerging web of linked data , 2011, ISWSA '11.

[26]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[27]  Jesse Weaver,et al.  Facebook Linked Data via the Graph API , 2013, Semantic Web.

[28]  Marie-Christine Jaulent,et al.  Exploiting heterogeneous publicly available data sources for drug safety surveillance: computational framework and case studies , 2017, Expert opinion on drug safety.

[29]  Mark A. Musen,et al.  The protégé project: a look back and a look forward , 2015, SIGAI.

[30]  S. Dzimira,et al.  Changes in gene expression in the lungs of Mg-deficient mice are related to an inflammatory process. , 2004, Magnesium research.

[31]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[32]  Simon Cox,et al.  Time Ontology in OWL , 2017 .

[33]  Marie-Christine Jaulent,et al.  OntoADR a semantic resource describing adverse drug reactions to support searching, coding, and information retrieval , 2016, J. Biomed. Informatics.

[34]  Alban Gaignard,et al.  From Scientific Workflow Patterns to 5-star Linked Open Data , 2016, TaPP.

[35]  Janet Sultana,et al.  Clinical and economic burden of adverse drug reactions , 2013, Journal of pharmacology & pharmacotherapeutics.

[36]  Egon L. Willighagen,et al.  Linked open drug data for pharmaceutical research and development , 2011, J. Cheminformatics.

[37]  Z. Bankowski,et al.  Council for International Organizations of Medical Sciences , 1991 .

[38]  Steven Pemberton,et al.  Web Annotation Data Model , 2017 .

[39]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[40]  Marie-Christine Jaulent,et al.  Formalizing MedDRA to support semantic reasoning on adverse drug reaction terms , 2014, J. Biomed. Informatics.