Knowledge Beacons: Web Service Workflow for FAIR Data Harvesting of Distributed Biomedical Knowledge

The continually expanding distributed global compendium of biomedical knowledge is diffuse, heterogeneous and huge, posing a serious challenge for biomedical researchers in knowledge harvesting: accessing, compiling, integrating and interpreting data, information and knowledge. In order to accelerate research towards effective medical treatments and optimizing health, it is critical that efficient and automated tools for identifying key research concepts and their experimentally discovered interrelationships are developed. As an activity within the feasibility phase of a project called “Translator” (https://ncats.nih.gov/translator) funded by the National Center for Advancing Translational Sciences (NCATS) to develop a biomedical science knowledge management platform, we designed a Representational State Transfer (REST) web services Application Programming Interface (API) specification, which we call a Knowledge Beacon. Knowledge Beacons provide a standardized basic workflow for the discovery of concepts, their relationships and associated supporting evidence from distributed online repositories of biomedical knowledge. This specification also enforces the annotation of knowledge concepts and statements to the NCATS endorsed the Biolink Model data model and semantic encoding standards (https://biolink.github.io/biolink-model/). Implementation of this API on top of diverse knowledge sources potentially enables their uniform integration behind client software which will facilitate research access and integration of biomedical knowledge. Availability The API and associated software is open source and currently available for access at https://github.com/NCATS-Tangerine/translator-knowledge-beacon.

[1]  David S. Wishart,et al.  SMPDB 2.0: Big Improvements to the Small Molecule Pathway Database , 2013, Nucleic Acids Res..

[2]  Alasdair J. G. Gray,et al.  Addendum: The FAIR Guiding Principles for scientific data management and stewardship , 2019, Scientific Data.

[3]  Ruben Verborgh,et al.  Triple Pattern Fragments: A low-cost knowledge graph interface for the Web , 2016, J. Web Semant..

[4]  Mark D. Wilkinson,et al.  SADI, SHARE, and the in silico scientific method , 2010, BMC Bioinformatics.

[5]  Anne Morgat,et al.  Updates in Rhea – an expert curated resource of biochemical reactions , 2017, Nucleic Acids Res..

[6]  Tudor Groza,et al.  The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species , 2019, Nucleic Acids Res..

[7]  Halil Kilicoglu,et al.  SemMedDB: a PubMed-scale repository of biomedical semantic predications , 2012, Bioinform..

[8]  Dexter Pratt,et al.  NDEx 2.0: A Clearinghouse for Research on Cancer Pathways. , 2017, Cancer research.

[9]  David S. Wishart,et al.  HMDB 4.0: the human metabolome database for 2018 , 2017, Nucleic Acids Res..

[10]  Robert Stevens,et al.  Wrapping and Interoperating Bioinformatics Resources Using CORBA , 2000, Briefings Bioinform..

[11]  Erik Schultes,et al.  Knowledge.Bio: A Web Application for Exploring, Building and Sharing Webs of Biomedical Relationships Mined from PubMed , 2016 .

[12]  Timothy Clark,et al.  The FAIR Guiding Principles for scientific data management and stewardship (vol 15, 160018, 2016) , 2019 .

[13]  Alfonso Valencia,et al.  Interoperability with Moby 1.0--it's better than sharing your toothbrush! , 2008, Briefings in bioinformatics.