Visualization Environment for Federated Knowledge Graphs: Development of an Interactive Biomedical Query Language and Web Application Interface

Background: Efforts are underway to semantically integrate large biomedical knowledge graphs using common upper-level ontologies to federate graph-oriented application programming interfaces (APIs) to the data. However, federation poses several challenges, including query routing to appropriate knowledge sources, generation and evaluation of answer subsets, semantic merger of those answer subsets, and visualization and exploration of results.

Objective: We aimed to develop an interactive environment for query, visualization, and deep exploration of federated knowledge graphs.

Methods: We developed a biomedical query language and web application interface, termed Translator Query Language (TranQL), to query semantically federated knowledge graphs and explore query results. TranQL uses the Biolink data model as an upper-level biomedical ontology, together with an API standard adopted by the Biomedical Data Translator Consortium that specifies a protocol for expressing a query as a graph of Biolink data elements compiled from statements in the TranQL query language. Queries are mapped to federated knowledge sources, and answers are merged into a knowledge graph, with mappings between the knowledge graph and specific elements of the query. The TranQL interactive web application includes a user interface to support user exploration of the federated knowledge graph.

Results: We developed 2 real-world use cases to validate TranQL and address biomedical questions of relevance to translational science. The use cases posed questions that traversed 2 federated Translator API endpoints: Integrated Clinical and Environmental Exposures Service (ICEES) and Reasoning Over Biomedical Objects linked in Knowledge Oriented Pathways (ROBOKOP). ICEES provides open access to observational clinical and environmental data, and ROBOKOP provides access to linked biomedical entities, such as “gene,” “chemical substance,” and “disease,” that are derived largely from curated public data sources. We successfully posed queries to TranQL that traversed these endpoints and retrieved answers that we visualized and evaluated.

Conclusions: TranQL can be used to ask questions of relevance to translational science, rapidly obtain answers that require assertions from a federation of knowledge sources, and provide valuable insights for translational research and clinical practice.
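
The merging step described in the Methods, in which answers from federated sources are combined into a single knowledge graph, can be illustrated with a minimal Python sketch. This is not the TranQL implementation: the function name `merge_answers`, the dictionary layout, the identifiers (`EX:chem1` and so on), and the source labels are all hypothetical and chosen only for illustration. The sketch assumes each source returns nodes with a shared identifier scheme, so that nodes and edges reported by more than one source can be collapsed while recording which sources asserted them.

```python
from typing import Any, Dict, List, Tuple

Answer = Dict[str, Any]  # hypothetical layout: {"source", "nodes", "edges"}


def merge_answers(answers: List[Answer]) -> Dict[str, Dict]:
    """Merge per-source answer graphs into one graph keyed by node id."""
    nodes: Dict[str, Dict[str, Any]] = {}
    edges: Dict[Tuple[str, str, str], Dict[str, Any]] = {}
    for answer in answers:
        for node in answer["nodes"]:
            # Nodes with the same identifier are treated as the same entity.
            merged = nodes.setdefault(node["id"], {"id": node["id"], "types": set()})
            merged["types"].update(node["types"])
        for source_id, predicate, target_id in answer["edges"]:
            # An edge is identified by (subject, predicate, object).
            key = (source_id, predicate, target_id)
            merged_edge = edges.setdefault(key, {"sources": set()})
            # Record which knowledge source asserted this edge.
            merged_edge["sources"].add(answer["source"])
    return {"nodes": nodes, "edges": edges}


# Two made-up answer subsets standing in for two federated endpoints.
source_a = {
    "source": "source_a",
    "nodes": [
        {"id": "EX:chem1", "types": ["chemical_substance"]},
        {"id": "EX:dis1", "types": ["disease"]},
    ],
    "edges": [("EX:chem1", "associated_with", "EX:dis1")],
}
source_b = {
    "source": "source_b",
    "nodes": [
        {"id": "EX:chem1", "types": ["chemical_substance"]},
        {"id": "EX:gene1", "types": ["gene"]},
        {"id": "EX:dis1", "types": ["disease"]},
    ],
    "edges": [
        ("EX:chem1", "interacts_with", "EX:gene1"),
        ("EX:gene1", "associated_with", "EX:dis1"),
        ("EX:chem1", "associated_with", "EX:dis1"),
    ],
}

merged = merge_answers([source_a, source_b])
```

Keying edges on the (subject, predicate, object) triple is one simple design choice; it keeps provenance per edge, so an exploration interface could show which sources support each assertion.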
