Biomedical data integration and ontology-driven multi-facets visualization

With the proliferation of different heterogeneous biomedical data sources and with the growing amount of their content available over the Web, there is, on one side, the need to support mashing and data integration and, on the other side, the more urgent need to relate literature and research results that are often enclosed in unstructured textual documents. Nowadays, ontologies have been used as a common access knowledge layer playing a crucial role to support categorized access to the information resources. Moreover, manual construction of a domain-specific ontology and content categorization is a labor intensive and a time-consuming process. This work focuses on the development of a novel biomedical ontology-driven multi-facets visualization to support categorized access to heterogeneous and unstructured biomedical data sources (e.g., PubMed, WikiGenes). Specifically, the framework relies on: knowledge extraction methodology, to automatically extract ontology exploiting the Fuzzy Formal Concept Analysis theory; and ontology matching strategy to find relation between extracted ontology and the available ones in the field of biomedicine (e.g., Ontology of Gene and Genomes, Gene Ontology, Protein Ontology). The evaluation will be shown in terms of Precision and Recall by using biomedical ontology concepts as input query to the multi-facets visualization engine.

[1]  Hai-Tao Zheng,et al.  GOClonto: An ontological clustering approach for conceptualizing PubMed abstracts , 2010, J. Biomed. Informatics.

[2]  Ronald Fagin,et al.  Multi-structural databases , 2005, PODS '05.

[3]  Marti A. Hearst,et al.  Automating Creation of Hierarchical Faceted Metadata Structures , 2007, NAACL.

[4]  C Desiderio,et al.  Investigation of the in vitro biotransformation of R-(+)-thalidomide by HPLC, nano-HPLC, CEC and HPLC--APCI-MS. , 1999, Journal of chromatography. B, Biomedical sciences and applications.

[5]  Dunja Mladenic,et al.  OntoGen: Semi-automatic Ontology Editor , 2007, HCI.

[6]  Wojciech Jamroga,et al.  On module checking and strategies , 2014, AAMAS.

[7]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[8]  Eero Hyvönen,et al.  User-Centric Faceted Search for Semantic Portals , 2007, ESWC.

[9]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[10]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[11]  Torben Bach Pedersen,et al.  Using Semantic Web Technologies for Exploratory OLAP: A Survey , 2015, IEEE Transactions on Knowledge and Data Engineering.

[12]  Margherita Napoli,et al.  Verification of scope-dependent hierarchical state machines , 2008, Inf. Comput..

[13]  Vincenzo Loia,et al.  Hierarchical web resources retrieval by exploiting Fuzzy Formal Concept Analysis , 2012, Inf. Process. Manag..

[14]  Vedika Gupta,et al.  Deriving Business Intelligence from Unstructured Data , 2013 .

[15]  R. Hoffmann A wiki for the life sciences where authorship matters , 2008, Nature Genetics.

[16]  Chris Maloney,et al.  PubMed Central , 2017 .

[17]  S. M. González,et al.  Considering unstructured data for OLAP: a feasibility study using a systematic review , 2014 .

[18]  Muhammad Saleem,et al.  Big linked cancer data: Integrating linked TCGA and PubMed , 2014, J. Web Semant..

[19]  Paul Buitelaar,et al.  A Protégé Plug-In for Ontology Extraction from Text Based on Linguistic Analysis , 2004, ESWS.

[20]  Benedikt Kämpgen,et al.  Interacting with Statistical Linked Data via OLAP Operations , 2012, ILD@ESWC.

[21]  Aldo Gangemi,et al.  Ontology Learning and Its Application to Automated Terminology Translation , 2003, IEEE Intell. Syst..

[22]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[23]  Panagiotis G. Ipeirotis,et al.  Automatic construction of multifaceted browsing interfaces , 2005, CIKM '05.

[24]  Mike Thelwall,et al.  Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[25]  Daniel Tunkelang,et al.  Faceted Search , 2009, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[26]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[27]  Steffen Staab,et al.  OntoEdit: Collaborative Ontology Development for the Semantic Web , 2002, SEMWEB.

[28]  L. Beran,et al.  [Formal concept analysis]. , 1996, Casopis lekaru ceskych.

[29]  Ian Dickinson,et al.  Humboldt: Exploring Linked Data , 2008, LDOW.