Survey of modular ontology techniques and their applications in the biomedical domain

In the past several years, various ontologies and terminologies, such as the Gene Ontology, have been developed to enable interoperability across diverse medical information systems. They provide a standard way of representing terms and concepts, thereby supporting the transmission and interpretation of data across applications. However, as their use has grown, the number of available ontologies has increased considerably, and the ontologies themselves have become larger and more complex to manage. To address this, a growing body of work is emerging in the area of modular ontologies, where the emphasis is either on extracting and managing "modules" of an ontology relevant to a particular application scenario (ontology decomposition) or on developing modules independently and integrating them into a larger ontology (ontology composition). In this paper, we survey state-of-the-art approaches to modular ontologies, focusing on techniques based on rigorous logical formalisms and on well-studied graph theory. We analyze and compare how such approaches can be leveraged in developing tools and applications in the biomedical domain. We conclude by highlighting some limitations of current modular ontology formalisms and by putting forward additional requirements to steer their future development.
