Towards Distributed Information Retrieval in the Semantic Web: Query Reformulation Using the oMAP Framework

This paper introduces a general methodology for performing distributed search in the Semantic Web. We propose to define this task as a three steps process, namely resource selection, query reformulation/ontology alignment and rank aggregation/data fusion. For the second problem, we have implemented oMAP, a formal framework for automatically aligning OWL ontologies. In oMAP, different components are combined for finding suitable mapping candidates (together with their weights), and the set of rules with maximum matching probability is selected. Among these components, traditional terminological-based classifiers, machine learning-based classifiers and a new classifier using the structure and the semantics of the OWL ontologies are proposed. oMAP has been evaluated on international test sets.

[1]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[2]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[3]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[4]  King-Lup Liu,et al.  Efficient and effective metasearch for a large number of text databases , 1999, CIKM '99.

[5]  Norbert Fuhr Probabilistic Datalog: implementing logical information retrieval for advanced applications , 2000 .

[6]  James P. Callan,et al.  Query-based sampling of text databases , 2001, TOIS.

[7]  Mark A. Musen,et al.  Anchor-PROMPT: Using Non-Local Context for Semantic Matching , 2001, OIS@IJCAI.

[8]  Jamie Callan,et al.  DISTRIBUTED INFORMATION RETRIEVAL , 2002 .

[9]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[10]  Ian Horrocks,et al.  From SHIQ and RDF to OWL: the making of a Web Ontology Language , 2003, J. Web Semant..

[11]  Pedro M. Domingos,et al.  Learning to match ontologies on the Semantic Web , 2003, The VLDB Journal.

[12]  Umberto Straccia,et al.  Web metasearch: rank vs. score based rank aggregation methods , 2003, SAC '03.

[13]  Jérôme Euzenat,et al.  Similarity-Based Ontology Alignment in OWL-Lite , 2004, ECAI.

[14]  Ian Horrocks,et al.  A proposal for an owl rules language , 2004, WWW '04.

[15]  Steffen Staab,et al.  QOM - Quick Ontology Mapping , 2004, GI Jahrestagung.

[16]  Marc Ehrig,et al.  State of the art on ontology alignment , 2013 .

[17]  Jérôme Euzenat,et al.  An API for Ontology Alignment , 2004, SEMWEB.

[18]  Umberto Straccia,et al.  oMAP: Combining Classifiers for Aligning Automatically OWL Ontologies , 2005, WISE.

[19]  Umberto Straccia,et al.  oMAP: Results of the Ontology Alignment Contest , 2005, Integrating Ontologies.

[20]  Umberto Straccia,et al.  sPLMap: A Probabilistic Approach to Schema Matching , 2005, ECIR.

[21]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2003, Theor. Comput. Sci..

[22]  Steffen Staab,et al.  Bootstrapping Ontology Alignment Methods with APFEL , 2005, International Semantic Web Conference.

[23]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[24]  Fausto Giunchiglia,et al.  A Large Scale Taxonomy Mapping Evaluation , 2005, International Semantic Web Conference.

[25]  Stefanos D. Kollias,et al.  A String Metric for Ontology Alignment , 2005, SEMWEB.