Reducing the Semantic Heterogeneity of Unstructured P2P Systems: A Contribution Based on a Dissemination Protocol

In resource-sharing P2P systems with autonomous participants, each peer is free to choose the ontology with which it annotates its resources. Semantic heterogeneity occurs when the peers do not use the same ontology. For example, a contributing peer A (e.g. a doctor) may annotate its photos, diagrams, and data sets with an ontology of its own, while peer B (e.g. a geneticist) uses another one. To answer a query issued in the system, peers need to know alignments, which state correspondences between the entities of two ontologies. Assuming that each peer has partial initial knowledge of some alignments, we focus on sharing correspondences between peers as a means of learning additional correspondences. We first provide several measures of semantic heterogeneity that make it possible to draw a semantic picture of the system and to evaluate the efficiency of protocols independently of query evaluation. We then propose CorDis, a gossip-based protocol that disseminates the correspondences that the peers want to share in the system. To cope with the peers' storage limitations, we propose to consider a history of past queries and to favor the correspondences involving frequently used entities. We study several policies that a peer may adopt in case of inconsistency, i.e. when shared correspondences conflict with its own knowledge. We conduct experiments with a set of 93 ontologies actively used in the biomedical domain. We evaluate the CorDis protocol with respect to the proposed measures of semantic heterogeneity and show that it reduces them in several contexts.
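
The storage-aware dissemination idea described above can be illustrated with a minimal sketch. It is not the CorDis protocol itself: the `Peer` class, the `capacity` parameter, the additive scoring rule, and the push-pull exchange in `gossip_round` are all assumptions made for illustration, and the inconsistency policies mentioned in the abstract are left out. A correspondence is modelled simply as a pair of entity names.

```python
import random
from collections import Counter
from dataclasses import dataclass, field
from typing import Iterable, List, Set, Tuple

# Hypothetical model: a correspondence links one entity of an ontology
# to one entity of another ontology.
Correspondence = Tuple[str, str]


@dataclass
class Peer:
    name: str
    capacity: int  # assumed storage limit, in number of correspondences
    known: Set[Correspondence] = field(default_factory=set)
    history: Counter = field(default_factory=Counter)

    def record_query(self, entities: Iterable[str]) -> None:
        """Remember which entities appeared in a past query."""
        self.history.update(entities)

    def score(self, c: Correspondence) -> int:
        """Assumed scoring rule: how often the two entities were queried."""
        return self.history[c[0]] + self.history[c[1]]

    def merge(self, received: Iterable[Correspondence]) -> None:
        """Keep the highest-scoring correspondences, up to `capacity`."""
        candidates = self.known | set(received)
        # Sort by descending score; ties are broken lexicographically
        # so that the result is deterministic.
        ranked = sorted(candidates, key=lambda c: (-self.score(c), c))
        self.known = set(ranked[: self.capacity])


def gossip_round(peers: List[Peer], rng: random.Random) -> None:
    """Each peer exchanges its correspondences with one random partner."""
    for peer in peers:
        partner = rng.choice([p for p in peers if p is not peer])
        # Snapshot both sides first so the exchange is symmetric.
        from_peer, from_partner = set(peer.known), set(partner.known)
        peer.merge(from_partner)
        partner.merge(from_peer)
```

In this sketch, a peer that has often queried an entity keeps the correspondences involving it and drops less useful ones once its storage is full. Any other ranking based on the query history could be substituted in `score`.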
