The chatty web: emergent semantics through gossiping

This paper describes a novel approach for obtaining semantic interoperability among data sources in a bottom-up, semi-automatic manner without relying on pre-existing, global semantic models. We assume that large amounts of data exist that have been organized and annotated according to local schemas. Seeing semantics as a form of agreement, our approach enables the participating data sources to incrementally develop global agreement in an evolutionary and completely decentralized process that solely relies on pair-wise, local interactions: Participants provide translations between schemas they are interested in and can learn about other translations by routing queries (gossiping). To support the participants in assessing the semantic quality of the achieved agreements we develop a formal framework that takes into account both syntactic and semantic criteria. The assessment process is incremental and the quality ratings are adjusted along with the operation of the system. Ultimately, this process results in global agreement, i.e., the semantics that all participants understand. We discuss strategies to efficiently find translations and provide results from a case study to justify our claims. Our approach applies to any system which provides a communication infrastructure (existing websites or databases, decentralized systems, P2P systems) and offers the opportunity to study semantic interoperability as a global phenomenon in a network of information sharing parties.

[1]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[2]  Dieter Pfoser Indexing the Trajectories of Moving Objects , 2002 .

[3]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[4]  J. Frankel,et al.  The gnutella protocol specification v0.4 document revision 1.2 , 2000 .

[5]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[6]  Tore Risch,et al.  EDUTELLA: a P2P networking infrastructure based on RDF , 2002, WWW.

[7]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[8]  Jon M Kleinberg,et al.  Hubs, authorities, and communities , 1999, CSUR.

[9]  Ian Horrocks,et al.  OIL in a Nutshell , 2000, EKAW.

[10]  Trevor J. M. Bench-Capon,et al.  Kraft: An Agent Architecture for Knowledge Fusion , 2001, Int. J. Cooperative Inf. Syst..

[11]  Aris M. Ouksel,et al.  Ontologies are not the Panacea in Data Integration: A Flexible Coordinator to Mediate Context Construction , 2004, Distributed and Parallel Databases.

[12]  James A. Hendler,et al.  Owl web ontology language 1 , 2002 .

[13]  Babak Esfandiari,et al.  U-P2P: a peer-to-peer system for description and discovery of resource-sharing communities , 2002, Proceedings 22nd International Conference on Distributed Computing Systems Workshops.

[14]  Vipul Kashyap,et al.  OBSERVER: An Approach for Query Processing in Global Information Systems Based on Interoperation Across Pre-Existing Ontologies , 2000, Distributed and Parallel Databases.

[15]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.