Don't Mind Your Vocabulary: Data Sharing Across Heterogeneous Peers

The strong dynamics of peer-to-peer networks, coupled with the diversity of peer vocabularies, makes query processing in peer database systems a very challenging task. In this paper, we propose a framework for translating expressive relational queries across heterogeneous peer databases. Our framework avoids an integrated global schema or centralized structures common to the involved peers. The cornerstone of our approach is the use of both syntax and instance level schema mappings that each peer constructs and shares with other peers. Based on this user provided mapping information, our algorithm applies generic translation rules to translate SQL queries. Our approach supports both query translation and propagation among the peers preserving the autonomy of individual peers. The proposal combines both syntax and instance level mappings into a more general framework for query translation across heterogeneous boundaries. We have developed a prototype as a query service layer wrapped around a basic service providing heterogeneity management. The prototype has been evaluated on a small peer-to-peer network to demonstrate the viability of the approach.

[1]  Renée J. Miller,et al.  Data mapping in peer-to-peer systems: Semantics and algorithmic issues , 2003, SIGMOD 2003.

[2]  Laura M. Haas,et al.  Schema Mapping as Query Discovery , 2000, VLDB.

[3]  Kevin Chen-Chuan Chang,et al.  Mind your vocabulary: query mapping across heterogeneous information sources , 1999, SIGMOD '99.

[4]  Verena Kantere,et al.  The hyperion project: from data integration to data coordination , 2003, SGMD.

[5]  Klaus R. Dittrich,et al.  Query Explorativeness for Integrated Search in Heterogeneous Data Sources , 2002, CAiSE.

[6]  Fausto Giunchiglia,et al.  Making Peer Databases Interact - A Vision for an Architecture Supporting Data Coordination , 2002, CIA.

[7]  Fausto Giunchiglia,et al.  Local Relational Model: A Logical Formalization of Database Coordination , 2003, CONTEXT.

[8]  Gabriel M. Kuper,et al.  The coDB Robust Peer-to-Peer Database System , 2004, SEBD.

[9]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[10]  Renée J. Miller,et al.  Mapping data in peer-to-peer systems: semantics and algorithmic issues , 2003, SIGMOD '03.

[11]  Beng Chin Ooi,et al.  PeerDB: a P2P-based system for distributed data sharing , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[12]  Marcelo Arenas,et al.  Data Sharing Through Query Translation in Autonomous Sources , 2004, VLDB.

[13]  Gabriel M. Kuper,et al.  Queries and Updates in the coDB Peer to Peer Database System , 2004, VLDB.

[14]  Amin Vahdat,et al.  Efficient Peer-to-Peer Keyword Searching , 2003, Middleware.

[15]  Verena Kantere,et al.  Coordinating Peer Databases Using ECA Rules , 2003, DBISP2P.

[16]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[17]  Dan Suciu,et al.  The Piazza peer data management project , 2003, SGMD.

[18]  Dan Suciu,et al.  Schema mediation in peer data management systems , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[19]  Michael Boyd,et al.  AutoMed: A BAV Data Integration System for Heterogeneous Data Sources , 2004, CAiSE.