Semantic Data Integration in P2P Systems

In this paper, we study the problem of data integration in P2P systems. Differently from the traditional setting, data integration in these systems is not based on the existence of a global view. Instead, each peer exports data in terms of its own schema, and information integration is achieved by establishing mappings among the various peer schemas. We present a framework that captures this general architecture, and then we discuss the problem of characterizing the semantics of such framework. We show that the usual approach of resorting to a first-order logic intepretation of P2P mappings, leads both to a poor modeling of the whole system, and to undecidability of query answering, even for mappings of a restricted form. This motivates the need of a new semantics for P2P system. We then present a novel proposal, based on epistemic logic, and show that not only it adequately models the interactions among peers, but it also supports decidable query answering. In particular, for the restricted form of mapping mentioned above, query answering is polynomial with respect to the size of data stored in the peers.

[1]  Karl Aberer,et al.  Improving Data Access in P2P Systems , 2002, IEEE Internet Comput..

[2]  Thomas Lukasiewicz Proceedings of the 7th International Symposium on the Foundations of Information and Knowledge Systems‚ FoIKS 2012‚ Kiel‚ Germany‚ March 5−9‚ 2012 , 2000 .

[3]  Dan Suciu,et al.  What Can Database Do for Peer-to-Peer? , 2001, WebDB.

[4]  Andrea Calì,et al.  Query rewriting and answering under constraints in data integration systems , 2003, IJCAI.

[5]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2003, Theor. Comput. Sci..

[6]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[7]  Gerhard Lakemeyer,et al.  The logic of knowledge bases , 2000 .

[8]  Ronald Fagin,et al.  Data exchange: getting to the core , 2003, PODS '03.

[9]  Andrea Calì,et al.  Data integration under integrity constraints , 2004, Inf. Syst..

[10]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[11]  Dan Suciu,et al.  Adding Structure to Unstructured Data , 1997, ICDT.

[12]  Andrea Calì,et al.  On the decidability and complexity of query answering over inconsistent and incomplete databases , 2003, PODS.

[13]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[14]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[15]  Christoph Koch,et al.  Query rewriting with symmetric constraints , 2002, AI Commun..

[16]  Maurizio Lenzerini,et al.  Representing and Using Interschema Knowledge in Cooperative Information Systems , 1993, Int. J. Cooperative Inf. Syst..

[17]  Gabriel M. Kuper,et al.  Structural Properties of XPath Fragments , 2003, ICDT.

[18]  Todd D. Millstein,et al.  Navigational Plans For Data Integration , 1999, AAAI/IAAI.

[19]  G KolaitisPhokion,et al.  Data exchange: getting to the core , 2005 .

[20]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.