Data management in the APPA P2P system

Peer-to-peer (P2P) computing offers new opportunities for building highly distributed data systems. Unlike client-server computing, P2P is a very dynamic environment in which peers can join and leave the network at any time. It offers important advantages, such as operation without central coordination, peer autonomy, and the ability to scale to a large number of peers. However, providing high-level data management services (schema, queries, replication, availability, etc.) in a P2P system requires revisiting distributed database technology in major ways. In this paper, we present APPA (Atlas Peer-to-Peer Architecture) and its high-level data management services. APPA has a network-independent architecture that can be implemented over various structured and super-peer P2P networks. It uses novel solutions for persistent data management with updates, data replication with semantic-based reconciliation, and query processing. APPA's services are implemented using the JXTA framework.
