Continuous Timestamping for Efficient Replication Management in DHTs

Distributed Hash Tables (DHTs) provide an efficient solution for data location and lookup in large-scale P2P systems. However, it is up to the applications to deal with the availability of the data they store in the DHT, e.g. via replication. To improve data availability, most DHT applications rely on data replication. However, efficient replication management is quite challenging, in particular because of concurrent and missed updates. In this paper, we propose an efficient solution to data replication in DHTs. We propose a new service, called Continuous Timestamp based Replication Management (CTRM), which deals with the efficient storage, retrieval and updating of replicas in DHTs. To perform updates on replicas, we propose a new protocol that stamps update actions with timestamps generated in a distributed fashion. Timestamps are not only monotonically increasing but also continuous, i.e. without gap. The property of monotonically increasing allows applications to determine a total order on updates. The other property, i.e. continuity, enables applications to deal with missed updates. We evaluated the performance of our solution through simulation and experimentation. The results show its effectiveness for replication management in DHTs.

[1]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[2]  Patrick Valduriez,et al.  P2P logging and timestamping for reconciliation , 2008, Proc. VLDB Endow..

[3]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[4]  Amit Kumar Saha,et al.  Adaptive clock synchronization in sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[5]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[6]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[7]  John Kubiatowicz,et al.  Handling churn in a DHT , 2004 .

[8]  Ben Y. Zhao,et al.  Awarded Best Student Paper! - Pond: The OceanStore Prototype , 2003 .

[9]  Patrick Valduriez,et al.  Data currency in replicated DHTs , 2007, SIGMOD '07.

[10]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[11]  Amit Kumar Saha,et al.  Adaptive clock synchronization in sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[12]  ShenkerScott,et al.  A case study in building layered DHT applications , 2005 .

[13]  Sriram Ramabhadran,et al.  A case study in building layered DHT applications , 2005, SIGCOMM '05.

[14]  Andreas Wombacher,et al.  Enabling High Data Availability in a DHT , 2005, 16th International Workshop on Database and Expert Systems Applications (DEXA'05).

[15]  John Kubiatowicz,et al.  Handling Churn in a DHT (Awarded Best Paper!) , 2004, USENIX Annual Technical Conference, General Track.

[16]  Robert Morris,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[17]  Ben Y. Zhao,et al.  Pond: The OceanStore Prototype , 2003, FAST.