Data Currency in Replicated Distributed Storage System

Application-level storage aggregation provides a massive storage capacity with high scalability and low cost. However, these systems usually only support their special sites, which ignores the legacy storage systems. So we developed a distributed storage system to aggregate these popular storages and provide a uniform access and management interface for these sites. To ensure high data availability, our distributed storage system utilizes a sophisticated way known as data replication. In this paper, we present the data currency scenario employed in our distributed storage system, which can be also used in other replicated distributed storage systems. We propose a Replica Access Service (RAS) to deal with data availability and efficient retrieval of current replicas based on version controlling, which can balance the data accessibility and replica consistency. We validate our solution's performance and scalability through simulation up to 10,000 distributed sites. The simulation results show that our algorithm used in RAS achieves major performance gains, in terms of response time, compared with a baseline algorithm.

[1]  Magnus Karlsson,et al.  Taming aggressive replication in the Pangaea wide-area file system , 2002, OPSR.

[2]  Arie Segev,et al.  Currency-based updates to distributed materialized views , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[3]  Christopher Chute,et al.  The Diverse and Exploding Digital Universe , 2011 .

[4]  Antony I. T. Rowstron,et al.  Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility , 2001, SOSP.

[5]  Heiko Schuldt,et al.  FAS - A Freshness-Sensitive Coordination Middleware for a Cluster of OLAP Components , 2002, VLDB.

[6]  Setsuo Ohsuga,et al.  INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES , 1977 .

[7]  Jonathan Goldstein,et al.  Relaxed currency and consistency: how to say "good enough" in SQL , 2004, SIGMOD '04.

[8]  Raghu Ramakrishnan,et al.  Caching with 'Good Enough' Currency, Consistency, and Completeness , 2005, VLDB.

[9]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[10]  Ben Y. Zhao,et al.  Pond: The OceanStore Prototype , 2003, FAST.

[11]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[12]  Philip A. Bernstein,et al.  Relaxed-currency serializability for middle-tier caching and replication , 2006, SIGMOD Conference.

[13]  Flavia Donno,et al.  Replica Consistency in a Data Grid , 2004 .

[14]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[15]  Fred W. Howell,et al.  Using Java for Discrete Event Simulation , 1996 .