Consistency-Latency Trade-Off of the LibRe Protocol: A Detailed Study

In multi-writer, multi-reader systems, data consistency is ensured by the number of replica nodes contacted during read and write operations. Contacting a sufficient number of nodes in order to ensure data consistency comes with a communication cost and a risk to data availability. In this paper, we extend our previous work on a consistency protocol called LibRe, which helps to read the latest version of a data item by contacting a minimum number of replica nodes. The protocol uses a registry that records the set of replica nodes containing the most recent version of the data items until all replicas of this data item converge to a consistent state. Hence, referring to the registry during read time helps to forward the read requests to the replica nodes holding the most recent version of the needed data item. In the following work, we show that this protocol provides a new trade-off between consistency and latency for distributed data storage systems. We provide a formal description of the protocol and its reliability and evaluate the scalability of the protocol up to one hundred nodes by simulation. We also demonstrate the effectiveness of the approach in practice by providing a proof-of-concept implementation of the protocol inside the Cassandra distributed data store. The test results prove that using LibRe protocol, an application would experience a similar number of stale reads compared to strong consistency options offered by Cassandra, while achieving lower latency, and similar availability.

[1]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.

[2]  Yasushi Saito,et al.  Optimistic replication , 2005, CSUR.

[3]  Nenghai Yu,et al.  Distributed Hash Table: Theory, Platforms and Applications , 2013 .

[4]  Rusty Klophaus,et al.  Riak Core: building distributed applications without shared state , 2010, CUFP '10.

[5]  Nancy A. Lynch,et al.  Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services , 2002, SIGA.

[6]  Sebastian Burckhardt,et al.  Principles of Eventual Consistency , 2014, Found. Trends Program. Lang..

[7]  David K. Gifford,et al.  Weighted voting for replicated data , 1979, SOSP '79.

[8]  Bharat K. Bhargava,et al.  Replication Techniques in Distributed Systems , 1996, Advances in Database Systems.

[9]  Derek L. Eager,et al.  Achieving robustness in distributed database systems , 1983, TODS.

[10]  Eric Gressier-Soudan,et al.  LibRe: a consistency protocol for modern storage systems , 2013, COMPUTE.

[11]  Raja Chiky,et al.  Simizer: evaluating consistency trade offs through simulation , 2014, PaPEC '14.

[12]  Edward D. Lazowska,et al.  An efficient and highly available read-one write-all protocol for replicated data management , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[13]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[14]  Nenghai Yu,et al.  Distributed Hash Table , 2013, SpringerBriefs in Computer Science.

[15]  Marko Vukolic Remarks: The Origin of Quorum Systems , 2010, Bull. EATCS.

[16]  Michael K. Reiter,et al.  Probabilistic quorum systems , 1997, PODC '97.

[17]  Raja Chiky,et al.  Evaluating consistency on the fly using YCSB , 2014, 2014 International Workshop on Computational Intelligence for Multimedia Understanding (IWCIM).

[18]  Robert H. Thomas,et al.  A Majority consensus approach to concurrency control for multiple copy databases , 1979, ACM Trans. Database Syst..

[19]  Eric Gressier-Soudan,et al.  CaLibRe: A Better Consistency-Latency Tradeoff for Quorum Based Replication Systems , 2015, DEXA.

[20]  Moni Naor,et al.  The Load, Capacity, and Availability of Quorum Systems , 1998, SIAM J. Comput..

[21]  Daniel J. Abadi,et al.  Consistency Tradeoffs in Modern Distributed Database System Design: CAP is Only Part of the Story , 2012, Computer.

[22]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.

[23]  Ion Stoica,et al.  Probabilistically Bounded Staleness for Practical Partial Quorums , 2012, Proc. VLDB Endow..