A Pragmatic Protocol for Database Replication in Interconnected Clusters

Multi-master, update-everywhere database replication, as achieved by protocols based on group communication such as DBSM and Postgres-R, addresses both performance and availability. Scaling it to wide area networks would save costly bandwidth and avoid long round trips to a distant master server. Moreover, by ensuring that updates are safely stored at a remote site within transaction boundaries, disaster recovery is guaranteed. Unfortunately, scaling existing cluster-based replication protocols is troublesome. In this paper we present a database replication protocol based on group communication that targets interconnected clusters. In contrast with previous proposals, it uses a separate multicast group for each cluster and thus does not impose any additional requirements on group communication, easing implementation and deployment in a real setting. Nonetheless, the protocol ensures one-copy equivalence while allowing all sites to execute update transactions. Experimental evaluation using the workload of the industry-standard TPC-C benchmark confirms the advantages of the approach.
