Preventive Replication in a Database Cluster

In a database cluster, preventive replication can provide strong consistency without the limitations of synchronous replication. In this paper, we present a full solution for preventive replication that supports multi-master and partial configurations, where databases are partially replicated at different nodes. To increase transaction throughput, we propose an optimization that eliminates delay at the expense of a few transaction aborts and we introduce concurrent replica refreshment. We describe large-scale experimentation of our algorithm based on our RepDB* prototype (http://www.sciences.univ-nantes.fr./lina/ATLAS/RepDB) over a cluster of 64 nodes running the PostgreSQL DBMS. Our experimental results using the TPC-C Benchmark show that the proposed approach yields excellent scale-up and speed-up.

[1]  Jack Dongarra,et al.  High Performance Computing for Computational Science , 2003 .

[2]  John A. Clark,et al.  Holistic schedulability analysis for distributed hard real-time systems , 1994, Microprocess. Microprogramming.

[3]  Esther Pacitti,et al.  Replica Consistency in Lazy Master Replicated Databases , 2001, Distributed and Parallel Databases.

[4]  Fernando Pedone,et al.  Partial replication in the Database State Machine , 2001, Proceedings IEEE International Symposium on Network Computing and Applications. NCA 2001.

[5]  Avishai Wool,et al.  Replication, consistency, and practicality: are these mutually exclusive? , 1998, SIGMOD '98.

[6]  Bettina Kemme,et al.  Postgres-R(SI): combining replica control with concurrency control based on snapshot isolation , 2005, 21st International Conference on Data Engineering (ICDE'05).

[7]  Patrick Valduriez,et al.  Consistency management for partial replication in a high performance database cluster , 2005, 11th International Conference on Parallel and Distributed Systems (ICPADS'05).

[8]  Patrick Valduriez,et al.  Parallel database systems: Open problems and new issues , 1993, Distributed and Parallel Databases.

[9]  Patrick Valduriez,et al.  Replicated Databases: concepts, Architectures and Techniques , 1998, Netw. Inf. Syst. J..

[10]  André Schiper,et al.  Optimistic Atomic Broadcast , 1998, DISC.

[11]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[12]  Gustavo Alonso,et al.  Ganymed: Scalable Replication for Transactional Web Applications , 2004, Middleware.

[13]  Gustavo Alonso,et al.  Don't Be Lazy, Be Consistent: Postgres-R, A New Way to Implement Database Replication , 2000, VLDB.

[14]  Francois Raab,et al.  TPC-C - The Standard Benchmark for Online transaction Processing (OLTP) , 1993, The Benchmark Handbook.

[15]  Francisco Moura,et al.  Optimistic total order in wide area networks , 2002, 21st IEEE Symposium on Reliable Distributed Systems, 2002. Proceedings..

[16]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[17]  Christian Plattner,et al.  Ganymed: Scalable Replication for Transactional Web Applications , 2004, International Middleware Conference.

[18]  Patrick Valduriez,et al.  The RepDB* prototype: Preventive Replication in a Database Cluster , 2004, BDA.

[19]  Ricardo Jiménez-Peris,et al.  Middleware based data replication providing snapshot isolation , 2005, SIGMOD '05.

[20]  Gustavo Alonso,et al.  Improving the scalability of fault-tolerant database clusters , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[21]  Esther Pacitti,et al.  Preventive Multi-master Replication in a Cluster of Autonomous Databases , 2003, Euro-Par.

[22]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[23]  Sam Toueg,et al.  Fault-tolerant broadcasts and related problems , 1993 .

[24]  Patrick Valduriez,et al.  Scaling Up the Preventive Replication of Autonomous Databases in Cluster Systems , 2004, VECPAR.