A Hybrid Approach for Database Replication: Finding the Optimal Configuration between Update Everywhere and Primary Copy Paradigms

Database replication has been the subject of two different approaches, namely primary copy and update everywhere protocols. The former allows update transactions to be performed only in the primary replica, while the remaining replicas are used only to execute read-only transactions. Update everywhere protocols, on the other hand, allow the system to schedule update transactions in any replica, thus increasing its capacity to deal with update-intensive workloads and to overcome failures. However, synchronization costs increase, and throughput may fall below that obtained by primary copy approaches. Under these circumstances, we propose a new database replication paradigm, halfway between the primary copy and update everywhere approaches, which improves the system's performance by adapting its configuration to the workload submitted to the system. The core of this approach is a deterministic replication protocol which propagates changes so that broadcast transactions are never aborted. We also propose a recovery algorithm to ensure fault tolerance.
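
To make the configuration space concrete, the following is a minimal sketch of how update and read-only transactions could be routed when the number of replicas accepting updates is a tunable parameter. It is an illustration only, not the protocol proposed in the paper: the `Router` class, the `num_primaries` parameter, and the round-robin policy are all hypothetical, and the sketch assumes read-only transactions may run on any replica. Setting `num_primaries` to 1 corresponds to primary copy, setting it to the number of replicas corresponds to update everywhere, and values in between correspond to hybrid configurations.

```python
import itertools
from dataclasses import dataclass


@dataclass
class Router:
    """Hypothetical transaction router; not the paper's actual protocol."""
    replicas: list       # names of all replicas in the system
    num_primaries: int   # replicas allowed to execute update transactions

    def __post_init__(self):
        if not 1 <= self.num_primaries <= len(self.replicas):
            raise ValueError("num_primaries must be in 1..len(replicas)")
        # Assumption: simple round-robin scheduling within each group.
        self._update_rr = itertools.cycle(self.primaries)
        self._read_rr = itertools.cycle(self.replicas)

    @property
    def primaries(self):
        # Assumption: the first num_primaries replicas act as primaries.
        return self.replicas[:self.num_primaries]

    def route(self, is_update):
        """Return the replica that should execute the next transaction."""
        return next(self._update_rr if is_update else self._read_rr)


# Primary copy: every update goes to the single primary.
primary_copy = Router(["r1", "r2", "r3"], num_primaries=1)
# Hybrid: updates alternate between two primaries; reads use all replicas.
hybrid = Router(["r1", "r2", "r3"], num_primaries=2)
```

A system adapting to its workload would change `num_primaries` at run time, for example raising it when the share of update transactions grows; how and when to make that change is the question the paper addresses and is not modelled here.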
