Optimizing Certification-Based Database Recovery

Certification-based database replication protocols are a good basis to develop replica recovery when they provide the snapshot isolation level. For such isolation level, no readset needs to be transferred between replicas nor checked in the certification phase. Additionally, these protocols need to maintain a historic list of writesets that is used for certifying the transactions that arrive to the commit phase. Such historic list can be used to transfer the missed state of a recovering replica. We study the performance of the basic recovery approach - to transfer all missed writesets - and a version-based optimization - to transfer the latest version of each missed item, compacting thus the writeset list - and the results show that such optimization reduces a lot the recovery time.

[1]  Hendrik Decker,et al.  COPLA* - a Middleware for Distributed Databases , 2002, Asian Symposium on Programming Languages and Systems.

[2]  Francisco Castro-Company,et al.  MADIS: A Slim Middleware for Database Replication , 2005, Euro-Par.

[3]  André Schiper,et al.  Comparison of database replication techniques based on total order broadcast , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Francisco Castro-Company,et al.  CLOB: communication support for efficient replicated database recovery , 2005, 13th Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[5]  Francisco Castro-Company,et al.  FOBr: a version-based recovery protocol for replicated databases , 2005, 13th Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[6]  Luis Irún-Briz,et al.  Managing Transaction Conflicts in Middleware-based Database Replication Architectures , 2006, 2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06).

[7]  Idit Keidar,et al.  Group communication specifications: a comprehensive study , 2001, CSUR.

[8]  Luis Ir,et al.  Lazy Recovery in a Hybrid Database Replication Protocol , 2003 .

[9]  Sam Toueg,et al.  Fault-tolerant broadcasts and related problems , 1993 .

[10]  Fernando Pedone,et al.  Database replication using generalized snapshot isolation , 2005, 24th IEEE Symposium on Reliable Distributed Systems (SRDS'05).

[11]  Gustavo Alonso,et al.  Non-intrusive, parallel recovery of replicated data , 2002, 21st IEEE Symposium on Reliable Distributed Systems, 2002. Proceedings..

[12]  R. Jiménez-Peris,et al.  An Algorithm for Non-Intrusive , Parallel Recovery of Replicated Data and its Correctness , 2002 .

[13]  José Enrique Armendáriz Iñigo,et al.  Design and implementation of database replication protocols in the madis architecture , 2006 .

[14]  Gustavo Alonso,et al.  Database replication techniques: a three parameter classification , 2000, Proceedings 19th IEEE Symposium on Reliable Distributed Systems SRDS-2000.

[15]  Pedro Vicente,et al.  Strong Replication in the GlobData Middleware , 2002 .

[16]  Flaviu Cristian,et al.  Understanding fault-tolerant distributed systems , 1991, CACM.

[17]  JoAnne Holliday Replicated database recovery using multicast communication , 2001, Proceedings IEEE International Symposium on Network Computing and Applications. NCA 2001.

[18]  Alberto Bartoli,et al.  Online reconfiguration in replicated databases based on group communication , 2001, 2001 International Conference on Dependable Systems and Networks.

[19]  Jim Gray,et al.  A critique of ANSI SQL isolation levels , 1995, SIGMOD '95.