Persistent Logical Synchrony

The virtually synchronous execution model provides suitable support for developing reliable applications when a crash failure model is assumed. Under this model, group broadcasts need only be based on asynchronous communication, and synchronization points are set only when a view change occurs, which allows such applications to run efficiently. However, the crash failure model is not appropriate for every application. Applications that manage persistent or large state, such as replicated databases, need a recoverable model instead. In such cases, the virtual synchrony property must be partially extended to adequately support more intricate recovery protocols. Persistent logical synchrony is one such extension: it extends the synchronization actions taken when a view change occurs, and thereby provides good support for partial recovery when a primary component membership is assumed.
