Processing transactions over optimistic atomic broadcast protocols

Atomic broadcast primitives allow fault-tolerant cooperation between sites in a distributed system. Unfortunately, the delay incurred before a message can be delivered makes it difficult to implement high-performance, scalable applications on top of them. A recently proposed approach reduces the average delivery delay by relying on optimistic assumptions about the communication system. We develop this idea further and present a replicated database architecture that uses the new atomic broadcast primitive so that the coordination phase of the broadcast is fully overlapped with the execution of transactions. This provides high performance without relaxing transaction correctness.
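The overlap described above can be illustrated with a small sketch. It is not the architecture from the paper, only a toy model under stated assumptions: each replica receives a transaction twice, first through a tentative delivery (here called `opt_deliver`) and later through a definitive delivery (here called `to_deliver`), and all transactions are assumed to conflict with each other. The class `Replica`, both method names, and the counter `reexecutions` are hypothetical names chosen for illustration. A transaction is executed as soon as it is tentatively delivered; when the definitive order confirms the guess, the result is committed without further work, and only a wrong guess forces re-execution.

```python
from typing import Callable, Dict, List, Tuple

State = Dict[str, int]
Txn = Callable[[State], State]  # takes a private copy of the state, returns the new state


class Replica:
    """Toy replica: executes on tentative delivery, commits on definitive delivery."""

    def __init__(self) -> None:
        self.committed: State = {}
        # Executed but uncommitted transactions, in tentative order:
        # (transaction id, transaction, state after executing it).
        self.pending: List[Tuple[str, Txn, State]] = []
        self.reexecutions = 0  # how often a wrong guess forced extra work

    def opt_deliver(self, txn_id: str, txn: Txn) -> None:
        """Tentative delivery: execute now, while the broadcast still coordinates."""
        base = self.pending[-1][2] if self.pending else self.committed
        self.pending.append((txn_id, txn, txn(dict(base))))

    def to_deliver(self, txn_id: str) -> None:
        """Definitive delivery: commit if the tentative order was right, else redo.

        Assumes txn_id was already passed to opt_deliver on this replica.
        """
        index = next(i for i, p in enumerate(self.pending) if p[0] == txn_id)
        if index != 0:
            # Wrong guess: move the transaction to the front and re-execute
            # every pending transaction on top of the committed state.
            entry = self.pending.pop(index)
            reordered = [entry] + self.pending
            self.pending = []
            state = self.committed
            for tid, txn, _ in reordered:
                state = txn(dict(state))
                self.pending.append((tid, txn, state))
                self.reexecutions += 1
        # The head of the queue now matches the definitive order: commit it.
        _, _, result = self.pending.pop(0)
        self.committed = result


def add_ten(state: State) -> State:
    state["x"] = state.get("x", 0) + 10
    return state


def double(state: State) -> State:
    state["x"] = state.get("x", 0) * 2
    return state
```

In this sketch, a replica whose tentative order matches the definitive order commits with no extra execution, while a replica that guessed wrong re-executes and still converges to the same committed state. A real system would track conflicts between transactions instead of treating all of them as conflicting, and would undo work through the database's own abort mechanism.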
