A formal characterization of SI-based ROWA replication protocols

Snapshot isolation (SI) is commonly used in some commercial DBMSs with a multiversion concurrency control mechanism since it never blocks read-only transactions. Recent database replication protocols have been designed using SI replicas where transactions are firstly executed in a delegate replica and their updates (if any) are propagated to the rest of the replicas at commit time; i.e. they follow the Read One Write All (ROWA) approach. This paper provides a formalization that shows the correctness of abstract protocols which cover these replication proposals. These abstract protocols differ in the properties demanded for achieving a global SI level and those needed for its generalized SI (GSI) variant - allowing reads from old snapshots. Additionally, we propose two more relaxed properties that also ensure a global GSI level. Thus, some applications can further optimize their performance in a replicated system while obtaining GSI.

[1]  Idit Keidar,et al.  Group communication specifications: a comprehensive study , 2001, CSUR.

[2]  Fernando Pedone,et al.  Tashkent: uniting durability with transaction ordering for high-performance scalable database replication , 2006, EuroSys.

[3]  Carlo Curino,et al.  Schism , 2010, Proc. VLDB Endow..

[4]  Butler W. Lampson,et al.  Atomic Transactions , 1980, Advanced Course: Distributed Systems.

[5]  Francesc D. Muñoz-Escoí,et al.  A Weak Voting Database Replication Protocol Providing Different Isolation Levels , 2007 .

[6]  Bettina Kemme,et al.  Database replication for clusters of workstations , 2000 .

[7]  Christos H. Papadimitriou,et al.  The Theory of Database Concurrency Control , 1986 .

[8]  Irving L. Traiger,et al.  Transactions and consistency in distributed database systems , 1982, TODS.

[9]  Gustavo Alonso,et al.  Exploiting Atomic Broadcast in Replicated Databases (Extended Abstract) , 1997, Euro-Par.

[10]  Luis Irún-Briz,et al.  Managing Transaction Conflicts in Middleware-based Database Replication Architectures , 2006, 2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06).

[11]  Gustavo Alonso,et al.  Searching in time , 2006, SIGMOD Conference.

[12]  Jeffrey F. Naughton,et al.  Transaction reordering , 2010, Data Knowl. Eng..

[13]  Ming-Hua Lin An optimal workload-based data allocation approach for multidisk databases , 2009, Data Knowl. Eng..

[14]  Butler W. Lampson,et al.  Distributed Systems - Architecture and Implementation, An Advanced Course , 1981, Advanced Course: Distributed Systems.

[15]  Yi Lin,et al.  Snapshot isolation and integrity constraints in replicated databases , 2009, TODS.

[16]  Sameh Elnikety,et al.  Strongly consistent replication for a bargain , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[17]  P. Krishna Reddy,et al.  Improving the performance of read-only transactions through asynchronous speculation , 2008, SpringSim '08.

[18]  Hiroshi Nakamura,et al.  Pangea: An Eager Database Replication Middleware guaranteeing Snapshot Isolation without Modification of Database Servers , 2009, Proc. VLDB Endow..

[19]  Jim Gray,et al.  A critique of ANSI SQL isolation levels , 1995, SIGMOD '95.

[20]  Patrick E. O'Neil,et al.  Generalized isolation level definitions , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[21]  Dale Skeen,et al.  Nonblocking commit protocols , 1981, SIGMOD '81.

[22]  Gustavo Alonso,et al.  Extending DBMSs with satellite databases , 2008, The VLDB Journal.

[23]  Ippokratis Pandis,et al.  Data-oriented transaction execution , 2010, Proc. VLDB Endow..

[24]  André Schiper,et al.  Comparison of database replication techniques based on total order broadcast , 2005, IEEE Transactions on Knowledge and Data Engineering.

[25]  Gustavo Alonso,et al.  A new approach to developing and implementing eager database replication protocols , 2000, TODS.

[26]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[27]  Bettina Kemme,et al.  Postgres-R(SI): combining replica control with concurrency control based on snapshot isolation , 2005, 21st International Conference on Data Engineering (ICDE'05).

[28]  Ricardo Jiménez-Peris,et al.  Boosting Database Replication Scalability through Partial Replication and 1-Copy-Snapshot-Isolation , 2007, 13th Pacific Rim International Symposium on Dependable Computing (PRDC 2007).

[29]  Francesc D. Muñoz-Escoí,et al.  A formal analysis of database replication protocols with SI replicas and crash failures , 2009, The Journal of Supercomputing.

[30]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[31]  Kenneth Salem,et al.  Lazy database replication with snapshot isolation , 2006, VLDB.

[32]  Daniel J. Abadi,et al.  Low overhead concurrency control for partitioned main memory databases , 2010, SIGMOD Conference.

[33]  Ishfaq Ahmad,et al.  A Pure Nash Equilibrium-Based Game Theoretical Method for Data Replication across Multiple Servers , 2009, IEEE Transactions on Knowledge and Data Engineering.

[34]  Ricardo Jiménez-Peris,et al.  Middleware based data replication providing snapshot isolation , 2005, SIGMOD '05.

[35]  Francesc D. Muñoz-Escoí,et al.  SIPRe: a partial database replication protocol with SI replicas , 2008, SAC '08.

[36]  Hendrik Decker,et al.  k-bound GSI: a flexible database replication protocol , 2007, SAC '07.

[38]  Amr El Abbadi,et al.  ElasTraS: An Elastic Transactional Data Store in the Cloud , 2009, HotCloud.

[39]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[40]  Patrick Valduriez,et al.  Data currency in replicated DHTs , 2007, SIGMOD '07.

[41]  Fernando Pedone,et al.  Database replication using generalized snapshot isolation , 2005, 24th IEEE Symposium on Reliable Distributed Systems (SRDS'05).

[42]  Sungwon Jung,et al.  A concurrency control scheme for mobile transactions in broadcast disk environments , 2009, Data Knowl. Eng..

[43]  José Pereira,et al.  Revisiting 1-copy equivalence in clustered databases , 2006, SAC '06.

[44]  Gustavo Alonso,et al.  Exploiting atomic broadcast in replicated databases , 1997 .