Evaluating the performance of the database state machine using realistic simulations

Replication of database servers using the Database State Machine (DBSM) approach has recently attracted much attention because it promises both good performance and increased reliability. Fulfilling the promise of performance, however, requires that the impact of environment parameters, as well as of design and implementation decisions, be evaluated in a realistic setting that allows experimentation with configuration and environment parameters. This paper introduces a model that combines simulated network and database engine components with real implementations of termination and communication protocols. This allows us to precisely evaluate the protocols’ performance under a realistic load generated by the simulated database engine. It also allows us to evaluate the impact of protocol overhead on the overall performance of the database system in several environments. Besides describing the design and validation of the simulation model, we present results obtained with prototype implementations of the protocols.
