Improving Performance in Replicated Databases through Relaxed Coherency

Applications in finance and telecommunications (intelligent network, network management, mobile computing) cause renewed interest in distributed and replicated data management. Since synchronous update of replicated data is experienced to degrade distributed systems performance substantially, relaxing the requirement of coherency (mutual consistency) has become a favorable approach to achieve high performance in replicated databases. In this paper we present formal concepts for specifying relaxed coherency which allows to calculate an independent measure of relaxation, called + coherency index. We incorporate this metric into a detailed analytical queueing model which emphasizes on the quality of replication to evaluate the impact of relaxed coherency on the performance of replicated databases. The model considers response time, throughput, scalability and network traffic as performance criteria. As it turns out, performance improvements through relaxed coherency depend significantly on various system parameters. We closely examine the trade off between consistency and performance, and show that in many situations a slight relaxation of coherency can increase performance remarkably. Finally, we verify these results by benchmarking an implementation of relaxed coherency.

[1]  N. Meyers,et al.  H = W. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[3]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[4]  Hector Garcia-Molina Performance of update algorithms for replicated data in a distributed database , 1979 .

[5]  Bruce G. Lindsay,et al.  Database Snapshots , 1980, VLDB.

[6]  Philip A. Bernstein,et al.  Concurrency Control in Distributed Database Systems , 1986, CSUR.

[7]  Hector Garcia-Molina,et al.  Read-only transactions in a distributed database , 1982, TODS.

[8]  M. Blasgen Database Systems , 1982, Science.

[9]  Abraham Silberschatz,et al.  Operating System Concepts , 1983 .

[10]  Carl M. Harris,et al.  Fundamentals of queueing theory (2nd ed.). , 1985 .

[11]  Hector Garcia-Molina,et al.  Consistency in a partitioned network: a survey , 1985, CSUR.

[12]  Mukesh Singhal,et al.  Performance analysis of an algorithm for concurrency control in replicated database systems , 1986, SIGMETRICS '86/PERFORMANCE '86.

[13]  Hamid Pirahesh,et al.  A snapshot differential refresh algorithm , 1986, SIGMOD '86.

[14]  M Singhal Concurrency control algorithms and their performance in replicated database systems , 1986 .

[15]  Eric N. Hanson,et al.  A performance analysis of view materialization strategies , 1987, SIGMOD '87.

[16]  Gio Wiederhold,et al.  Modeling asynchrony in distributed databases , 1987, 1987 IEEE Third International Conference on Data Engineering.

[17]  Umeshwar Dayal,et al.  The HiPAC project: combining active databases and timing constraints , 1988, SGMD.

[18]  Olivia R. Liu Sheng,et al.  Analysis of Query Processing in Distributed Database Systems with Fully Replicated Files: A Hierarchical Approach , 1988, Perform. Evaluation.

[19]  Mukesh Singhal A fully-distributed approach to concurrency control in replicated database systems , 1988, Proceedings COMPSAC 88: The Twelfth Annual International Computer Software & Applications Conference.

[20]  Philip S. Yu,et al.  Analysis of Replication in Distributed Database Systems , 1990, IEEE Trans. Knowl. Data Eng..

[21]  Amit P. Sheth,et al.  Management of interdependent data: specifying dependency and consistency requirements , 1990, [1990] Proceedings. Workshop on the Management of Replicated Data.

[22]  Rafael Alonso,et al.  Data caching issues in an information retrieval system , 1990, TODS.

[23]  Peter J. B. King,et al.  Computer and Communication Systems Performance Modelling , 1990, SIGMETRICS Perform. Evaluation Rev..

[24]  Hector Garcia-Molina,et al.  The case for controlled inconsistency in replicated data , 1990, [1990] Proceedings. Workshop on the Management of Replicated Data.

[25]  Gio Wiederhold,et al.  Consistency control of replicated data in federated databases , 1990, [1990] Proceedings. Workshop on the Management of Replicated Data.

[26]  Mukesh Singhal,et al.  Update Transport: A New Technique for Update Synchronization in Replicated Database Systems , 1990, IEEE Trans. Software Eng..

[27]  Pierangela Samarati,et al.  A classification of update methods for replicated databases , 1991 .

[28]  Amit P. Sheth,et al.  Specifying interdatabase dependencies in a multidatabase environment , 1991, Computer.

[29]  Sang H. Son,et al.  Performance Evaluation of Replication Control Algorithms for Distributed Database Systems , 1991 .

[30]  G. Ramanathan,et al.  Specification of real-time distributed database systems , 1992, CompEuro 1992 Proceedings Computer Systems and Software Engineering.

[31]  Calton Pu,et al.  An analysis of replica control , 1992, [1992 Proceedings] Second Workshop on the Management of Replicated Data.

[32]  Narain H. Gehani,et al.  Composite Event Specification in Active Databases: Model & Implementation , 1992, VLDB.

[33]  Alison Payne Designing the databases of the intelligent network , 1992, ICSE 1992.

[34]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[35]  Jim Gray,et al.  Benchmark Handbook: For Database and Transaction Processing Systems , 1992 .

[36]  Philip S. Yu,et al.  Analysis of Concurrency-Coherency Control Protocols for Distributed Transaction Processing Systems with Regional Locality , 1992, IEEE Trans. Software Eng..

[37]  Peter B. Galvin,et al.  Operating System Concepts, 4th Ed. , 1993 .

[38]  Ravi Mukkamala,et al.  Performance Analysis of Transaction Management Algorithms for the SINTRA Replicated-Architecture Database System , 1993, Database Security.

[39]  Matthias Jarke,et al.  Intelligent Networks as a Data Intensive Application (INDIA) , 1994, ADB.

[40]  Sujata Banerjee,et al.  Performance analysis of the send-on-demand: A distributed database concurrency control protocol for high-speed networks , 1994, Comput. Commun..