The Effect of Failure and Repair Distributions on Consistency Protocols for Replicated Data Objects

The accessibility of vital information can be enhanced by replicating the data on several sites, and employing a consistency control protocol to manage the copies. Various protocols have been proposed to ensure that only current copies of the data can be accessed. The effect these protocols have on the accessibility of the replicated data is investigated by simulating the operation of the network and measuring the performance. Several strategies for replica maintenance are considered, and the benefits of each are analyzed. The details of the simulations are discussed. Measurements of the reliability and the availability of the replicated data are compared and contrasted. The sensitivity of the Available Copy and Dynamic-linear Voting protocols to common patterns of site failures and repairs is studied in detail. Exponential, Erlang, uniform, and hyperexponential distributions are considered, and the effect the second moments have on the results is analyzed. The relative performance of competing protocols is shown to be only marginally affected by non-exponential distributions, validating the robustness of the exponential approximations.

[1]  Sushil Jajodia Managing replicated files in partitioned distributed database systems , 1987, 1987 IEEE Third International Conference on Data Engineering.

[2]  Jerre D. Noe,et al.  Effectiveness of Replication in Distributed Computer Networks , 1987, ICDCS.

[3]  J.-F. Paris,et al.  Reliability of replicated data objects , 1989, Eighth Annual International Phoenix Conference on Computers and Communications. 1989 Conference Proceedings.

[4]  Sushil Jajodia,et al.  Dynamic voting , 1987, SIGMOD '87.

[5]  A. Glockner Optimal consistency protocols for replicated files , 1989, Eighth Annual International Phoenix Conference on Computers and Communications. 1989 Conference Proceedings.

[6]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[7]  Darrell D. E. Long,et al.  A realistic evaluation of optimistic dynamic voting , 1988, Proceedings [1988] Seventh Symposium on Reliable Distributed Systems.

[8]  Darrell D. E. Long,et al.  The reliability of regeneration-based replica control protocols , 1989, [1989] Proceedings. The 9th International Conference on Distributed Computing Systems.

[9]  Robert H. Thomas,et al.  A Majority consensus approach to concurrency control for multiple copy databases , 1979, ACM Trans. Database Syst..

[10]  Darrell D. E. Long,et al.  Regeneration protocols for replicated objects , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[11]  Walter A. Burkhard,et al.  Consistency and recovery control for replicated files , 1985, SOSP 1985.

[12]  Darrell D. E. Long,et al.  On Improving the Availability of Replicated Files , 1987, SRDS.

[13]  Richard D. Schlichting,et al.  Fail-stop processors: an approach to designing fault-tolerant computing systems , 1983, TOCS.

[14]  G. S. Fishman Principles of Discrete Event Simulation , 1978 .

[15]  Arnold O. Allen,et al.  Probability, statistics and queueing theory - with computer science applications (2. ed.) , 1981, Int. CMG Conference.

[16]  David K. Gifford,et al.  Weighted voting for replicated data , 1979, SOSP '79.

[17]  Philip A. Bernstein,et al.  An algorithm for concurrency control and recovery in replicated distributed databases , 1984, TODS.

[18]  Darrell D. E. Long,et al.  Block-Level Consistency of Replicated Files , 1987, ICDCS.

[19]  Darrell D. E. Long,et al.  The management of replication in a distributed system , 1988 .