Consistency and recovery control for replicated files

We present a consistency and recovery control scheme for replicated files. The purpose of a replicated file is to improve the availability of a logical file in the presence of site failures and network partitions. The accessible physical copies of a replicated file will be mutually consistent and behave like a single copy as far as the user can tell. Our recovery scheme requires no manual intervention. The scheme tolerates any number of site failures and network partitions as well as repairs.
