Multiview access protocols for large-scale replication

The article proposes a scalable protocol for replication management in large-scale replicated systems. The protocol organizes sites and data replicas into a tree-structured, hierarchical cluster architecture. The basic idea of the protocol is to accomplish the complex task of updating replicated data with a very large number of replicas by a set of related but independently committed transactions. Each transaction is responsible for updating replicas in exactly one cluster and invoking additional transactions for member clusters. Primary copies (one from each cluster) are updated by a cross-cluster transaction. Then each cluster is independently updated by a separate transaction. This decoupled update propagation process results in possible multiple views of replicated data in a cluster. Compared to other replicated data management protocols, the proposed protocol has several unique advantages. First, thanks to a smaller number of replicas each transaction needs to atomically update in a cluster, the protocol significantly reduces the transaction abort rate, which tends to soar in large transactional systems. Second, the protocol improves user-level transaction response time as top-level update transactions are allowed to commit before all replicas have been updated. Third, read-only queries have the flexibility to see database views of different degrees of consistency and data currency. This ranges from global, most up to date, and consistent views, to local, consistent, but potentially old views, to local, nearest to users but potentially inconsistent views. Fourth, the protocol maintains its scalability by allowing dynamic system reconfiguration as it grows by splitting a cluster into two or more smaller ones. Fifth, autonomy of the clusters is preserved as no specific protocol is required to update replicas within the same cluster. Clusters are, therefore, free to use any valid replication or concurrency control protocols.

[1]  Hector Garcia-Molina,et al.  Consistency in a partitioned network: a survey , 1985, CSUR.

[2]  Mahadev Satyanarayanan,et al.  Scalable, secure, and highly available distributed file access , 1990, Computer.

[3]  Doug Terry,et al.  Epidemic algorithms for replicated database maintenance , 1988, OPSR.

[4]  Hector Garcia-Molina,et al.  Read-only transactions in a distributed database , 1982, TODS.

[5]  Sushil Jajodia,et al.  Dynamic voting , 1987, SIGMOD '87.

[6]  J. D. Day,et al.  A principle for resilient sharing of distributed resources , 1976, ICSE '76.

[7]  Harjinder S. Sandhu,et al.  Cluster-based file replication in large-scale distributed systems , 1992, SIGMETRICS '92/PERFORMANCE '92.

[8]  Bharat Bhargava,et al.  WANCE: A wide area network communication emulation system , 1993, Proceedings 1993 IEEE Workshop on Advances in Parallel and Distributed Systems.

[9]  Jim Gray,et al.  Benchmark Handbook: For Database and Transaction Processing Systems , 1992 .

[10]  Roger M. Needham,et al.  Grapevine: An exercise in distributed computing , 1981, SOSP.

[11]  Bharat K. Bhargava,et al.  Adaptable Recovery Using Dynamic Quorum Assignments , 1990, VLDB.

[12]  Hector Garcia-Molina,et al.  Increasing availability under mutual exclusion constraints with dynamic vote reassignment , 1989, TOCS.

[13]  W. H. Inmon,et al.  Rdb/VMS: Developing the Data Warehouse , 1993 .

[14]  Raymond A. Lorie,et al.  Physical integrity in a large segmented database , 1977, TODS.

[15]  SatyanarayananMahadev Scalable, Secure, and Highly Available Distributed File Access , 1990 .

[16]  Darrell D. E. Long,et al.  Accessing Replicated Data in a Large-Scale Distributed System , 1991, Int. J. Comput. Simul..

[17]  Michael Stonebraker,et al.  Concurrency Control and Consistency of Multiple Copies of Data in Distributed Ingres , 1979, IEEE Transactions on Software Engineering.

[18]  Bharat Bhargava,et al.  Effective Strategies for Multi-Media Message Transmission: UDP Cooling and TCP , 1995 .

[19]  Robbert van Renesse,et al.  Design and Performance of Horus: A Lightweight Group Communications System , 1994 .

[20]  Yogen K. Dalal,et al.  The clearinghouse: a decentralized agent for locating named objects in a distributed environment , 1983, TOIS.

[21]  M. Herlihy A quorum-consensus replication method for abstract data types , 1986, TOCS.

[22]  Weimin Du,et al.  Maintaining consistency of replicated data in multidatabase systems , 1994, 14th International Conference on Distributed Computing Systems.

[23]  Jennifer Widom,et al.  View maintenance in a warehousing environment , 1995, SIGMOD '95.

[24]  Matt Blaze,et al.  Dynamic hierarchical caching in large-scale distributed file systems , 1992, [1992] Proceedings of the 12th International Conference on Distributed Computing Systems.

[25]  Mostafa H. Ammar,et al.  The Grid Protocol: A High Performance Scheme for Maintaining Replicated Data , 1992, IEEE Trans. Knowl. Data Eng..

[26]  Flaviu Cristian,et al.  An efficient, fault-tolerant protocol for replicated data management , 1985, PODS '85.

[27]  Philip A. Bernstein,et al.  Categories and Subject Descriptors: H.2.4 [Database Management]: Systems. , 2022 .

[28]  John Heidemann,et al.  Name transparency in very large scale distributed file systems , 1990, IEEE Workshop on Experimental Distributed Systems.

[29]  Abdelsalam Ali Helal Experimental analysis of replication in distributed systems , 1991 .

[30]  Roderic G. G. Cattell The benchmark handbook for database and transaction processing systems , 1991 .

[31]  Bharat K. Bhargava,et al.  Replication Techniques in Distributed Systems , 1996, Advances in Database Systems.

[32]  Andreas Reuter,et al.  Principles of transaction-oriented database recovery , 1983, CSUR.

[33]  Bharat Bhargava,et al.  Communication experiments for distributed transaction processing: from lan to wan , 1994 .

[34]  Divyakant Agrawal,et al.  An efficient and fault-tolerant solution for distributed mutual exclusion , 1991, TOCS.

[35]  Flaviu Cristian,et al.  An efficient, fault-tolerant protocol for replicated data management , 1985, Fault-Tolerant Distributed Computing.

[36]  Michael Stonebraker,et al.  A measure of transaction processing power , 1985 .

[37]  William E. Weihl,et al.  Distributed Version Management for Read-Only Actions , 1985, IEEE Transactions on Software Engineering.

[38]  Jean Bacon,et al.  A hierarchical asynchronous replication protocol for large scale systems , 1993, Proceedings 1993 IEEE Workshop on Advances in Parallel and Distributed Systems.

[39]  K. Brahmadathan,et al.  Read-only transactions in partitioned replicated databases , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[40]  Toshimi Minoura,et al.  Resilient Extended True-Copy Token Scheme for a Distributed Database System , 1982, IEEE Transactions on Software Engineering.

[41]  Mahadev Satyanarayanan,et al.  Coda: a highly available file system for a distributed workstation environment , 1989, Proceedings of the Second Workshop on Workstation Operating Systems.

[42]  Mamoru Maekawa,et al.  A N algorithm for mutual exclusion in decentralized systems , 1985, TOCS.

[43]  Walter A. Burkhard,et al.  Consistency and recovery control for replicated files , 1985, SOSP 1985.

[44]  Mahadev Satyanarayanan,et al.  Coda: A Highly Available File System for a Distributed Workstation Environment , 1990, IEEE Trans. Computers.

[45]  Bharat K. Bhargava,et al.  Analyzing Availability of Replicated Database Systems , 1991, Int. J. Comput. Simul..

[46]  Amnon Barak,et al.  Design Principles of Operating Systems for Large Scale Multicomputers , 1987, Experiences with Distributed Systems.

[47]  Philip A. Bernstein,et al.  Serializability Theory for Replicated Databases , 1985, J. Comput. Syst. Sci..

[48]  Andreas Reuter,et al.  Transaction Processing: Concepts and Techniques , 1992 .

[49]  Scott Shenker,et al.  Epidemic algorithms for replicated database maintenance , 1988, OPSR.

[50]  Walter A. Burkhard,et al.  Consistency and recovery control for replicated files , 1985, SOSP '85.

[51]  Akhil Kumar,et al.  HPP: a hierarchical propagation protocol for large scale replication in wide area networks , 1994 .

[52]  Amr El Abbadi,et al.  Maintaining availability in partitioned replicated databases , 1987, ACM Trans. Database Syst..

[53]  Akhil Kumar,et al.  A High Availability \sqrt{N} Hierarchical Grid Algorithm for Replicated Data , 1991, Inf. Process. Lett..

[54]  Akhil Kumar,et al.  Performance analysis of a hierarchical quorum consensus algorithm for replicated objects , 1990, Proceedings.,10th International Conference on Distributed Computing Systems.

[55]  Mostafa H. Ammar,et al.  Multidimensional voting , 1991, TOCS.

[56]  Xiangning Liu Data communication and replication strategies for large scale distributed databases , 1996 .

[57]  Robbert van Renesse,et al.  Horus: a flexible group communication system , 1996, CACM.

[58]  Roger M. Needham,et al.  Grapevine: an exercise in distributed computing , 1982, CACM.

[59]  Bharat Bhargava,et al.  Adapting Distributed Database Systems for High Availability , 1995 .

[60]  David K. Gifford,et al.  Weighted voting for replicated data , 1979, SOSP '79.

[61]  Divyakant Agrawal,et al.  Dynamic logical structures: a position statement for managing replicated data , 1992, [1992 Proceedings] Second Workshop on the Management of Replicated Data.