Practically Realizable Efficient Data Allocation and Replication Strategies for Distributed Databases with Buffer Constraints

In this paper, we address the performance of distributed database systems with buffer constraints. Specifically, our objective is to design and analyze efficient data allocation and replication strategies to minimize the total servicing cost for an arbitrary read/write request sequence, under finite buffer constraints of the nodes in the system. When the available buffer space in a node is not enough to store a copy of an object, the decision has to be made on whether or not we should evict one or more objects in use to give room for the new object copy. In this paper, we design and analyze the data replication strategies with the model of dynamic window mechanism (DWM) algorithm jointly implemented with different types of object replacement strategies (no replacement, LRU, and LFU) commonly found in practice. We consider situations wherein the object sizes are identical as well as heterogeneous. We will show the impact on the performance of the allocation and replication strategies due to the limited local database buffer capacities. We analyze and quantify theoretically (using competitive analysis) the performances of all the proposed algorithms. Further, we perform rigorous simulation experiments to validate the findings with respect to several influencing parameters. Several useful conclusions are drawn based on the experimental results and we highlight the usefulness of the algorithms under different situations

[1]  Azer Bestavros,et al.  Popularity-aware greedy dual-size Web proxy caching algorithms , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[2]  J. Spencer Love,et al.  Caching strategies to improve disk system performance , 1994, Computer.

[3]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[4]  Sushil Jajodia,et al.  An adaptive data replication algorithm , 1997, TODS.

[5]  Bharadwaj Veeravalli,et al.  Object Management in Distributed Database Systems for Stationary and Mobile Computing Environments , 2003, Network Theory and Applications.

[6]  Paulo Veríssimo,et al.  Distributed Systems for System Architects , 2001, Advances in Distributed Computing and Middleware.

[7]  Sang Lyul Min,et al.  Buffer cache management: predicting the future from the past , 2002, Proceedings International Symposium on Parallel Architectures, Algorithms and Networks. I-SPAN'02.

[8]  Konstantinos Kalpakis,et al.  Optimal Placement of Replicas in Trees with Read, Write, and Storage Costs , 2001, IEEE Trans. Parallel Distributed Syst..

[9]  Robert E. Tarjan,et al.  Amortized efficiency of list update and paging rules , 1985, CACM.

[10]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[11]  Gerhard Weikum,et al.  The LRU-K page replacement algorithm for database disk buffering , 1993, SIGMOD Conference.

[12]  Ambuj K. Singh,et al.  Stable and fault-tolerant object allocation , 2000, PODC '00.

[13]  Luigi Rizzo,et al.  Replacement policies for a proxy cache , 2000, TNET.

[14]  S. Albers Competitive Online Algorithms , 1996 .

[15]  Sushil Jajodia,et al.  Distributed algorithms for dynamic replication of data , 1992, PODS.

[16]  Ouri Wolfson,et al.  The multicast policy and its relationship to replicated data placement , 1991, TODS.

[17]  Andrew S. Tanenbaum,et al.  Distributed systems: Principles and Paradigms , 2001 .

[18]  Amos Fiat,et al.  Competitive distributed file allocation , 1993, STOC '93.

[19]  Kai Hwang,et al.  Advanced computer architecture - parallelism, scalability, programmability , 1992 .

[20]  Stephen A. Cook,et al.  The optimal location of replicas in a network using a READ-ONE-WRITE-ALL policy , 2002, Distributed Computing.

[21]  Ouri Wolfson,et al.  Competitive Analysis of Caching in Distributed Databases , 1998, IEEE Trans. Parallel Distributed Syst..

[22]  Michael D. Smith,et al.  Code cache management schemes for dynamic optimizers , 2002, Proceedings Sixth Annual Workshop on Interaction between Compilers and Computer Architectures.

[23]  Subhash Bhalla,et al.  Asynchronous Operations in Distributed Concurrency Control , 2003, IEEE Trans. Knowl. Data Eng..

[24]  Tetsuya Furukawa,et al.  Concurrency Control Based on Order Constraints in Advanced Database Systems , 1996, CODAS.

[25]  Sushil Jajodia,et al.  An algorithm for dynamic data distribution , 1992, [1992 Proceedings] Second Workshop on the Management of Replicated Data.

[26]  Ouri Wolfson,et al.  A competitive dynamic data replication algorithm , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.