Experimental evaluation of dynamic data allocation strategies in a distributed database with changing workloads

Traditionally, allocation of data in distributed database management systems has been determined by off-line analysis and optimization. This technique works well for static database access patterns, but is often inadequate for frequently changing workloads. In this paper we address how to dynamically reallocate data for partitionable distributed databases with changing access patterns. Rather than complicated and expensive optimization algorithms, a simple heuristic is presented and shown, via an implementation study, to improve system throughput by 3 based system. Based on artificial wide area network delays, we show that dynamic reallocation can improve system throughput by a factor of two and a half for wide area networks. We also show that individual site load must be taken into consideration when reallocating data, and provide a simple policy that incorporates load in the reallocation decision.
