Effective Data Distribution and Reallocation Strategies for Fast Query Response in Distributed Query-Intensive Data Environments

Modern large distributed applications, such as mobile communications and banking services, require fast responses to enormous numbers of frequent query requests. Such applications usually run in a distributed query-intensive data environment, where the system response time depends significantly on how the data is distributed. Motivated by this need for efficiency, we develop two novel strategies that reduce query response time through load balancing: a static data distribution strategy, DDH, and a dynamic data reallocation strategy, DRC. DDH uses a hash-based heuristic technique to distribute data off-line according to the query history. DRC reallocates data dynamically at runtime to adapt to changing query patterns in the system. To validate the performance of these two strategies, experiments are conducted using a simulation environment and real customer data. Experimental results show that both strategies offer favorable performance as the query load of the system increases.
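
The abstract does not give the details of DDH or DRC, so the following Python sketch only illustrates the general idea of the two phases under stated assumptions. It is not the authors' algorithm. The function names (`static_placement`, `reallocate`), the per-item query counts, the fallback rule, and the `tolerance` parameter are all hypothetical. The off-line phase tries a hash-chosen node first and falls back to the least-loaded node when the hash choice would exceed the average load; the runtime phase greedily moves items from the busiest node to the idlest one.

```python
import hashlib


def stable_hash(key: str) -> int:
    """Deterministic hash (Python's built-in hash() is salted per process)."""
    return int.from_bytes(hashlib.sha256(key.encode()).digest()[:8], "big")


def static_placement(query_counts: dict, num_nodes: int):
    """Off-line placement from a query history (hypothetical heuristic).

    Items are placed heaviest first. Each item goes to its hash-preferred
    node unless that would push the node above the average load, in which
    case it goes to the currently least-loaded node.
    """
    target = sum(query_counts.values()) / num_nodes
    load = [0] * num_nodes
    placement = {}
    ordered = sorted(query_counts.items(), key=lambda kv: (-kv[1], kv[0]))
    for key, count in ordered:
        node = stable_hash(key) % num_nodes
        if load[node] + count > target:
            node = min(range(num_nodes), key=lambda n: load[n])
        placement[key] = node
        load[node] += count
    return placement, load


def reallocate(placement: dict, recent_counts: dict, num_nodes: int,
               tolerance: float = 1.1):
    """Runtime rebalancing against recent query counts (hypothetical).

    Repeatedly moves one item from the busiest node to the idlest node
    until the busiest node is within tolerance * mean load, or until no
    single move would reduce the imbalance.
    """
    placement = dict(placement)  # do not mutate the caller's mapping
    load = [0] * num_nodes
    for key, node in placement.items():
        load[node] += recent_counts.get(key, 0)
    mean = sum(load) / num_nodes
    moves = []
    while True:
        hot = max(range(num_nodes), key=lambda n: load[n])
        cold = min(range(num_nodes), key=lambda n: load[n])
        if load[hot] <= tolerance * mean:
            break
        gap = load[hot] - load[cold]
        # Only items smaller than the gap reduce the imbalance when moved.
        candidates = [
            (recent_counts.get(k, 0), k)
            for k, n in placement.items()
            if n == hot and 0 < recent_counts.get(k, 0) < gap
        ]
        if not candidates:
            break
        # Prefer the item whose load is closest to half the gap.
        count, key = min(candidates, key=lambda c: abs(c[0] - gap / 2))
        placement[key] = cold
        load[hot] -= count
        load[cold] += count
        moves.append((key, hot, cold))
    return placement, load, moves


if __name__ == "__main__":
    history = {"a": 50, "b": 30, "c": 10, "d": 10}
    placement, load = static_placement(history, num_nodes=2)
    print("static load per node:", load)

    # Query pattern shifts at runtime; rebalance against the new counts.
    recent = {"a": 40, "b": 40, "c": 10, "d": 10}
    skewed = {"a": 0, "b": 0, "c": 1, "d": 1}
    _, new_load, moves = reallocate(skewed, recent, num_nodes=2)
    print("rebalanced load:", new_load, "moves:", moves)
```

A real system would also need to weigh the cost of migrating data against the expected gain in response time, which this sketch ignores.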
