Optimal Load Distribution for Multiple Heterogeneous Blade Servers in a Cloud Computing Environment

{em Given a group of heterogeneous blade servers in a cloud computing environment or a data center of a cloud computing provider, each having its own size and speed and its own amount of preloaded special tasks, we are facing the problem of optimal distribution of generic tasks over these blade servers, such that the average response time of generic tasks is minimized. Such performance optimization is important for a cloud computing provider to efficiently utilize all the available resources and to deliver the highest quality of service. We develop a queueing model for a group of heterogeneous blade servers, and formulate and solve the optimal load distribution problem of generic tasks for multiple heterogeneous blade servers in a cloud computing environment in two different situations, namely, special tasks with and without higher priority. Extensive numerical examples and data are demonstrated and some important observations are made. It is found that server sizes, server speeds, task execution requirement, and the arrival rates of special tasks all have significant impact on the average response time of generic tasks, especially when the total arrival rate of generic tasks is large. It is also found that the server size heterogeneity and the server speed heterogeneity do not have much impact on the average response time of generic tasks. Furthermore, larger (smaller, respectively) heterogeneity results in shorter (longer, respectively) average response time of generic tasks.

[1]  Keqin Li Minimizing mean response time in heterogeneous multiple computer systems with a central stochastic job dispatcher , 1998 .

[2]  Stephen A. Jarvis,et al.  Allocating non-real-time and soft real-time jobs in multiclusters , 2006, IEEE Transactions on Parallel and Distributed Systems.

[3]  亀田 壽夫,et al.  Optimal load balancing in distributed computer systems , 1997 .

[4]  M. Thomas Queueing Systems. Volume 1: Theory (Leonard Kleinrock) , 1976 .

[5]  C. Gary Rommel The Probability of Load Balancing Success in a Homogeneous Network , 1991, IEEE Trans. Software Eng..

[6]  Anurag Kumar,et al.  Adaptive Optimal Load Balancing in a Nonhomogeneous Multiserver System with a Central Job Scheduler , 1990, IEEE Trans. Computers.

[7]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[8]  Kevin Li Optimizing Average Job Response Time via Decentralized Probabilistic Job Dispatching in Heterogeneous Multiple Computer Systems , 1998, Comput. J..

[9]  Xueyan Tang,et al.  Optimizing static job scheduling in a network of heterogeneous computers , 2000, Proceedings 2000 International Conference on Parallel Processing.

[10]  David D. Yao,et al.  Optimal load balancing and scheduling in a distributed computer system , 1991, JACM.

[11]  Keqin Li Minimizing the probability of load imbalance in heterogeneous distributed computer systems , 2002 .

[12]  C. G. Rommen The probability of load balancing success in a homogeneous network , 1991 .

[13]  Ali R. Hurson,et al.  Scheduling and Load Balancing in Parallel and Distributed Systems , 1995 .

[14]  Keqin Li,et al.  Optimal load distribution in nondedicated heterogeneous cluster and grid computing environments , 2008, J. Syst. Archit..