Optimal Power Allocation and Load Distribution for Multiple Heterogeneous Multicore Server Processors across Clouds and Data Centers

For multiple heterogeneous multicore server processors across clouds and data centers, the aggregated performance of the cloud of clouds can be optimized by load distribution and balancing. Energy efficiency is one of the most important issues for large-scale server systems in current and future data centers. The multicore processor technology provides new levels of performance and energy efficiency. The present paper aims to develop power and performance constrained load distribution methods for cloud computing in current and future large-scale data centers. In particular, we address the problem of optimal power allocation and load distribution for multiple heterogeneous multicore server processors across clouds and data centers. Our strategy is to formulate optimal power allocation and load distribution for multiple servers in a cloud of clouds as optimization problems, i.e., power constrained performance optimization and performance constrained power optimization. Our research problems in large-scale data centers are well-defined multivariable optimization problems, which explore the power-performance tradeoff by fixing one factor and minimizing the other, from the perspective of optimal load distribution. It is clear that such power and performance optimization is important for a cloud computing provider to efficiently utilize all the available resources. We model a multicore server processor as a queuing system with multiple servers. Our optimization problems are solved for two different models of core speed, where one model assumes that a core runs at zero speed when it is idle, and the other model assumes that a core runs at a constant speed. Our results in this paper provide new theoretical insights into power management and performance optimization in data centers.

[1]  Ali R. Hurson,et al.  Scheduling and Load Balancing in Parallel and Distributed Systems , 1995 .

[2]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[3]  David Sinclair Load balancing in parallel and distributed systems , 1993 .

[4]  Luca Benini,et al.  A survey of design techniques for system-level dynamic power management , 2000, IEEE Trans. Very Large Scale Integr. Syst..

[5]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.

[6]  Keqin Li Minimizing mean response time in heterogeneous multiple computer systems with a central stochastic job dispatcher , 1998 .

[7]  Suman Nath,et al.  Energy-Aware Server Provisioning and Load Dispatching for Connection-Intensive Internet Services , 2008, NSDI.

[8]  Samee Ullah Khan,et al.  Autonomic Power & Performance Management for Large-Scale Data Centers , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[9]  Navendu Jain,et al.  Managing cost, performance, and reliability tradeoffs for energy-aware server provisioning , 2011, 2011 Proceedings IEEE INFOCOM.

[10]  C. G. Rommen The probability of load balancing success in a homogeneous network , 1991 .

[11]  David Blaauw,et al.  Theoretical and practical limits of dynamic voltage scaling , 2004, Proceedings. 41st Design Automation Conference, 2004..

[12]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[13]  Kevin Li Optimizing Average Job Response Time via Decentralized Probabilistic Job Dispatching in Heterogeneous Multiple Computer Systems , 1998, Comput. J..

[14]  Xiaorui Wang,et al.  SHIP: Scalable Hierarchical Power Control for Large-Scale Data Centers , 2009, 2009 18th International Conference on Parallel Architectures and Compilation Techniques.

[15]  Israel Koren,et al.  System-level power-aware design techniques in real-time systems , 2003, Proc. IEEE.

[16]  Keqin Li Minimizing the probability of load imbalance in heterogeneous distributed computer systems , 2002 .

[17]  Keqin Li,et al.  Optimal power allocation among multiple heterogeneous servers in a data center , 2012, Sustain. Comput. Informatics Syst..

[18]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[19]  亀田 壽夫,et al.  Optimal load balancing in distributed computer systems , 1997 .

[20]  Mor Harchol-Balter,et al.  Optimality analysis of energy-performance trade-off for server farm management , 2010, Perform. Evaluation.

[21]  Albert Y. Zomaya,et al.  A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems , 2010, Adv. Comput..

[22]  Anurag Kumar,et al.  Adaptive Optimal Load Balancing in a Nonhomogeneous Multiserver System with a Central Job Scheduler , 1990, IEEE Trans. Computers.

[23]  Kevin Skadron,et al.  Power-aware computing , 2003, Computer.

[24]  C. Gary Rommel The Probability of Load Balancing Success in a Homogeneous Network , 1991, IEEE Trans. Software Eng..

[25]  Keqin Li,et al.  Optimal load distribution in nondedicated heterogeneous cluster and grid computing environments , 2008, J. Syst. Archit..

[26]  Michael Franz,et al.  Power reduction techniques for microprocessor systems , 2005, CSUR.

[27]  Erol Gelenbe,et al.  Energy-Efficient Cloud Computing , 2010, Comput. J..

[28]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[29]  Xueyan Tang,et al.  Optimizing static job scheduling in a network of heterogeneous computers , 2000, Proceedings 2000 International Conference on Parallel Processing.

[30]  David D. Yao,et al.  Optimal load balancing and scheduling in a distributed computer system , 1991, JACM.

[31]  Yu Cai,et al.  Achieving Energy Proportionality In Server Clusters , 2010 .

[32]  A. Guyot,et al.  Low power CMOS digital design , 1998, Proceedings of the Tenth International Conference on Microelectronics (Cat. No.98EX186).

[33]  Erik D. Demaine,et al.  Energy-Efficient Algorithms , 2016, ITCS.

[34]  Yefu Wang,et al.  Coordinating Power Control and Performance Management for Virtualized Server Clusters , 2011, IEEE Transactions on Parallel and Distributed Systems.

[35]  Stephen A. Jarvis,et al.  Allocating non-real-time and soft real-time jobs in multiclusters , 2006, IEEE Transactions on Parallel and Distributed Systems.

[36]  Keqin Li Optimal Load Distribution for Multiple Heterogeneous Blade Servers in a Cloud Computing Environment , 2011, IPDPS Workshops.