Energy-Aware Scheduling for Precedence-Constrained Parallel Virtual Machines in Virtualized Data Centers

Large scale Internet services are expected to only increase in complexity and popularity. Their energy consumption is also a major concern in data centers. Smart scheduling of their sub-services on data center Physical Machines (PM) can effectively improve their energy as well as performance. Since today servers are not energy-proportional yet, a major and traditionally neglected source of inefficiency in them is the utilization level of PMs. We present two scheduling algorithms for precedence-constrained parallel Virtual Machines (VM) in a virtualized data center where each VM represents a sub-service of the Internet-scale service. Our algorithms use virtualization technology to increase utilization of the PMs, and hence reduce total number of active PMs, to improve energy with minimal effect on makespan. Both proposed algorithms have a polynomial time complexity which make them suitable options for scheduling of large services. Simulation results using real-world services demonstrate that the algorithms are capable of increasing utilization level of PMs on average by 52 % and improving energy consumption by 18 % while the makespan of services is degraded less than 2 %.

[1]  Wei Zhang,et al.  A Dynamic Simulated Annealing Algorithm with Self-adaptive Technique for Grid Scheduling , 2009, 2009 WRI Global Congress on Intelligent Systems.

[2]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems (Wiley Series on Parallel and Distributed Computing) , 2007 .

[3]  Hamid Arabnejad,et al.  A Budget Constrained Scheduling Algorithm for Workflow Applications , 2014, Journal of Grid Computing.

[4]  Gil Neiger,et al.  Intel virtualization technology , 2005, Computer.

[5]  Li Zhao,et al.  SCEC CyberShake Workflows - Automating Probabilistic Seismic Hazard Analysis Calculations , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[6]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .

[7]  P. Laird Institutional Profile: The USC Epigenome Center , 2009 .

[8]  Mohsen Sharifi,et al.  PASTA: a power-aware solution to scheduling of precedence-constrained tasks on heterogeneous computing resources , 2012, Computing.

[9]  T. Aaron Gulliver,et al.  Efficient Workflow Scheduling for Grid Computing Using a Leveled Multi-objective Genetic Algorithm , 2014, Journal of Grid Computing.

[10]  Nawwaf N. Kharma,et al.  A high performance algorithm for static task scheduling in heterogeneous distributed computing systems , 2008, J. Parallel Distributed Comput..

[11]  Keqin Li,et al.  Energy efficient scheduling of parallel tasks on multiprocessor computers , 2012, The Journal of Supercomputing.

[12]  Keqin Li,et al.  Scheduling Precedence Constrained Tasks with Reduced Processor Energy on Multiprocessor Computers , 2012, IEEE Transactions on Computers.

[13]  Luiz Fernando Bittencourt,et al.  Towards the Scheduling of Multiple Workflows on Computational Grids , 2010, Journal of Grid Computing.

[14]  Albert Y. Zomaya,et al.  Author manuscript, published in "Journal of Parallel and Distributed Computing (2011)" A Parallel Bi-objective Hybrid Metaheuristic for Energy-aware Scheduling for Cloud Computing Systems , 2011 .

[15]  Poonam Panwar,et al.  A Genetic Algorithm Based Technique for Efficient Scheduling of Tasks on Multiprocessor System , 2011, SocProS.

[16]  Michael Franz,et al.  Power reduction techniques for microprocessor systems , 2005, CSUR.

[17]  Pascal Bouvry,et al.  Energy-Aware Scheduling on Multicore Heterogeneous Grid Computing Systems , 2013, Journal of Grid Computing.

[18]  L. Minas,et al.  Energy Efficiency for Information Technology: How to Reduce Power Consumption in Servers and Data Centers , 2009 .

[19]  Yves Robert,et al.  Parallel Gaussian elimination on an MIMD computer , 1988, Parallel Comput..

[20]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[21]  José Antonio Lozano,et al.  Towards a Greener Cloud Infrastructure Management using Optimized Placement Policies , 2015, Journal of Grid Computing.

[22]  Miron Livny,et al.  Correction: High-Throughput, Kingdom-Wide Prediction and Annotation of Bacterial Non-Coding RNAs , 2008, PLoS ONE.

[23]  Joshua R. Smith,et al.  LIGO: the Laser Interferometer Gravitational-Wave Observatory , 1992, Science.

[24]  Ümit V. Çatalyürek,et al.  A task duplication based bottom-up scheduling algorithm for heterogeneous environments , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[25]  Suman Nath,et al.  Energy-Aware Server Provisioning and Load Dispatching for Connection-Intensive Internet Services , 2008, NSDI.

[26]  Sanjeev Baskiyar,et al.  Scheduling directed a-cyclic task graphs on heterogeneous network of workstations to minimize schedule length , 2003, 2003 International Conference on Parallel Processing Workshops, 2003. Proceedings..

[27]  Min-You Wu,et al.  Local search for DAG scheduling and task assignment , 1997, Proceedings of the 1997 International Conference on Parallel Processing (Cat. No.97TB100162).

[28]  Emmanuel Jeannot,et al.  Triplet: A clustering scheduling algorithm for heterogeneous systems , 2001, Proceedings International Conference on Parallel Processing Workshops.

[29]  Ching-Hsien Hsu,et al.  Optimizing Energy Consumption with Task Consolidation in Clouds , 2014, Inf. Sci..

[30]  Anton Beloglazov,et al.  Energy-efficient management of virtual machines in data centers for cloud computing , 2013 .

[31]  Jack J. Dongarra,et al.  Scheduling workflow applications on processors with different capabilities , 2006, Future Gener. Comput. Syst..

[32]  Chi-Kwong Li,et al.  Heterogeneous Dominant Sequence Cluster (HDSC): a low complexity heterogeneous scheduling algorithm , 1997, 1997 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM. 10 Years Networking the Pacific Rim, 1987-1997.

[33]  Hesham El-Rewini,et al.  Scheduling Parallel Program Tasks onto Arbitrary Target Machines , 1990, J. Parallel Distributed Comput..

[34]  Füsun Özgüner,et al.  Parallelizing Existing Applications in a Distributed Heterogeneous Environment , 1995 .

[35]  Kenli Li,et al.  List scheduling with duplication for heterogeneous computing systems , 2010, J. Parallel Distributed Comput..

[36]  Jun Gu,et al.  Efficient Local Search for DAG Scheduling , 2001, IEEE Trans. Parallel Distributed Syst..

[37]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[38]  Wei Du,et al.  Energy-Aware Task Clustering Scheduling Algorithm for Heterogeneous Clusters , 2011, 2011 IEEE/ACM International Conference on Green Computing and Communications.

[39]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[40]  Qian Zhu,et al.  Power-Aware Consolidation of Scientific Workflows in Virtualized Environments , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[41]  Nagarajan Kandasamy,et al.  Power and performance management of virtualized computing environments via lookahead control , 2008, 2008 International Conference on Autonomic Computing.

[42]  Dario Pompili,et al.  Energy-Efficient Thermal-Aware Autonomic Management of Virtualized HPC Cloud Infrastructure , 2012, Journal of Grid Computing.

[43]  Albert Y. Zomaya,et al.  Energy Conscious Scheduling for Distributed Computing Systems under Different Operating Conditions , 2011, IEEE Transactions on Parallel and Distributed Systems.

[44]  Min Xie,et al.  Iterative list scheduling for heterogeneous computing , 2005, J. Parallel Distributed Comput..

[45]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[46]  Xiao Qin,et al.  EAD and PEBD: Two Energy-Aware Duplication Scheduling Algorithms for Parallel Tasks on Homogeneous Clusters , 2011, IEEE Transactions on Computers.

[47]  Edward A. Lee,et al.  A Compile-Time Scheduling Heuristic for Interconnection-Constrained Heterogeneous Processor Architectures , 1993, IEEE Trans. Parallel Distributed Syst..

[48]  Jan Janecek,et al.  A high performance, low complexity algorithm for compile-time task scheduling in heterogeneous systems , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[49]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems , 2007, Wiley series on parallel and distributed computing.

[50]  Saurabh Kumar,et al.  Energy Efficient Utilization of Resources in Cloud Computing Systems , 2016 .

[51]  Thomas D. Burd,et al.  Energy efficient CMOS microprocessor design , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[52]  David S. Johnson,et al.  Some simplified NP-complete problems , 1974, STOC '74.

[53]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[54]  Qingyuan Deng,et al.  MemScale: active low-power modes for main memory , 2011, ASPLOS XVI.

[55]  Margaret Martonosi,et al.  Computer Architecture Techniques for Power-Efficiency , 2008, Computer Architecture Techniques for Power-Efficiency.

[56]  Tao Yang,et al.  DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors , 1994, IEEE Trans. Parallel Distributed Syst..

[57]  Ewa Deelman,et al.  Pegasus: Mapping Large-Scale Workflows to Distributed Resources , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[58]  J. Koomey,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431: Appendices , 2008 .

[59]  M. Livny,et al.  High-Throughput, Kingdom-Wide Prediction and Annotation of Bacterial Non-Coding RNAs , 2008, PloS one.

[60]  Margaret Martonosi,et al.  Cache decay: exploiting generational behavior to reduce cache leakage power , 2001, ISCA 2001.