An Approach to Optimise Resource Provision with Energy-Awareness in Datacentres by Combating Task Heterogeneity

Cloud workloads are increasingly heterogeneous such that a single Cloud job may encompass one to several tasks, and tasks belonging to the same job may behave distinctively during their actual execution. This inherent task heterogeneity imposes increased complexities in achieving an energy efficient management of the Cloud jobs. The phenomenon of a few proportions of tasks characterising increased resource intensity within a given job usually lead the providers to over-provision all the encompassed tasks, resulting in majority of the tasks incurring an increased proportions of resource idleness. To this end, this paper proposes a novel analytics framework which integrates a resource estimation module to estimate the resource requirements of tasks a priori, a straggler classification module to classify tasks based on their resource intensity, and a resource optimisation module to optimise the level of resource provision depending on the task nature and various runtime factors. Performance evaluations conducted both theoretically and through practical experiments prove that the proposed methodology performs better than the compared statistical resource estimation methods and existing models of straggler mitigation, and further demonstrate the effectiveness of the proposed methodology in achieving energy conservation by postulating appropriate level of resource provisioning for task execution.

[1]  Wen-Yi Hung,et al.  A prediction based energy conserving resources allocation scheme for cloud computing , 2014, 2014 IEEE International Conference on Granular Computing (GrC).

[2]  Lu Liu,et al.  InOt-RePCoN: Forecasting user behavioural trend in large-scale cloud environments , 2018, Future Gener. Comput. Syst..

[3]  Aniruddha S. Gokhale,et al.  Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[4]  Dzmitry Kliazovich,et al.  GreenCloud: a packet-level simulator of energy-aware cloud computing data centers , 2010, The Journal of Supercomputing.

[5]  Bernd Freisleben,et al.  Energy-Efficient Virtual Machine Consolidation , 2013, IT Professional.

[6]  Randy H. Katz,et al.  Wrangler: Predictable and Faster Jobs using Fewer Resources , 2014, SoCC.

[7]  Junliang Chen,et al.  Workload Predicting-Based Automatic Scaling in Service Clouds , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[8]  Scott Shenker,et al.  Usenix Association 10th Usenix Symposium on Networked Systems Design and Implementation (nsdi '13) 185 Effective Straggler Mitigation: Attack of the Clones , 2022 .

[9]  Antonio Corradi,et al.  VM consolidation: A real case based on OpenStack Cloud , 2014, Future Gener. Comput. Syst..

[10]  Albert G. Greenberg,et al.  Reining in the Outliers in Map-Reduce Clusters using Mantri , 2010, OSDI.

[11]  Mateusz Jarus,et al.  Performance bounded energy efficient virtual machine allocation in the global cloud , 2014, Sustain. Comput. Informatics Syst..

[12]  Christoph Meinel,et al.  Energy efficient scheduling of HPC-jobs on virtualize clusters using host and VM dynamic configuration , 2012, OPSR.

[13]  Jie Xu,et al.  An Approach for Modeling and Ranking Node-Level Stragglers in Cloud Datacenters , 2016, 2016 IEEE International Conference on Services Computing (SCC).

[14]  Marcello Trovati,et al.  Latency-Aware Empirical Analysis of the Workloads for Reducing Excess Energy Consumptions at Cloud Datacentres , 2016, 2016 IEEE Symposium on Service-Oriented System Engineering (SOSE).

[15]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[16]  Bingsheng He,et al.  Green-aware workload scheduling in geographically distributed data centers , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[17]  Randy H. Katz,et al.  Improving MapReduce Performance in Heterogeneous Environments , 2008, OSDI.

[18]  Josh Rosen,et al.  Fine-Grained Micro-Tasks for MapReduce Skew-Handling , 2012 .

[19]  Jie Xu,et al.  Straggler Detection in Parallel Computing Systems through Dynamic Threshold Calculation , 2016, 2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA).

[20]  Brian D. Noble,et al.  Bobtail: Avoiding Long Tails in the Cloud , 2013, NSDI.

[21]  Nick Antonopoulos,et al.  Characterisation of Hidden Periodicity in Large-Scale Cloud Datacentre Environments , 2017, 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[22]  Ziming Zhang,et al.  Characterizing Power and Energy Usage in Cloud Computing Systems , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[23]  Quan Chen,et al.  SAMR: A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.

[24]  Wei Li,et al.  Load Prediction-Based Automatic Scaling Cloud Computing , 2016, 2016 International Conference on Networking and Network Applications (NaNA).

[25]  Zhen Xiao,et al.  Improving MapReduce Performance Using Smart Speculative Execution Strategy , 2014, IEEE Transactions on Computers.

[26]  Ce-Kuen Shieh,et al.  Improving Speculative Execution Performance with Coworker for Cloud Computing , 2011, 2011 IEEE 17th International Conference on Parallel and Distributed Systems.

[27]  ZhiHui Lv,et al.  RPPS: A Novel Resource Prediction and Provisioning Scheme in Cloud Data Center , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[28]  Jie Xu,et al.  Timely Long Tail Identification through Agent Based Monitoring and Analytics , 2015, 2015 IEEE 18th International Symposium on Real-Time Distributed Computing.

[29]  Neeraja J. Yadwadkar Proactive Straggler Avoidance using Machine Learning , 2012 .

[30]  Cees T. A. M. de Laat,et al.  Profiling Energy Consumption of VMs for Green Cloud Computing , 2011, 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing.

[31]  Valentin Cristea,et al.  Modelling Requirements for Enabling Meta-scheduling in Inter-Clouds and Inter-Enterprises , 2011, 2011 Third International Conference on Intelligent Networking and Collaborative Systems.

[32]  Jie Xu,et al.  An Analysis of Failure-Related Energy Waste in a Large-Scale Cloud Environment , 2014, IEEE Transactions on Emerging Topics in Computing.

[33]  Farokh B. Bastani,et al.  Workload Estimation for Improving Resource Management Decisions in the Cloud , 2015, 2015 IEEE Twelfth International Symposium on Autonomous Decentralized Systems.