Adaptive resource provisioning method using application-aware machine learning based on job history in heterogeneous infrastructures

With the remarkable growth in cloud computing, computing resources can be manipulated on-demand in most scientific fields. This enables scientists to strategically select their experimental environment. Since it is hard to offer cloud resources in accordance with application characteristics, efficient resource provisioning methods are needed. This paper proposes an adaptive resource provisioning method using an application-aware machine learning technique that is based on the job history in heterogeneous infrastructures. The proposed resource provisioning method is built on two main concepts. First, it provides application-aware resource provisioning through the profiling of scientific application in a heterogeneous computing infrastructure. A resource provisioning model uses the resource usage profiles of scientific applications and job history data in heterogeneous computing infrastructures. In addition to the multilayer perceptron machine learning method, an error back-propagation approach is applied to analyze job history to re-learn the error of the output value. Second, it offers an adaptive resource scaling that is invoked by the availability of resource changes. An adaptive resource management method results in an enhancement of the overall balance between the performance and utilization of a system. For the experiments with the two CPU-intensive applications according to the method, a heterogeneous infrastructure comprising clusters and cloud environments is used. Experimental results indicate that the use of the proposed method can gratify user requests (cost and execution time) regarding its application and enhance resource usage effectiveness.

[1]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[2]  Seoyoung Kim,et al.  HTCaaS(High Throughput Computing as a Service) in Supercomputing Environment , 2014 .

[3]  S. Mrenna,et al.  Pythia 6.3 physics and manual , 2003, hep-ph/0308153.

[4]  Chung-Horng Lung,et al.  Cloud Resource Auto-scaling System Based on Hidden Markov Model (HMM) , 2014, 2014 IEEE International Conference on Semantic Computing.

[5]  Gilles Fedak,et al.  Multi-criteria and satisfaction oriented scheduling for hybrid distributed computing infrastructures , 2016, Future Gener. Comput. Syst..

[6]  Jae-Kwon Kim,et al.  Fuzzy Logic-driven Virtual Machine Resource Evaluation Method for Cloud Provisioning Service , 2013 .

[7]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[8]  Brandon M. Malone,et al.  A Learning-based Selection for Portfolio Scheduling of Scientific Applications on Heterogeneous Computing Systems , 2014, CloudCom 2014.

[9]  Ian T. Foster,et al.  Condor-G: A Computation Management Agent for Multi-Institutional Grids , 2004, Cluster Computing.

[10]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.

[11]  Rajarshi Das,et al.  A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation , 2006, 2006 IEEE International Conference on Autonomic Computing.

[12]  P. Simon Too Big to Ignore: The Business Case for Big Data , 2013 .

[13]  Jaeyoung Choi,et al.  VM auto-scaling methods for high throughput computing on hybrid infrastructure , 2015, Cluster Computing.

[14]  Calvin J. Ribbens,et al.  Hybrid Computing - Where HPC meets grid and Cloud Computing , 2011, Future Gener. Comput. Syst..

[15]  Gilles Fedak,et al.  SpeQuloS: a QoS service for BoT applications using best effort distributed computing infrastructures , 2012, HPDC '12.

[16]  Abul Bashar,et al.  Autonomic scaling of Cloud Computing resources using BN-based prediction models , 2013, 2013 IEEE 2nd International Conference on Cloud Networking (CloudNet).

[17]  Soonwook Hwang,et al.  A Job Dispatch Optimization Method on Cluster and Cloud for Large-Scale High-Throughput Computing Service , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[18]  Kun Wang,et al.  A Distributed Self-Learning Approach for Elastic Provisioning of Virtualized Cloud Resources , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.