Reasoning Based Workload Performance Prediction in Cloud Data Centers

Cloud computing provides utility-based and scalable services to end-users. In the past decade, the demands for resource management in cloud computing have increased substantially which lead to certain challenges such as optimal resource utilization, power consumption, and service level agreement violations. Workload performance prediction serves as an assistance to address these issues. In this paper, we propose a prediction model based on clustered Case-Based Reasoning (CBR). The proposed model determines the performance metrics for workload prior to the co-operation of autonomic computing characteristics. Thus, CBR provides optimal scheduling of resources and workload monitoring for cloud data centers. In order to validate the proposed CBR-based prediction model, we perform a series of experiments and evaluate the effectiveness in terms of precision, recall, f-measure, and mean square error rate. We generate the cases for CBR using traces from the Google cluster data center. Moreover, we also validate our proposed prediction model against Support Vector Machine (SVM) prediction scheme. Experimental results show that the proposed CBR outperforms the SVM-based approach and yields 10% improvement in terms of precision.

[1]  Abdelmounaam Rezgui,et al.  FailureSim: A System for Predicting Hardware Failures in Cloud Data Centers Using Neural Networks , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[2]  Naveed Ahmad,et al.  A Game-based Thermal-Aware Resource Allocation Strategy for Data Centers , 2019, IEEE Transactions on Cloud Computing.

[3]  Franck Cappello,et al.  Adaptive Algorithm for Minimizing Cloud Task Length with Prediction Errors , 2014, IEEE Transactions on Cloud Computing.

[4]  Georges Da Costa,et al.  Modeling, classifying and generating large-scale Google-like workload , 2018, Sustain. Comput. Informatics Syst..

[5]  Rajkumar Buyya,et al.  Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications’ QoS , 2015, IEEE Transactions on Cloud Computing.

[6]  Zhao Xu,et al.  Modeling and problem solving of building defects using point clouds and enhanced case-based reasoning , 2018 .

[7]  Jaime G. Carbonell,et al.  An Overview of Machine Learning , 1983 .

[8]  Farokh B. Bastani,et al.  Integrating Clustering and Learning for Improved Workload Prediction in the Cloud , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[9]  Muhammad Faheem,et al.  Performance prediction and adaptation for database management system workload using Case-Based Reasoning approach , 2018, Inf. Syst..

[10]  Sankaran Mahadevan,et al.  An improved method to construct basic probability assignment based on the confusion matrix for classification problem , 2016, Inf. Sci..

[11]  K. Murphy,et al.  Overview of Machine Learning , 2022, International Journal of Advanced Research in Science, Communication and Technology.

[12]  Jie Wu,et al.  The dynamic cuckoo filter , 2017, 2017 IEEE 25th International Conference on Network Protocols (ICNP).

[13]  Maryam Amiri,et al.  Survey on prediction models of applications for resources provisioning in cloud , 2017, J. Netw. Comput. Appl..

[14]  Yubin Zhang,et al.  Experience mining based on case-based reasoning for dispute settlement of international construction projects , 2019 .

[15]  Carlos Becker Westphall,et al.  Cloud resource management: A survey on forecasting and profiling models , 2015, J. Netw. Comput. Appl..

[16]  Chen Jian,et al.  A Review and Analysis of Case-Based Reasoning Research , 2015, 2015 International Conference on Intelligent Transportation, Big Data and Smart City.

[17]  Olaf David,et al.  Demystifying the Clouds: Harnessing Resource Utilization Models for Cost Effective Infrastructure Alternatives , 2017, IEEE Transactions on Cloud Computing.

[18]  Saju Mathew,et al.  Implementation of Cloud Computing in Education - A Revolution , 2012 .

[19]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[20]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[21]  Laurence T. Yang,et al.  An Efficient Deep Learning Model to Predict Cloud Workload for Industry Informatics , 2018, IEEE Transactions on Industrial Informatics.

[22]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[23]  Xiaofei Wang,et al.  Dynamic Resource Prediction and Allocation for Cloud Data Center Using the Multiobjective Genetic Algorithm , 2018, IEEE Systems Journal.

[24]  Xiaomin Zhu,et al.  WITCAT: A Workload Spike Targeted Cloud Management Solution , 2018, SoCC.

[25]  Elizabeth Chang,et al.  Comparing time series with machine learning-based prediction approaches for violation management in cloud SLAs , 2018, Future Gener. Comput. Syst..

[26]  Kwonsik Song,et al.  Covariance effect analysis of similarity measurement methods for early construction cost estimation using case-based reasoning , 2017 .

[27]  Rajkumar Buyya,et al.  Architectural Models for Resource Management in the Grid , 2000, GRID.

[28]  Fan Zhang,et al.  Popularity-aware differentiated distributed stream processing on skewed streams , 2017, 2017 IEEE 25th International Conference on Network Protocols (ICNP).

[29]  Hai Jin,et al.  FastJoin: A Skewness-Aware Distributed Stream Join System , 2019, 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[30]  Jitendra Kumar,et al.  Workload prediction in cloud using artificial neural network and adaptive differential evolution , 2018, Future Gener. Comput. Syst..

[31]  Yanjun Qi,et al.  Empirical Evaluation of Workload Forecasting Techniques for Predictive Cloud Resource Scaling , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[32]  Bowen Zhou,et al.  Pythia: Improving Datacenter Utilization via Precise Contention Prediction for Multiple Co-located Workloads , 2018, Middleware.

[33]  Jitendra Kumar,et al.  Long Short Term Memory Recurrent Neural Network (LSTM-RNN) Based Workload Forecasting Model For Cloud Datacenters , 2018 .