Forecasting the Stability of the Data Centre Based on Real-Time Data of Batch Workload Using Times Series Models

Forecasting has diverse range of applications in many fields like weather, stock market, etc. The main highlight of this work is to forecast the values of the given metric for near future and predict the stability of the Data Centre based on the usage of that metric. Since the parameters that are being monitored in a Data Centre are large, an accurate forecasting is essential for the Data Centre architects in order to make necessary upgrades in a server system. The major criteria that result in SLA violation and loss to a particular business are peak values in performance parameters and resource utilization; hence it is very important that the peak values in performance, resource and workload be forecasted. Here, we mainly concentrate on the metric batch workload of a real-time Data Centre. In this work, we mainly focused on forecasting the batch workload using the auto regressive integrated moving average (ARIMA) model and exponential smoothing and predicted the stability of the Data Centre for the next 6 months. Further, we have performed a comparison of ARIMA model and exponential smoothing and we arrived at the conclusion that ARIMA model outperformed the other. The best model is selected based on the ACF residual correlogram, Forecast Error histogram and the error measures like root mean square error (RMSE), mean absolute error (MAE), mean absolute scale error (MASE) and p-value of Ljung-Box statistics. From the above results we conclude that ARIMA model is the best model for forecasting this time series data and hence based on the ARIMA models forecast result we predicted the stability of the Data Centre for the next 6 months.

[1]  S. Bacha,et al.  Hourly server workload forecasting up to 168 hours ahead using Seasonal ARIMA model , 2012, 2012 IEEE International Conference on Industrial Technology.

[2]  Francisco de A. T. de Carvalho,et al.  Hybrid model with dynamic architecture for forecasting time series , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[3]  Jerome A. Rolia,et al.  Workload Analysis and Demand Prediction of Enterprise Data Center Applications , 2007, 2007 IEEE 10th International Symposium on Workload Characterization.

[4]  Xiaodong Wang,et al.  Hierarchical Forecasting of Web Server Workload Using Sequential Monte Carlo Training , 2006, 2006 40th Annual Conference on Information Sciences and Systems.

[5]  Nima Amjady,et al.  Short-term hourly load forecasting using time-series modeling with peak load estimation capability , 2001 .

[6]  Xiaochen Li,et al.  Comparison and Analysis Between Holt Exponential Smoothing and Brown Exponential Smoothing Used for Freight Turnover Forecasts , 2013, 2013 Third International Conference on Intelligent System Design and Engineering Applications.

[7]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[8]  Joshua Ojo Nehinbe,et al.  A Forensic Model for Forecasting Alerts Workload and Patterns of Intrusions , 2012, 2012 UKSim 14th International Conference on Computer Modelling and Simulation.

[9]  W. R. Christiaanse Short-Term Load Forecasting Using General Exponential Smoothing , 1971 .