Detection of time series patterns and periodicity of cloud computing workloads

Abstract Workload pattern detection can be used as part of a proactive decision-making approach to optimize resource provisioning strategies and anticipate any performance problem in cloud computing environments. Existing workload pattern detection approaches suffer from two essential limitations which restrict their effectiveness in such environments. Specifically, they require massive human intervention, and are specific to particular types of workload data. To overcome these limitations, we propose in this paper a generic workload pattern and periodicity detection technique that employs the prefix transposition approach from the molecular biology domain to detect workload periodicity in node-specific and aggregated cloud environments. The strengths of the proposed technique compared to the state of the art lie in its ability to be applied to any type of workload, and also to detect patterns of varying lengths, amplitudes, and shapes. Experiments conducted on cloud server nodes and aggregated CPU and throughput workload datasets collected from Information Technology (IT) and Telecom domains reveal that our solution improves the accuracy of the detections, especially in harsh environments, where the lengths, shapes, and amplitudes of patterns vary, as compared to the autocorrelation technique. Other findings show that the proposed approach is also highly efficient at detecting multiple short-term and long-term periodic patterns on any type of time series-based cloud computing workloads of different time granularity.

[1]  Manziba Akanda Nishi,et al.  An efficient approach to mine flexible periodic patterns in time series databases , 2015, Eng. Appl. Artif. Intell..

[2]  Zhibin Gao,et al.  Simulation Study on Multi-Rate Time-Frequency Analysis of Non-Stationary Signals , 2018 .

[3]  Zanoni Dias,et al.  Sorting by Prefix Reversals and Prefix Transpositions , 2015, Discret. Appl. Math..

[4]  Pasquale Pagano,et al.  Analysing and forecasting fisheries time series: purse seine in Indian Ocean as a case study , 2016 .

[5]  Manziba Akanda Nishi,et al.  Effective periodic pattern mining in time series databases , 2013, Expert Syst. Appl..

[6]  Walid G. Aref,et al.  Periodicity detection in time series databases , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Mohammad Sohel Rahman,et al.  Prefix and suffix transreversals on binary and ternary strings , 2015, J. Discrete Algorithms.

[8]  Reda Alhajj,et al.  A Framework for Periodic Outlier Pattern Detection in Time-Series Sequences , 2014, IEEE Transactions on Cybernetics.

[9]  Philip S. Yu,et al.  A Periodicity-based Parallel Time Series Prediction Algorithm in Cloud Computing Environments , 2018, Inf. Sci..

[10]  Pasquale Pagano,et al.  The gCube system: Delivering Virtual Research Environments as-a-Service , 2019, Future Gener. Comput. Syst..

[11]  Reda Alhajj,et al.  STNR: A suffix tree based noise resilient algorithm for periodicity detection in time series databases , 2010, Applied Intelligence.

[12]  Ronald K. Pearson,et al.  BMC Bioinformatics BioMed Central Methodology article , 2005 .

[13]  Mohammad Sohel Rahman,et al.  Prefix transpositions on binary and ternary strings , 2013, Inf. Process. Lett..

[14]  Tao Wang,et al.  Workload-aware anomaly detection for Web applications , 2014, J. Syst. Softw..

[15]  Yves Lemieux,et al.  Abnormal behavior detection using resource level to service level metrics mapping in virtualized systems , 2020, Future Gener. Comput. Syst..

[16]  Carson K. Leung,et al.  A new framework for mining weighted periodic patterns in time series databases , 2017, Expert Syst. Appl..

[17]  Jie Xu,et al.  Analysis, Modeling and Simulation of Workload Patterns in a Large-Scale Utility Cloud , 2014, IEEE Transactions on Cloud Computing.

[18]  S. Mehdi Hashemi,et al.  Modeling and Forecasting the Urban Volume Using Stochastic Differential Equations , 2014, IEEE Transactions on Intelligent Transportation Systems.