A Novel Method for the Synthetic Generation of Non-I.I.D Workloads for Cloud Data Centers

Cloud data center workloads have time- dependencies and are hence non-i.i.d (independent and identically distributed). In this paper, we propose a new model-based method for creating synthetic workload traces for cloud data centers that have similar time characteristics and cumulative distributions to those of the actual traces. We evaluate our method using the actual resource request traces of Azure collected in 2019 and the well-known Google cloud trace. Our method enables generating synthetic traces that can be used for a more realistic evaluation of cloud data centers.

[1]  Christos Faloutsos,et al.  Beyond Poisson: Modeling Inter-Arrival Time of Requests in a Datacenter , 2014, PAKDD.

[2]  Ricardo Bianchini,et al.  Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms , 2017, SOSP.

[3]  Furkan Koltuk,et al.  CLOUDGEN: Workload Generation for the Evaluation of Cloud Computing Systems , 2019, 2019 27th Signal Processing and Communications Applications Conference (SIU).

[4]  Aniruddha S. Gokhale,et al.  Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[5]  Gregory R. Ganger,et al.  On the diversity of cluster workloads and its impact on research results , 2018, USENIX Annual Technical Conference.

[6]  Riccardo Gusella,et al.  Characterizing the Variability of Arrival Processes with Indexes of Dispersion , 1991, IEEE J. Sel. Areas Commun..

[7]  Rajkumar Buyya,et al.  Workload modeling for resource usage analysis and simulation in cloud computing , 2015, Comput. Electr. Eng..

[8]  Michael A. Stephens,et al.  The Anderson-Darling Statistic , 1979 .

[9]  Guangjie Han,et al.  Characteristics of Co-Allocated Online Services and Batch Jobs in Internet Data Centers: A Case Study From Alibaba Cloud , 2019, IEEE Access.

[10]  Dror G. Feitelson,et al.  Workload Modeling for Computer Systems Performance Evaluation , 2015 .

[11]  Xue Liu,et al.  BURSE: A Bursty and Self-Similar Workload Generator for Cloud Computing , 2015, IEEE Transactions on Parallel and Distributed Systems.

[12]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[13]  Jie Xu,et al.  Analysis, Modeling and Simulation of Workload Patterns in a Large-Scale Utility Cloud , 2014, IEEE Transactions on Cloud Computing.

[14]  Robert H. Shumway,et al.  Time series analysis and its applications : with R examples , 2017 .

[15]  Rajkumar Buyya,et al.  Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications’ QoS , 2015, IEEE Transactions on Cloud Computing.