Modeling, classifying and generating large-scale Google-like workload

One of the key elements needed to test most large-scale scheduling algorithms is a testing infrastructure. Large scale is of upmost importance as failures and complex behaviors are common occurrences only at such scale. In order to test the reaction of a system to failures or extreme behaviors, it is necessary to be able to create large scale environments. Such an infrastructure must be reproducible so that several studies are able to compare themselves but also capable of diversity as otherwise it would risk to limit them to particular subcases. In this article, we propose a generic adaptable and reusable model of large scale workload. The original schema comes from the Google Cluster Workload Traces which is a perfect representative of large scale production workload. The methodology to produce the model can be used at different precision levels and can classify the input level without human supervision. Contrary to most model analysis of such traces, we propose along with our model a reference implementation in order for other studies using our results to produce comparable experiments.

[1]  Sangyeun Cho,et al.  Characterizing Machines and Workloads on a Google Cluster , 2012, 2012 41st International Conference on Parallel Processing Workshops.

[2]  Jie Xu,et al.  Analysis, Modeling and Simulation of Workload Patterns in a Large-Scale Utility Cloud , 2014, IEEE Transactions on Cloud Computing.

[3]  Bo Yang,et al.  On time-sensitive revenue management in green data centers , 2017, Sustain. Comput. Informatics Syst..

[4]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[5]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[6]  Marios D. Dikaiakos,et al.  Nine months in the life of EGEE: a look from the South , 2007, 2007 15th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[7]  Selma Boumerdassi,et al.  Characterizing Servers Workload in Cloud Datacenters , 2015, 2015 3rd International Conference on Future Internet of Things and Cloud.

[8]  Sheng Di,et al.  Characterization and Comparison of Cloud versus Grid Workloads , 2012, 2012 IEEE International Conference on Cluster Computing.

[9]  Rajeev Gandhi,et al.  An Analysis of Traces from a Production MapReduce Cluster , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[10]  Franck Cappello,et al.  Characterizing Cloud Applications on a Google Data Center , 2013, 2013 42nd International Conference on Parallel Processing.

[11]  Yu Chen,et al.  Quantitative workload analysis and prediction using Google cluster traces , 2016, 2016 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[12]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[13]  Alexandru Iosup,et al.  The Grid Workloads Archive , 2008, Future Gener. Comput. Syst..

[14]  Mansaf Alam,et al.  Analysis and Clustering of Workload in Google Cluster Trace Based on Resource Usage , 2015, 2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES).

[15]  David Wright,et al.  Scheduling workloads in a network of datacentres to reduce electricity cost and carbon footprint , 2015, Sustain. Comput. Informatics Syst..

[16]  Charles Reiss,et al.  Towards understanding heterogeneous clouds at scale : Google trace analysis , 2012 .