A Realistic Integrated Model of Parallel System Workloads

Performance evaluation is a significant step in the study of scheduling algorithms in large-scale parallel systems ranging from supercomputers to clusters and grids. One of the key factors that have a strong effect on the evaluation results is the workloads (or traces) used in experiments. In practice, several researchers use unrealistic synthetic workloads in their scheduling evaluations because they lack models that can help generate realistic synthetic workloads. In this paper we propose a full model to capture the following characteristics of real parallel system workloads: 1) long range dependence in the job arrival process, 2) temporal and spatial burstiness, 3) bag-oftasks behaviour, and 4) correlation between the runtime and the number of processors. Validation of our model with real traces shows that our model not only captures the above characteristics but also fits the marginal distributions well. In addition, we also present an approach to quantify burstiness in a job arrival process (temporal) as well as burstiness in the load of a trace (spatial).

[1]  Alexandru Iosup,et al.  The performance of bags-of-tasks in large-scale distributed systems , 2008, HPDC '08.

[2]  Evgenia Smirni,et al.  How to parameterize models with bursty workloads , 2008, PERV.

[3]  Murad S. Taqqu,et al.  Theory and applications of long-range dependence , 2003 .

[4]  Richard G. Baraniuk,et al.  A Multifractal Wavelet Model with Application to Network Traffic , 1999, IEEE Trans. Inf. Theory.

[5]  Jan Broeckhove,et al.  Dynamic Grid Scheduling Using Job Runtime Requirements and Variable Resource Availability , 2008, Euro-Par.

[6]  Tran Ngoc Minh,et al.  Modeling Job Arrival Process with Long Range Dependence and Burstiness Characteristics , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[7]  Christos Faloutsos,et al.  Capturing the spatio-temporal behavior of real traffic data , 2002, Perform. Evaluation.

[8]  Ramin Yahyapour,et al.  Parallel Computer Workload Modeling with Markov Chains , 2004, JSSPP.

[9]  Jens Mache,et al.  A Comparative Study of Real Workload Traces and Synthetic Workload Models for Parallel Job Scheduling , 1998, JSSPP.

[10]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[11]  H. E. Hurst,et al.  Long-Term Storage Capacity of Reservoirs , 1951 .

[12]  Hui Li,et al.  Towards A Better Understanding of Workload Dynamics on Data-Intensive Clusters and Grids , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[13]  M. A. Johnson,et al.  Descriptors of arrival-process burstiness with application to the discrete Markovian arrival process , 1996, Queueing Syst. Theory Appl..

[14]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[15]  Chuliang Weng,et al.  Heuristic scheduling for bag-of-tasks applications in combination with QoS in the computational grid , 2005, Future Gener. Comput. Syst..

[16]  Rajkumar Buyya,et al.  Model-based simulation and performance evaluation of grid scheduling strategies , 2009, Future Gener. Comput. Syst..

[17]  Hui Li,et al.  Workload Characteristics of a Multi-cluster Supercomputer , 2004, JSSPP.

[18]  Athina P. Petropulu,et al.  Long-range dependence and heavy-tail modeling for teletraffic data , 2002, IEEE Signal Process. Mag..

[19]  Cosimo Anglano,et al.  Fault-Tolerant Scheduling for Bag-of-Tasks Grid Applications , 2005, EGC.

[20]  Patrice Abry,et al.  Long-Range Dependence: Theory and Applications , 2002 .

[21]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[22]  Eric J. Heikkila,et al.  Adjusting Spatial-Entropy Measures for Scale and Resolution Effects , 2006 .

[23]  Hui Li Long Range Dependent Job Arrival Process and Its Implications in Grid Environments , 2007 .

[24]  Dror G. Feitelson,et al.  Locality of sampling and diversity in parallel system workloads , 2007, ICS '07.

[25]  Hui Li,et al.  Workload dynamics on clusters and grids , 2008, The Journal of Supercomputing.