On Grid Performance Evaluation Using Synthetic Workloads

Grid computing is becoming a common platform for solving large scale computing tasks. However, a number of major technical issues, including the lack of adequate performance evaluation approaches, hinder the grid computing's further development. The requirements herefore are manifold; adequate approaches must combine appropriate performance metrics, realistic workload models, and flexible tools for workload generation, submission, and analysis. In this paper we present an approach to tackle this complex problem. First, we introduce a set of grid performance objectives based on traditional and grid-specific performance metrics. Second, we synthesize the requirements for realistic grid workload modeling, e.g. co-allocation, data and network management, and failure modeling. Third, we show how GRENCHMARK, an existing framework for generating, running, and analyzing grid workloads, can be extended to implement the proposed modeling techniques. Our approach aims to be an initial and necessary step towards a common performance evaluation framework for grid environments.

[1]  Ramin Yahyapour Grid Resource Management-State of the Art and Future Trends, chapter Applying Economic Scheduling Me , 2003 .

[2]  Dror G. Feitelson Experimental analysis of the root causes of performance evaluation results: a backfilling case study , 2005, IEEE Transactions on Parallel and Distributed Systems.

[3]  W. Cirne,et al.  A comprehensive model of the supercomputer workload , 2001, Proceedings of the Fourth Annual IEEE International Workshop on Workload Characterization. WWC-4 (Cat. No.01EX538).

[4]  Dror G. Feitelson,et al.  The Forgotten Factor: Facts on Performance Evaluation and Its Dependence on Workloads , 2002, Euro-Par.

[5]  Nicola Beume,et al.  Design and Management of Complex Technical Processes and Systems by Means of Computational Intelligence Methods Scheduling Algorithm Development Based on Complex Owner Defined Objectives Scheduling Algorithm Development Based on Complex Owner Defined Objectives , 2022 .

[6]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[7]  Ian T. Foster,et al.  Experiences in Running Workloads over Grid3 , 2005, GCC.

[8]  Francisco Vilar Brasileiro,et al.  Faults in grids: why are they so bad and what can be done about it? , 2003, Proceedings. First Latin American Web Congress.

[9]  Dror G. Feitelson,et al.  Workload Modeling for Performance Evaluation , 2002, Performance.

[10]  Dick H. J. Epema,et al.  Experiences with the KOALA co-allocating scheduler in multiclusters , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[11]  Michael A. Frumkin,et al.  NAS Grid Benchmarks: a tool for Grid space exploration , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[12]  Michael Pinedo,et al.  Scheduling: Theory, Algorithms, and Systems , 1994 .

[13]  Anca I. D. Bucur,et al.  Trace-based simulations of processor co-allocation policies in multiclusters , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[14]  Marios D. Dikaiakos,et al.  GridBench: A Workbench for Grid Benchmarking , 2005, EGC.

[15]  Ramin Yahyapour,et al.  UNIVERSITY OF DORTMUND REIHE COMPUTATIONAL INTELLIGENCE COLLABORATIVE RESEARCH CENTER 531 Design and Management of Complex Technical Processes and Systems by means of Computational Intelligence Methods User Group-based Workload Analysis and Modelling , 2005 .

[16]  Alexandru Iosup,et al.  GRENCHMARK: A Framework for Analyzing, Testing, and Comparing Grids , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[17]  Ramin Yahyapour,et al.  Parallel Computer Workload Modeling with Markov Chains , 2004, JSSPP.

[18]  Giuseppe Serazzi,et al.  A Characterization of the Variation in Time of Workload Arrival Patterns , 1985, IEEE Transactions on Computers.

[19]  Warren Smith,et al.  Benchmarks and Standards for the Evaluation of Parallel Job Schedulers , 1999, JSSPP.

[20]  Jarek Nabrzyski,et al.  Grid resource management: state of the art and future trends , 2004 .

[21]  Leonid Oliker,et al.  Job Superscheduler Architecture and Performance in Computational Grid Environments , 2003, SC.

[22]  Ramin Yahyapour,et al.  Applying economic scheduling methods to Grid environments , 2004 .

[23]  Allen B. Downey,et al.  A parallel workload model and its implications for processor allocation , 1996, Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183).

[24]  Andrea C. Arpaci-Dusseau,et al.  Pipeline and batch sharing in grid workloads , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[25]  Ramin Yahyapour,et al.  Scaling of Workload Traces , 2003, JSSPP.

[26]  Ramin Yahyapour,et al.  User group-based workload analysis and modelling , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[27]  Mark J. Clement,et al.  Core Algorithms of the Maui Scheduler , 2001, JSSPP.

[28]  Andrew A. Chien,et al.  Henri Casanova , 2022 .

[29]  Geoffrey C. Fox,et al.  Grid and Cooperative Computing - GCC 2005, 4th International Conference, Beijing, China, November 30 - December 3, 2005, Proceedings , 2005, GCC.

[30]  Marian Bubak,et al.  Advances in Grid Computing - EGC 2005, European Grid Conference, Amsterdam, The Netherlands, February 14-16, 2005, Revised Selected Papers , 2005, EGC.

[31]  Dror G. Feitelson,et al.  Metric and workload effects on computer systems evaluation , 2003, Computer.

[32]  Allen B. Downey A parallel workload model and its implications for processor allocation , 2004, Cluster Computing.

[33]  Alain Abran,et al.  New Approaches in Software Measurement , 2001, Lecture Notes in Computer Science.

[34]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[35]  Emmanuel Medernach,et al.  Workload Analysis of a Cluster in a Grid Environment , 2005, JSSPP.

[36]  Francine Berman,et al.  A comprehensive model of the supercomputer workload , 2001 .

[37]  Soonwook Hwang,et al.  Grid workflow: a flexible failure handling framework for the grid , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[38]  Howard Jay Siegel,et al.  A mathematical model, heuristic, and simulation study for a basic data staging problem in a heterogeneous networking environment , 1998, Proceedings Seventh Heterogeneous Computing Workshop (HCW'98).

[39]  David Abramson,et al.  The Grid Economy , 2005, Proceedings of the IEEE.

[40]  Larry Rudolph,et al.  Metrics and Benchmarking for Parallel Job Scheduling , 1998, JSSPP.

[41]  Chaitanya K. Baru,et al.  Analysis of HPSS performance based on per-file transfer logs , 1999, 16th IEEE Symposium on Mass Storage Systems in cooperation with the 7th NASA Goddard Conference on Mass Storage Systems and Technologies (Cat. No.99CB37098).

[42]  Ian T. Foster,et al.  End-to-end quality of service for high-end applications , 2004, Comput. Commun..

[43]  Uwe Schwiegelshohn,et al.  Fairness in parallel job scheduling , 2000 .

[44]  D J Evans,et al.  Parallel processing , 1986 .

[45]  Uwe Schwiegelshohn,et al.  On Advantages of Grid Computing for Parallel Job Scheduling , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[46]  Jason Maassen,et al.  Synthetic Grid Workloads With Ibis, KOALA, and GrenchMark , 2007 .

[47]  Anca I. D. Bucur,et al.  The Performance of Processor Co-Allocation in Multicluster Systems , 2003, CCGRID.

[48]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[49]  Dror G. Feitelson,et al.  Packing Schemes for Gang Scheduling , 1996, JSSPP.

[50]  Giorgos Cheliotis,et al.  Architecture requirements for commercializing Grid resources , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[51]  Alexandru Iosup,et al.  How are Real Grids Used? The Analysis of Four Grid Traces and Its Implications , 2006, 2006 7th IEEE/ACM International Conference on Grid Computing.

[52]  Francine Berman,et al.  A model for moldable supercomputer jobs , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[53]  Miron Livny,et al.  Phoenix: making data-intensive grid applications fault-tolerant , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[54]  Hui Li,et al.  Workload Characteristics of a Multi-cluster Supercomputer , 2004, JSSPP.

[55]  Uwe Schwiegelshohn,et al.  Theory and Practice in Parallel Job Scheduling , 1997, JSSPP.

[56]  Henri Casanova,et al.  Benchmark probes for grid assessment , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[57]  Ramin Yahyapour,et al.  Benefits of global grid computing for job scheduling , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[58]  Chaki Ng,et al.  Mirage: a microeconomic resource allocation system for sensornet testbeds , 2005, The Second IEEE Workshop on Embedded Networked Sensors, 2005. EmNetS-II..

[59]  Francine Berman,et al.  Adaptive Computing on the Grid Using AppLeS , 2003, IEEE Trans. Parallel Distributed Syst..

[60]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..