A reference model for grid architectures and its validation

Computing and data‐intensive applications in physics, medicine, biology, graphics, and business intelligence require large, distributed infrastructures to address present and future challenges. For example, process mining applications face terabytes of event data and computationally expensive algorithms. Computer grids are increasingly used to deal with such challenges. However, grid computing is often approached in an ad hoc and engineering‐like manner. Despite the availability of many software packages for grid applications, a good conceptual model of the grid is missing. This paper provides a formal description of the grid in terms of a colored Petri net (CPN). This CPN can be seen as a reference model for grids, as it clarifies the basic concepts at the conceptual level. Moreover, the CPN allows for various kinds of analysis, ranging from verification to performance analysis. We validate our model with real‐life experiments on a testbed grid architecture available in our group, and we show how the model can be used to estimate throughput times for scientific workflows. Copyright © 2009 John Wiley & Sons, Ltd.
