Design and implementation of a workflow-based resource broker with information system on computational grids

The grid is a promising infrastructure that can allow scientists and engineers to access resources among geographically distributed environments. Grid computing is a new technology which focuses on aggregating resources (e.g., processor cycles, disk storage, and contents) from a large-scale computing platform. Making grid computing a reality requires a resource broker to manage and monitor available resources. This paper presents a workflow-based resource broker whose main functions are matching available resources with user requests and considering network information statuses during matchmaking in computational grids. The resource broker provides a graphic user interface for accessing available and the appropriate resources via user credentials. This broker uses the Ganglia and NWS tools to monitor resource status and network-related information, respectively. Then we propose a history-based execution time estimation model to predict the execution time of parallel applications, according to previous execution results. The experimental results show that our model can accurately predict the execution time of embarrassingly parallel applications. We also report on using the Globus Toolkit to construct a grid platform called the TIGER project that integrates resources distributed across five universities in Taichung city, Taiwan, where the resource broker was developed.

[1]  Gregor von Laszewski,et al.  A Java commodity grid kit , 2001, Concurr. Comput. Pract. Exp..

[2]  Jesús Labarta,et al.  eNANOS Grid Resource Broker , 2005, EGC.

[3]  Rajkumar Buyya,et al.  High Performance Cluster Computing , 1999 .

[4]  Tao Xu,et al.  Pegasys: software for executing and integrating analyses of biological sequences , 2004, BMC Bioinformatics.

[5]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[6]  Ibm Redbooks Introduction to Grid Computing With Globus , 2003 .

[7]  Italo Epicoco,et al.  The Grid Resource Broker portal , 2007, Concurr. Comput. Pract. Exp..

[8]  Italo Epicoco,et al.  The Grid Resource Broker workflow engine , 2008 .

[9]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[10]  Massimo Cafaro,et al.  Web‐based access to the Grid using the Grid Resource Broker portal , 2002, Concurr. Comput. Pract. Exp..

[11]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[12]  Jarek Nabrzyski,et al.  Grid Resource Management , 2004 .

[13]  Ching-Hsien Hsu,et al.  On Utilization of the Grid Computing Technology for Video Conversion and 3D Rendering , 2005, ISPA.

[14]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[15]  Ian Foster,et al.  The Grid: A New Infrastructure for 21st Century Science , 2002 .

[16]  Ching-Hsien Hsu,et al.  On Utilization of the Grid Computing Technology for Video Conversion and 3D Rendering , 2005, ISPA.

[17]  Sang-Min Park,et al.  Chameleon: a resource scheduler in a data grid environment , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[18]  Rajkumar Buyya,et al.  A Grid service broker for scheduling e‐Science applications on global data Grids , 2006, Concurr. Comput. Pract. Exp..

[19]  Chao-Tung Yang,et al.  Design and Implementation of TIGER Grid: an Integrated Metropolitan-Scale Grid Environment , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[20]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[21]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[22]  Steven Tuecke,et al.  Protocols and services for distributed data-intensive science , 2002 .

[23]  Chao-Tung Yang,et al.  An Efficient Network Information Model Using NWS for Grid Computing Environments , 2005, GCC.

[24]  Chao-Tung Yang,et al.  A high-performance computational resource broker for grid computing environments , 2005, 19th International Conference on Advanced Information Networking and Applications (AINA'05) Volume 1 (AINA papers).

[25]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[26]  Chao-Tung Yang,et al.  A Resource Broker for Computing Nodes Selection in Grid Computing Environments , 2004, GCC.

[27]  Andrew L. Wendelborn,et al.  A Data-Aware Resource Broker for Data Grids , 2004, NPC.

[28]  W. Allcock,et al.  GridFTP protocol specification , 2002 .

[29]  Ian T. Foster,et al.  A Grid-Enabled MPI: Message Passing in Heterogeneous Distributed Computing Systems , 1998, Proceedings of the IEEE/ACM SC98 Conference.

[30]  R. V. van Nieuwpoort,et al.  The Grid 2: Blueprint for a New Computing Infrastructure , 2003 .

[31]  Italo Epicoco,et al.  The Grid Resource BrokerWorkflow Engine , 2007, Sixth International Conference on Grid and Cooperative Computing (GCC 2007).