Performance-based middleware services for grid computing

Managing resources and distributed workload across multiple administrative boundaries is a key issue in grid computing and middleware research. Standard services that perform look-up, match-making, scheduling and staging are being developed to address this problem. As active middleware components, these services will allow users to identify and utilize appropriate resources that provide sustainable system- and user-level qualities of service. This paper documents two enhanced match-making services that address the performance implications of executing a particular workload on a given set of resources. These services are based on an established performance prediction system that is employed at both the local (intra-domain) and global (multi-domain) levels to provide dynamic workload steering. These additional facilities bring about significant performance improvements, the details of which are presented with regard to the user-perceived quality of service and to the grid resource utilization.

[1]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[2]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[3]  Ian Foster,et al.  A quality of service architecture that combines resource reservation and application adaptation , 2000, 2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400).

[4]  Donna N. Dillenberger,et al.  Adaptive Algorithms for Managing a Distributed Data Processing Workload , 1997, IBM Syst. J..

[5]  G. Laszewski,et al.  A QoS Guided Scheduling Algorithm for Grid Computing * , 2002 .

[6]  Subhash Saini,et al.  Local grid scheduling techniques using performance prediction , 2003 .

[7]  Subhash Saini,et al.  ARMS: An agent-based resource management system for grid computing , 2002, Sci. Program..

[8]  Rajesh Raman,et al.  Matchmaking: distributed resource management for high throughput computing , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[9]  Graham R. Nudd,et al.  Performance optimization of financial option calculations , 2000, Parallel Comput..

[10]  Jack J. Dongarra,et al.  The NetSolve environment: progressing towards the seamless grid , 2000, Proceedings 2000. International Workshop on Parallel Processing.

[11]  Fabrizio Petrini,et al.  Predictive Performance and Scalability Modeling of a Large-Scale Application , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[12]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[13]  David Abramson,et al.  Economic models for resource management and scheduling in Grid computing , 2002, Concurr. Comput. Pract. Exp..

[14]  Fabrizio Petrini,et al.  Predictive Performance and Scalability Modeling of a Large-Scale Application , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[15]  Graham R. Nudd,et al.  High Performance Service Discovery in Large-Scale Multi-Agent and Mobile-Agent Systems , 2001, Int. J. Softw. Eng. Knowl. Eng..

[16]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[17]  Vipin Kumar,et al.  Information power grid: The new frontier in parallel computing? , 1999, IEEE Concurr..

[18]  Graham R. Nudd,et al.  Pace—A Toolset for the Performance Prediction of Parallel and Distributed Systems , 2000, Int. J. High Perform. Comput. Appl..

[19]  Francine Berman,et al.  The GrADS Project: Software Support for High-Level Grid Application Development , 2001, Int. J. High Perform. Comput. Appl..

[20]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[21]  Henri Casanova,et al.  Netsolve: a Network-Enabled Server for Solving Computational Science Problems , 1997, Int. J. High Perform. Comput. Appl..

[22]  Graham R. Nudd,et al.  Optimisation of application execution on dynamic systems , 2001, Future Gener. Comput. Syst..

[23]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[24]  Graham R. Nudd,et al.  Performance Modelling of Parallel and Distributed Computing Using PACE 1 , .