A WSLA-based monitoring system for grid service - GSMon

The target of grid monitoring and management is to monitor services in the grid for fault detection, performance analysis, performance tuning, load balancing and scheduling. This paper introduces the design and implementation of a WSLA-based grid monitoring and management system - GSMon. In GSMon, we use a service-oriented architecture, give monitoring information of the grid service a unique format by using WSLA and develop extended modules in the OGSA container. GSMon conforms to grid service and Web service standards and uses dynamic deployment technology to solve the problem of monitoring new grid services. It is a novel infrastructure for grid monitoring and management with high flexibility and scalability.

[1]  Klara Nahrstedt,et al.  CPU service classes for multimedia applications , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[2]  Carl Kesselman,et al.  A Network Performance Tool for Grid Environments , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[3]  William E. Johnston,et al.  The NetLogger methodology for high performance distributed systems performance analysis , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[4]  J. C. Yan,et al.  Performance tuning with AIMS/spl minus/an Automated Instrumentation and Monitoring System for multicomputers , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[5]  Jeffrey S. Vetter,et al.  Real-Time Performance Monitoring, Adaptive Control, and Interactive Steering of Computational Grids , 2000, Int. J. High Perform. Comput. Appl..

[6]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[7]  Lundy Lewis,et al.  Managing Business and Service Networks , 2001, Network and Systems Management.

[8]  Ragunathan Rajkumar,et al.  An interactive interface and RT-Mach support for monitoring and controlling resource management , 1995, Proceedings Real-Time Technology and Applications Symposium.

[9]  William E. Johnston,et al.  JAVA Agents for Distributed System Management , 1998 .

[10]  Devesh Bhatt,et al.  SPI: an instrumentation development environment for parallel/distributed systems , 1995, Proceedings of 9th International Parallel Processing Symposium.

[11]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[12]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[13]  Barton P. Miller,et al.  The Paradyn Parallel Performance Measurement Tool , 1995, Computer.