SCALEA-G: A Unified Monitoring and Performance Analysis System for the Grid

This paper describes SCALEA-G, a unified monitoring and performance analysis system for the Grid. SCALEA-G is implemented as a set of grid services based on the Open Grid Services Architecture (OGSA). SCALEA-G provides an infrastructure for conducting online monitoring and performance analysis of a variety of Grid services including computational and network resources, and Grid applications. Both push and pull models are supported, providing flexible and scalable monitoring and performance analysis. Source code and dynamic instrumentation are exploited to perform profiling and monitoring of Grid applications. A novel instrumentation request language has been developed to facilitate the interaction between client and instrumentation services.

[1]  FosterIan,et al.  Grid Services for Distributed System Integration , 2002 .

[2]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[3]  Bernd Mohr,et al.  Monitoring and Instrumentation Requests for Fortran, Java, C and C++ Programs , 2000 .

[4]  Jason Lee,et al.  NetLogger: a toolkit for distributed system performance analysis , 2000, Proceedings 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (Cat. No.PR00728).

[5]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[6]  Thomas Ludwig,et al.  OMIS 2.0 - A Universal Interface for Monitoring Systems , 1997, PVM/MPI.

[7]  Werner Nutt,et al.  R-GMA: An Information Integration System for Grid Monitoring , 2003, OTM.

[8]  Marian Bubak,et al.  The CrossGrid Performance Analysis Tool for Interactive Grid Applications , 2002, PVM/MPI.

[9]  The International Journal of High Performance Computing Applications— , 1998 .

[10]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[11]  Mark Baker,et al.  GridRM: A Resource Monitoring Architecture for the Grid , 2002, GRID.

[12]  Ian T. Foster,et al.  Security for Grid services , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[13]  Hong Linh Truong,et al.  SCALEA: a performance analysis tool for parallel programs , 2003, Concurr. Comput. Pract. Exp..

[14]  Werner Nutt,et al.  Relational Grid Monitoring Architecture (R-GMA) , 2003, ArXiv.

[15]  Péter Kacsuk,et al.  Design and implementation of a distributed monitor for semi-on-line monitoring of VisualMP applications , 2000 .

[16]  William E. Johnston,et al.  The NetLogger Methodology for High Performance Distributed Systems Performance Analysis , 1999 .

[17]  Gregor von Laszewski,et al.  A Java commodity grid kit , 2001, Concurr. Comput. Pract. Exp..

[18]  Péter Kacsuk,et al.  From Cluster Monitoring to Grid Monitoring Based on GRM , 2001, Euro-Par.

[19]  Donald D. Chamberlin XQuery: An XML query language , 2002, IBM Syst. J..

[20]  William E. Johnston,et al.  The NetLogger methodology for high performance distributed systems performance analysis , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[21]  Marian Bubak,et al.  Performance Tools for the Grid: State of the Art and Future , 2004 .

[22]  Ian T. Foster,et al.  A security architecture for computational grids , 1998, CCS '98.

[23]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[24]  Bartosz Balis,et al.  An Infrastructure for Grid Application Monitoring , 2002, PVM/MPI.

[25]  Péter Kacsuk,et al.  Monitoring Message Passing Applications in the Grid with GRM and R-GMA , 2003, PVM/MPI.

[26]  Thomas Fahringer,et al.  Standardized Intermediate Representation for Fortran, Java, C and C++ Programs , 2004 .

[27]  Ruth A. Aydt,et al.  A Grid Monitoring Architecture , 2002 .

[28]  Jeffrey K. Hollingsworth,et al.  The dynamic probe class library-an infrastructure for developing instrumentation for performance tools , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[29]  Jeffrey K. Hollingsworth,et al.  An API for Runtime Code Patching , 2000, Int. J. High Perform. Comput. Appl..