Distributed grid applications are becoming more and more popular as the use of complex grid middlewares becomes extensive, and more facilities are offered by these complex pieces of software. But as grid middlewares grow and offer more advanced features, they become more complex and weighty, as well as hard to tune. As the performance of a distributed grid applications can be strongly influenced by the operation of the underlying grid middleware, it becomes important to study and analyze their behaviour and performance. In this paper we present the eDragon monitoring framework (eDMF), a set of tools for instrumentation and analysis of grid middleware, that provides an unique environment to study the performance of grid applications. The eDMF is composed of a set of specialised monitoring tools as well as a flexible and powerful performance analysis platform. Additionally we also provide a practical application of the eDMF to the Globus toolkit 4 (GT4), one of the most extended and popular grid middlewares, showing how it helped us in the detection and resolution of several job management problems observed in the GT4 middleware
[1]
Mathilde Romberg,et al.
The UNICORE Grid infrastructure
,
2002,
Sci. Program..
[2]
Jordi Torres,et al.
Complete instrumentation requirements for performance analysis of Web based technologies
,
2003,
2003 IEEE International Symposium on Performance Analysis of Systems and Software. ISPASS 2003..
[3]
Borja Sotomayor,et al.
Globus toolkit 4 : programming Java services
,
2006
.
[4]
Jesús Labarta,et al.
Performance analysis of multilevel parallel applications on shared memory architectures
,
2003,
Proceedings International Parallel and Distributed Processing Symposium.
[5]
Michel Dagenais,et al.
Measuring and Characterizing System Behavior Using Kernel-Level Event Logging
,
2000,
USENIX Annual Technical Conference, General Track.
[6]
Ian Foster,et al.
The Globus toolkit
,
1998
.