Ad Hoc Debugging Environment for Grid Applications

Debugging can help programmers to locate the reasons for incorrect program behaviors. The dynamic and heterogeneous characteristics of computational grids make it much harder to debug grid applications. In this paper, we give the definition the concept of ad hoc debugging environment, which uncover the nature of debugging behavior in computational grids. Besides solving some normal problems in tradition parallel debugging, such as user interface, application instrumentation, we first address the nondeterministic dynamic behavior of computing environment in grids during the debugging session. We evaluate the similarity between computing environments, in which the grid application can be re-executed for cyclic debugging.

[1]  James Coyle,et al.  Deadlock detection in MPI programs , 2002, Concurr. Comput. Pract. Exp..

[2]  Robert H. B. Netzer,et al.  Debugging race conditions in message-passing programs , 1996, SPDT '96.

[3]  Dieter Kranzlmüller,et al.  DeWiz - Event-Based Debugging on the Grid , 2002, PDP.

[4]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[5]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[6]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[7]  Robert Hood,et al.  A debugger for computational grid applications , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[8]  Paraskevas Evripidou,et al.  Net-dbx: A Web-Based Debugger of MPI Programs Over Low-Bandwidth Lines , 2001, IEEE Trans. Parallel Distributed Syst..

[9]  Jason Lee,et al.  NetLogger: a toolkit for distributed system performance analysis , 2000, Proceedings 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (Cat. No.PR00728).

[10]  Denis Caromel,et al.  IC2D: Interactive Control and Debugging of Distribution , 2001, LSSC.