User-level grid monitoring with Inca 2

The primary goal in creating Grids is to provide unified and coherent access to distributed computing, data storage and analysis, instruments, and other resources to advance scientific exploration. Grids combine multiple complex and interdependent systems that span several administrative domains. This complexity poses challenges for both the administrators who build and maintain Grid resources and the scientists who use them. While other Grid monitoring tools provide system-level information on the utilization of Grid resources, the Inca system provides user-level Grid monitoring through periodic, automated testing of the software and services required to support Grid operation. Inca can be used by Grid operators, system administrators, and application users to identify, analyze, and troubleshoot user-level Grid failures, thereby improving Grid stability. In this paper, we describe the new features of our current Inca release, Inca 2. We then describe the architecture of the Inca 2 system and present use cases from two Inca 2 deployments in production environments.
