The SAMGrid monitoring service and its integration with MonALISA

The SAMGrid team is in the process of implementing a monitoring and information service, which fulfills several important roles in the operation of the SAMGrid system, and will replace the first generation of monitoring tools in the current deployments. The first generation tools are in general based on text log-files and represent solutions which are not scalable or maintainable. The roles of the monitoring and information service are: (1) providing diagnostics for troubleshooting the operation of SAMGrid services; (2) providing support for monitoring at the level of user jobs; (3) providing runtime support for local configuration and other information which currently must be stored centrally (thus moving the system toward greater autonomy for the SAMGrid station services, which include cache management and job management services); (4) providing intelligent collection of statistics in order to enable performance monitoring and tuning. The architecture of this service is quite flexible, permitting input from any instrumented SAMGrid application or service. It will allow multiple backend storage for archiving of (possibly) filtered monitoring events, as well as real time information displays and active notification service for alarm conditions. This service will be able to export, in a configurable manner, information to higher level Grid monitoring services, such as MonALISA. We describe our experience to date with using a prototype version together with MonALISA.