A Monitoring System Based on Nagios for Data Grid Environments

✽ This research was supported in part by the National Science Council, Taiwan, R.O.C., under Grants NSC 95-3114-P-001-007-MY3 and NSC 99-2631-H001-024.

a Corresponding author: hwwei@iis.sinica.edu.tw; Institute of Information Science, Academia Sinica, No. 128, Section 2, Academia Road, Nankang, Taipei, Taiwan, R.O.C.; Phone: 886-2-2788-3799 ext. 2471; Fax: 886-2-2782-4814

Abstract

The amount of digital data in today’s society is already enormous, and it will continue to grow exponentially. It is therefore necessary to devise new ways to preserve and manage the data effectively and efficiently. SRB (Storage Resource Broker) and its extension iRODS (the Integrated Rule-Oriented Data System) are data grid technologies for managing colossal amounts of data. In a distributed environment, monitoring systems oversee the operation of computing systems. The monitoring service is crucial because it helps ensure a high-quality computing environment and reliable services. In this paper, we introduce a monitoring system called SIAM, which is based on Nagios. SIAM supports full monitoring services for SRB/iRODS-based systems, including fault-tolerance and notification functions. This study focuses on extending existing components and notification functions to satisfy clients’ needs and to improve our system’s failover scheme. The experimental results show that the proposed system is feasible for cloud storage services and that it is adaptable, robust, and responsive in the face of system failures. Overall, SIAM significantly enhances the reliability of SRB/iRODS-based systems.