Robustness and security in a mobile-agent based network monitoring system

We present the mechanisms for self-recovery in Konark, a mobile agent based system for monitoring network computing systems. An important aspect of our design is the use of the monitoring system's inherent capabilities to detect its own component failures. The Konark system is implemented using Ajanta. Our monitoring system achieves robustness by incorporating mechanisms for self-monitoring and self-configuration at different levels of the system architecture. The event detection, correlation, and notification mechanisms are used as the basic building blocks for failure detection. Our design uses the notion of continuous periodic detection and notification of a failure event until the failed components causing it are repaired.

[1]  Anand R. Tripathi,et al.  Security in the Ajanta mobile agent system , 2001, Softw. Pract. Exp..

[2]  Fred B. Schneider,et al.  Implementing fault-tolerant services using the state machine approach: a tutorial , 1990, CSUR.

[3]  Eugene H. Spafford,et al.  An architecture for intrusion detection using autonomous agents , 1998, Proceedings 14th Annual Computer Security Applications Conference (Cat. No.98EX217).

[4]  Joel F. Bartlett,et al.  A NonStop kernel , 1981, SOSP.

[5]  Naftaly H. Minsky,et al.  On conditions for self-healing in distributed software systems , 2003, 2003 Autonomic Computing Workshop.

[6]  Eric A. Brewer,et al.  Cluster-based scalable network services , 1997, SOSP.

[7]  Anand R. Tripathi,et al.  Secure Multi-agent Coordination in a Network Monitoring System , 2002, SELMAS.

[8]  Aaron Kershenbaum,et al.  Mobile Agents: Are They a Good Idea? , 1996, Mobile Object Systems.

[9]  Paolo Bellavista,et al.  An Open Secure Mobile Agent Framework for Systems Management , 1999, Journal of Network and Systems Management.

[10]  Vern Paxson,et al.  Bro: a system for detecting network intruders in real-time , 1998, Comput. Networks.

[11]  George Candea,et al.  JAGR: an autonomous self-recovering application server , 2003, 2003 Autonomic Computing Workshop.

[12]  Peter G. Neumann,et al.  EMERALD: Event Monitoring Enabling Responses to Anomalous Live Disturbances , 1997, CCS 2002.