Using host criticalities for fault tolerance in mobile agent systems

Monitoring is a crucial factor for smooth run of distributed systems such as mobile agent based system. Various activities in such systems require monitoring such as performance analysis and tuning, scheduling strategies and fault detection. In this paper we present monitoring and fault tolerance technique for mobile agent based systems. We present mobile agent based fault prevention and detection technique where the team of mobile agents monitor each host in mobile agent based system. This research focuses on building an automatic, adaptive and predictive determining policy where critical host agents are identified in advance by monitoring agents, to avoid their failures. The novelty of proposed approach is constant collection and updating of local as well as global information of the system. This policy is determined by calculating weights; taking into account the criticality of the hosts by their monitoring agents which keep updating the weights of hosts. These weights act are used for decision making of checkpointing. These monitoring mobile agents act together to detect undesirable behaviors and also provide support for restoring the system back to normalcy. We also report on the result of reliability and performance issue of our proposed approach.

[1]  P. Venkataram,et al.  Applications of agent technology in communications: a review , 2004, Comput. Commun..

[2]  Heon Young Yeom,et al.  The cost of checkpointing, logging and recovery for the mobile agent systems , 2002, 2002 Pacific Rim International Symposium on Dependable Computing, 2002. Proceedings..

[3]  Jiannong Cao,et al.  CIC: an integrated approach to checkpointing in mobile agent systems , 2006, 2006 Semantics, Knowledge and Grid, Second International Conference on.

[4]  Jean-Pierre Briot,et al.  Adaptive replication of large-scale multi-agent systems: towards a fault-tolerant multi-agent platform , 2005, SELMAS '05.

[5]  Michael R. Lyu,et al.  Design and evaluation of a fault-tolerant mobile-agent system , 2004, IEEE Intelligent Systems.

[6]  Xavier Défago,et al.  A Survey of Mobile Agent-Based Fault-Tolerant Technology , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[7]  Lotfi A. Zadeh,et al.  A New Direction in AI: Toward a Computational Theory of Perceptions , 2001, AI Mag..

[8]  L. Alvisi,et al.  A Survey of Rollback-Recovery Protocols , 2002 .

[9]  Bradley R. Schmerl,et al.  Model-based adaptation for self-healing systems , 2002, WOSS '02.

[10]  André Schiper,et al.  FATOMAS-a fault-tolerant mobile agent system based on the agent-dependent approach , 2001, 2001 International Conference on Dependable Systems and Networks.

[11]  Hong Shen,et al.  Analysis of Mobile Agents' Fault-Tolerant Behavior , 2004, PDCAT.

[12]  Hyacinth S. Nwana,et al.  Software agents: an overview , 1996, The Knowledge Engineering Review.

[13]  Marco Colombetti,et al.  An analysis of agent speech acts as institutional actions , 2002, AAMAS '02.

[14]  Markus Straßer,et al.  A fault-tolerant protocol for providing the exactly-once property of mobile agents , 1998, Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281).

[15]  Kyeongmo Park A Fault-Tolerant Mobile Agent Model in Replicated Secure Services , 2004, ICCSA.