Antecedence Graph Approach to Checkpointing for Fault Tolerance in Multi Agent System

Checkpointing has been widely used for providing fault tolerance in multi-agent systems. But the traditional message passing based checkpointing and rollback algorithms may suffer from problems of excess bandwidth consumption and large overheads. In order to maintain consistency of multi agent system, the checkpointing is forced on all participating agents that may result in blocking of agents’ operations to carry out checkpointing. These overheads could be considerably reduced if the checkpointing would be forced only on selective agents instead of all agents. This paper presents a low latency, non-blocking checkpointing scheme which marks out dependent agents using Antecedence graphs and then checkpoints are forced on only these agents. To recover from failures, the antecedence graphs and message logs are regenerated and normal operations continued. The proposed scheme reports less overheads and reduced recovery times as compared to existing schemes.

[1]  Bharat K. Bhargava,et al.  Independent checkpointing and concurrent rollback for recovery in distributed systems-an optimistic approach , 1988, Proceedings [1988] Seventh Symposium on Reliable Distributed Systems.

[2]  Michael R. Lyu,et al.  Design and evaluation of a fault-tolerant mobile-agent system , 2004, IEEE Intelligent Systems.

[3]  D. Manivannan,et al.  Quasi-Synchronous Checkpointing: Models, Characterization, and Classification , 1999, IEEE Trans. Parallel Distributed Syst..

[4]  Hyacinth S. Nwana,et al.  Software agents: an overview , 1996, The Knowledge Engineering Review.

[5]  William G. Tuel,et al.  Parallel checkpoint/restart without message logging , 2000, Proceedings 2000. International Workshop on Parallel Processing.

[6]  Willy Zwaenepoel,et al.  Manetho: fault tolerance in distributed systems using rollback-recovery and process replication , 1994 .

[7]  Aamer Nadeem,et al.  An Antecedence Graph Approach for Fault Tolerance in a Multi-Agent , 2006, 7th International Conference on Mobile Data Management (MDM'06).