Distributed Algorithms for Dynamic Survivability of Multiagent Systems

Though multiagent systems (MASs) are being increasingly used, few methods exist to ensure survivability of MASs. All existing methods suffer from two flaws. First, a centralized survivability algorithm (CSA) ensures survivability of the MAS – unfortunately, if the node on which the CSA exists goes down, the survivability of the MAS is questionable. Second, no mechanism exists to change how the MAS is deployed when external factors trigger a re-evaluation of the survivability of the MAS. In this paper, we present three algorithms to address these two important problems. Our algorithms can be built on top of any CSA. Our algorithms are completely distributed and can handle external triggers to compute a new deployment. We report on experiments assessing the efficiency of these algorithms.

[1]  Michael R. Lyu,et al.  Improving the N-version programming process through the evolution of a design paradigm , 1993 .

[2]  Walter J. Gutjahr,et al.  Reliability optimization of redundant software with correlated failures , 1998, Proceedings Ninth International Symposium on Software Reliability Engineering (Cat. No.98TB100257).

[3]  Sarit Kraus,et al.  Probabilistically Survivable MASs , 2003, IJCAI.

[4]  Jiawei Zhang,et al.  Approximation algorithms for facility location problems , 2004 .

[5]  Éva Tardos,et al.  Approximation algorithms for facility location problems (extended abstract) , 1997, STOC '97.

[6]  André Schiper,et al.  FATOMAS-a fault-tolerant mobile agent system based on the agent-dependent approach , 2001, 2001 International Conference on Dependable Systems and Networks.

[7]  Ravishankar K. Iyer,et al.  Active replication of multithreaded applications , 2006, IEEE Transactions on Parallel and Distributed Systems.

[8]  Katia P. Sycara,et al.  Configuration management for multi-agent systems , 2001, AGENTS '01.

[9]  Mark Klein,et al.  Exception handling in agent systems , 1999, AGENTS '99.

[10]  Nils J. Nilsson,et al.  Artificial Intelligence: A New Synthesis , 1997 .

[11]  Chengqi Zhang,et al.  Multi-Agent Systems Methodologies and Applications , 1996, Lecture Notes in Computer Science.

[12]  Milind Tambe,et al.  Robust Agent Teams via Socially-Attentive Monitoring , 2000, J. Artif. Intell. Res..

[13]  Somesh Jha,et al.  Increasing Resource Utilization and Task Performance by Agent Cloning , 1998, ATAL.

[14]  Hector J. Levesque,et al.  The adaptive agent architecture: achieving fault-tolerance using persistent broker teams , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[15]  David B. Shmoys,et al.  Approximation algorithms for facility location problems , 2000, APPROX.

[16]  Katia P. Sycara,et al.  Cloning for Intelligent Adaptive Information Agents , 1996, DAI.

[17]  Pierre Sens,et al.  Towards Adaptive Fault-Tolerance For Distributed Multi-Agent Systems , 2001 .

[18]  Siddhartha Mishra Agent fault tolerance using group communica-tion , 2001 .

[19]  Siddhartha Mishra,et al.  Fault-tolerance in agent-based computing systems , 2000 .

[20]  Xiaocong Fan On Splitting and Cloning Agents , 2001 .

[21]  Ralph Deters,et al.  Improving fault-tolerance by replicating agents , 2002, AAMAS '02.