Effective, Quantitative, Obscured Observation-Based Fault Detection in Multi-Agent Systems (Extended Abstract)

A key challenge in multi agent systems is verifying that all agents obey the system rules at any moment [10, 6]. In behavior-based systems, each agent is in some ‘state’ at any moment. The system designer defines what states each agent might take, according to its teammates states (plans [9, 1] or policies [8]). Being a distributed system, no single agent has knowledge of the others. Monitoring for fault detection must rely on gathering information by communication or observation. This information is not always accurate [9, 5]. There are various types of approaches for that issue. Some researches define a-priory the possible failures, and identify them at run time [7, 4]. Others, which we adopt, prefer to define the allowed behavior, and identify exceptions from it [6, 8], in the following way: (a) Define a policy of the state-combinations agents are allowed to take; (b) at run time, each agent observes its teammate, deducing their possible states; (c) then, it compares them to those allowed by the policy and see if: (1) all the possible states are allowed (no fault); (2) none of them is allowed (fault); (3) some are allowed and some are not (possible fault). Since the overall number of joint states in the system is m (number of agents powered by the number of states each agent might take), the näıve comparison has exponential complexity in both space and time. Various researches suggests ways for reducing that. In [8], a binary matrix based policy is used. Each matrix (supercombination or s-comb) represents the states each agent is allowed to take; a pol-

[1]  Milind Tambe,et al.  Towards Flexible Teamwork , 1997, J. Artif. Intell. Res..

[2]  Milind Tambe,et al.  Robust Agent Teams via Socially-Attentive Monitoring , 2000, J. Artif. Intell. Res..

[3]  Bikramjit Banerjee,et al.  Multi-Agent Plan Recognition: Formalization and Algorithms , 2010, AAAI.

[4]  Mark Klein,et al.  An experimental evaluation of domain-independent fault handling services in open multi-agent systems , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[5]  Gita Reese Sukthankar,et al.  Hypothesis Pruning and Ranking for Large Plan Recognition Problems , 2008, AAAI.

[6]  Meir Kalech,et al.  A representation for coordination fault detection in large-scale multi-agent systems , 2009, Annals of Mathematics and Artificial Intelligence.

[7]  Meir Kalech Diagnosis of coordination failures: a matrix-based approach , 2010, Autonomous Agents and Multi-Agent Systems.

[8]  Mark Klein,et al.  Exception handling in agent systems , 1999, AGENTS '99.

[9]  Bryan Horling,et al.  Diagnosis as an integral part of multi-agent adaptability , 2000, Proceedings DARPA Information Survivability Conference and Exposition. DISCEX'00.

[10]  L. Comtet,et al.  Advanced Combinatorics: The Art of Finite and Infinite Expansions , 1974 .