Diagnosing resource usage failures in multi-agent systems

Present and formally define the Temporal Multi-Agent Resource Allocation diagnosis problem.Propose an efficient solution to this problem by a novel encoding to Boolean Satisfiability (SAT).Demonstrate empirically the scalability of the proposed approach. In the not-so-far future, autonomous vehicles will be ubiquitous and, consequently, need to be coordinated to avoid traffic jams and car accidents. A failure in one or more autonomous vehicles may break this coordination, resulting in reduced efficiency (due to traffic load) or even bodily harm (due to accidents). The challenge we address in this paper is to identify the root cause of such failures. Identifying the faulty vehicles in such cases is crucial in order to know which vehicles to repair to avoid future failures as well as for determining accountability (e.g., for legal purposes). More generally, this paper discusses multi-agent systems (MAS) in which the agents use a shared pool of resources and they coordinate to avoid resource contention by agreeing on a temporal resource allocation. The problem we address, called the Temporal Multi-Agent Resource Allocation (TMARA) diagnosis problem (TMARA-Diag), is to find the root cause of failures in such MAS that are caused by malfunctioning agents that use resources not allocated to them. As in the autonomous vehicles example, such failures may cause the MAS to perform suboptimally or even fail, potentially causing a chain reaction of failures, and we aim to identify the root cause of such failures, i.e., which agents did not follow the planned resource allocation. We show how to formalize TMARA-Diag as a model-based diagnosis problem and how to compile it to a set of logical constraints that can be compiled to Boolean satisfiability (SAT) and solved efficiently with modern SAT solvers. Importantly, the proposed solution does not require the agents to share their actual plans, only the agreed upon temporal resource allocation and the resources used at the time of failure. Such solutions are key in the development and success of intelligent, large, and security-aware MAS.

[1]  Peter Struss,et al.  "Physical Negation" Integrating Fault Models into the General Diagnostic Engine , 1989, IJCAI.

[2]  Peter Stone,et al.  A Multiagent Approach to Autonomous Intersection Management , 2008, J. Artif. Intell. Res..

[3]  Pietro Torasso,et al.  Plan Diagnosis and Agent Diagnosis in Multi-agent Systems , 2007, AI*IA.

[4]  Milind Tambe,et al.  Robust Agent Teams via Socially-Attentive Monitoring , 2000, J. Artif. Intell. Res..

[5]  Sherif Sakr,et al.  Network-based social coordination of business processes , 2016, Inf. Syst..

[6]  Meir Kalech Diagnosis of coordination failures: a matrix-based approach , 2010, Autonomous Agents and Multi-Agent Systems.

[7]  Raymond Reiter,et al.  A Theory of Diagnosis from First Principles , 1986, Artif. Intell..

[8]  Celia Gutiérrez An Analysis Architecture for Communications in Multi-agent Systems , 2013, Int. J. Interact. Multim. Artif. Intell..

[9]  Alexander Feldman,et al.  Solving Strong-Fault Diagnostic Models by Model Relaxation , 2009, IJCAI.

[10]  Brian C. Williams,et al.  Diagnosing Multiple Faults , 1987, Artif. Intell..

[11]  Nico Roos,et al.  Primary and secondary diagnosis of multi-agent plan execution , 2009, Autonomous Agents and Multi-Agent Systems.

[12]  Peter J. Stuckey,et al.  Boolean Equi-propagation for Optimized SAT Encoding , 2011, CP.

[13]  Rosaldo J. F. Rossetti,et al.  Spectrum-Based Fault Localisation for Multi-Agent Systems , 2015, IJCAI.

[14]  Claude Castelluccia,et al.  Extending SAT Solvers to Cryptographic Problems , 2009, SAT.

[15]  Meir Kalech,et al.  Towards Model-Based Diagnosis of Coordination Failures , 2005, AAAI.

[16]  Meir Kalech,et al.  Exploring the Duality in Conflict-Directed Model-Based Diagnosis , 2012, AAAI.

[17]  Dean Allemang,et al.  The Computational Complexity of Abduction , 1991, Artif. Intell..

[18]  Brian C. Williams,et al.  Conflict-directed A* and its role in model-based embedded systems , 2007, Discret. Appl. Math..

[19]  Roberto Micalizio,et al.  Explaining interdependent action delays in multiagent plans execution , 2015, Autonomous Agents and Multi-Agent Systems.

[20]  Franz Wotawa,et al.  The Route to Success - A Performance Comparison of Diagnosis Algorithms , 2013, IJCAI.

[21]  Iván García-Magariño,et al.  Revealing bullying patterns in multi-agent systems , 2011, J. Syst. Softw..

[22]  Wheeler Ruml,et al.  On-line Planning and Scheduling: An Application to Controlling Modular Printers , 2008, AAAI.

[23]  Roberto Micalizio A Distributed Control Loop for Autonomous Recovery in a Multi-Agent Plan , 2009, IJCAI.

[24]  Meir Kalech,et al.  On the design of coordination diagnosis algorithms for teams of situated agents , 2007, Artif. Intell..

[25]  Christine Julien,et al.  Evasion planning for autonomous vehicles at intersections , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Rina Dechter,et al.  Temporal Constraint Networks , 1989, Artif. Intell..

[27]  Rosaldo J. F. Rossetti,et al.  Empirical Evaluation of Similarity Coefficients for Multiagent Fault Localization , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[28]  Meir Kalech,et al.  A Novel SAT-Based Approach to Model Based Diagnosis , 2014, J. Artif. Intell. Res..

[29]  Sanem Sariel,et al.  Dynamic Temporal Planning for Multirobot Systems , 2011, Automated Action Planning for Autonomous Mobile Robots.

[30]  Celia Gutiérrez Analysis of Bullying in Cooperative Multi-agent Systems' Communications , 2013, Int. J. Interact. Multim. Artif. Intell..

[31]  Brian C. Williams,et al.  Diagnosis with Behavioral Modes , 1989, IJCAI.

[32]  Yann Chevaleyre,et al.  Issues in Multiagent Resource Allocation , 2006, Informatica.

[33]  Pietro Torasso,et al.  Cooperative Monitoring to Diagnose Multiagent Plans , 2014, J. Artif. Intell. Res..

[34]  Mark Abramson,et al.  Executing Reactive, Model-based Programs through Graph-based Temporal Planning , 2001, IJCAI.

[35]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[36]  Bikramjit Banerjee,et al.  Detection of Plan Deviation in Multi-Agent Systems , 2016, AAAI.

[37]  Meir Kalech,et al.  Diagnosis of Multi-Robot Coordination Failures Using Distributed CSP Algorithms , 2006, AAAI.

[38]  Peter Zoeteweij,et al.  A practical evaluation of spectrum-based fault localization , 2009, J. Syst. Softw..

[39]  Nico Roos,et al.  Models and methods for plan diagnosis , 2009, Autonomous Agents and Multi-Agent Systems.