Optimistic Planning for Consensus

An important challenge in multiagent systems is consensus, in which the agents are required to synchronize certain controlled variables of interest, often using only an incomplete and time-varying communication graph. We propose a consensus approach based on optimistic planning (OP), a predictive control algorithm that finds near-optimal control actions for any nonlinear dynamics and reward (cost) function. At every step, each agent uses OP to solve a local control problem with rewards that express the consensus objectives. Neighboring agents coordinate by exchanging their predicted behaviors in a predefined order. Due to its generality, OP consensus can adapt to any agent dynamics and, by changing the reward function, to a variety of consensus objectives. OP consensus is demonstrated for velocity consensus (flocking) with a time-varying communication graph, where it preserves connectivity better than a classical algorithm; and for leaderless and leader-based consensus of robotic arms, where OP easily deals with the nonlinear dynamics.

[1]  George J. Pappas,et al.  Flocking in Teams of Nonholonomic Agents , 2003 .

[2]  L. Buşoniu,et al.  A comprehensive survey of multi-agent reinforcement learning , 2011 .

[3]  Michael L. Littman,et al.  Sample-Based Planning for Continuous Action Markov Decision Processes , 2011, ICAPS.

[4]  Nikos Vlassis,et al.  A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence I Mobk077-fm Synthesis Lectures on Artificial Intelligence and Machine Learning a Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence a Concise Introduction to Multiagent Systems and D , 2007 .

[5]  Alberto Bemporad,et al.  Decentralized model predictive control , 2010 .

[6]  Randal W. Beard,et al.  Distributed Consensus in Multi-vehicle Cooperative Control - Theory and Applications , 2007, Communications and Control Engineering.

[7]  Antoine Girard,et al.  Sufficient conditions for flocking via graph robustness analysis , 2010, 49th IEEE Conference on Decision and Control (CDC).

[8]  Rémi Munos,et al.  Optimistic Planning of Deterministic Systems , 2008, EWRL.

[9]  Guanrong Chen,et al.  Adaptive second-order consensus of networked mobile agents with nonlinear dynamics , 2011, Autom..

[10]  Guangfu Ma,et al.  Distributed Coordinated Tracking With a Dynamic Leader for Multiple Euler-Lagrange Systems , 2011, IEEE Transactions on Automatic Control.

[11]  Bart De Schutter,et al.  Approximate dynamic programming with a fuzzy parameterization , 2010, Autom..

[12]  Riccardo Scattolini,et al.  Model Predictive Control Schemes for Consensus in Multi-Agent Systems with Single- and Double-Integrator Dynamics , 2009, IEEE Transactions on Automatic Control.

[13]  Dimitri P. Bertsekas,et al.  Dynamic programming and optimal control, 3rd Edition , 2005 .

[14]  Reza Olfati-Saber,et al.  Flocking for multi-agent dynamic systems: algorithms and theory , 2006, IEEE Transactions on Automatic Control.

[15]  Bart De Schutter,et al.  Multi-agent model predictive control for transportation networks: Serial versus parallel schemes , 2008, Eng. Appl. Artif. Intell..

[16]  Karl Henrik Johansson,et al.  A Study On Distributed Model Predictive Consensus , 2008, ArXiv.

[17]  Lucian Busoniu,et al.  Optimistic planning for Markov decision processes , 2012, AISTATS.

[18]  Wenwu Yu,et al.  Flocking of multi-agent dynamical systems based on pseudo-leader mechanism , 2009, Syst. Control. Lett..

[19]  Panagiotis D. Christofides,et al.  Sequential and Iterative Architectures for Distributed Model Predictive Control of Nonlinear Process Systems , 2010 .

[20]  Claudio De Persis,et al.  Robust Self-Triggered Coordination With Ternary Controllers , 2012, IEEE Transactions on Automatic Control.

[21]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[22]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .