论文信息 - Optimistic Planning for Consensus

Optimistic Planning for Consensus

An important challenge in multiagent systems is consensus, in which the agents are required to synchronize certain controlled variables of interest, often using only an incomplete and time-varying communication graph. We propose a consensus approach based on optimistic planning (OP), a predictive control algorithm that finds near-optimal control actions for any nonlinear dynamics and reward (cost) function. At every step, each agent uses OP to solve a local control problem with rewards that express the consensus objectives. Neighboring agents coordinate by exchanging their predicted behaviors in a predefined order. Due to its generality, OP consensus can adapt to any agent dynamics and, by changing the reward function, to a variety of consensus objectives. OP consensus is demonstrated for velocity consensus (flocking) with a time-varying communication graph, where it preserves connectivity better than a classical algorithm; and for leaderless and leader-based consensus of robotic arms, where OP easily deals with the nonlinear dynamics.

Lucian Buşoniu | Constantin Morărescu

[1] George J. Pappas,et al. Flocking in Teams of Nonholonomic Agents , 2003 .

[2] L. Buşoniu,et al. A comprehensive survey of multi-agent reinforcement learning , 2011 .

[3] Michael L. Littman,et al. Sample-Based Planning for Continuous Action Markov Decision Processes , 2011, ICAPS.

[4] Nikos Vlassis,et al. A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence I Mobk077-fm Synthesis Lectures on Artificial Intelligence and Machine Learning a Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence a Concise Introduction to Multiagent Systems and D , 2007 .

[5] Alberto Bemporad,et al. Decentralized model predictive control , 2010 .

[6] Randal W. Beard,et al. Distributed Consensus in Multi-vehicle Cooperative Control - Theory and Applications , 2007, Communications and Control Engineering.

[7] Antoine Girard,et al. Sufficient conditions for flocking via graph robustness analysis , 2010, 49th IEEE Conference on Decision and Control (CDC).

[8] Rémi Munos,et al. Optimistic Planning of Deterministic Systems , 2008, EWRL.

[9] Guanrong Chen,et al. Adaptive second-order consensus of networked mobile agents with nonlinear dynamics , 2011, Autom..

[10] Guangfu Ma,et al. Distributed Coordinated Tracking With a Dynamic Leader for Multiple Euler-Lagrange Systems , 2011, IEEE Transactions on Automatic Control.

[11] Bart De Schutter,et al. Approximate dynamic programming with a fuzzy parameterization , 2010, Autom..