论文信息 - Cooperative Control and Potential Games

Cooperative Control and Potential Games

We present a view of cooperative control using the language of learning in games. We review the game-theoretic concepts of potential and weakly acyclic games, and demonstrate how several cooperative control problems, such as consensus and dynamic sensor coverage, can be formulated in these settings. Motivated by this connection, we build upon game-theoretic concepts to better accommodate a broader class of cooperative control problems. In particular, we extend existing learning algorithms to accommodate restricted action sets caused by the limitations of agent capabilities and group based decision making. Furthermore, we also introduce a new class of games called sometimes weakly acyclic games for time-varying objective functions and action sets, and provide distributed algorithms for convergence to an equilibrium.

Jason R. Marden | Jeff S. Shamma | Gürdal Arslan | Gürdal Arslan | J. Shamma

[1] Jason R. Marden,et al. Joint Strategy Fictitious Play with Inertia for Potential Games , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[2] Jason R. Marden,et al. Payoff-Based Dynamics for Multiplayer Weakly Acyclic Games , 2009, SIAM J. Control. Optim..

[3] Christos G. Cassandras,et al. Sensor Networks and Cooperative Control , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[4] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[5] R. Srikant,et al. Consensus with Quantized Information Updates , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[6] L. Shapley,et al. Potential Games , 1994 .

[7] L. Blume. The Statistical Mechanics of Strategic Interaction , 1993 .

[8] L. Moreau,et al. Stability of continuous-time distributed consensus algorithms , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[9] Richard M. Murray,et al. Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.

[10] L. Shapley,et al. Fictitious Play Property for Games with Identical Interests , 1996 .

[11] Richard M. Murray,et al. Recent Research in Cooperative Control of Multivehicle Systems , 2007 .

[12] Sean Luke,et al. Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[13] Stephen P. Boyd,et al. A scheme for robust distributed sensor fusion based on average consensus , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[14] John N. Tsitsiklis,et al. Distributed Asynchronous Deterministic and Stochastic Gradient Optimization Algorithms , 1984, 1984 American Control Conference.

[15] L. Blume,et al. POPULATION GAMES , 1995 .

[16] H. Peyton Young,et al. Strategic Learning and Its Limits , 2004 .

[17] Jason R. Marden,et al. Autonomous Vehicle-Target Assignment: A Game-Theoretical Formulation , 2007 .

[18] Wendi B. Heinzelman,et al. Application-specific protocol architectures for wireless networks , 2000 .

[19] L. Shapley,et al. REGULAR ARTICLEPotential Games , 1996 .

[20] H. Young. Individual Strategy and Social Structure , 2020 .

[21] Reza Olfati-Saber,et al. Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[22] D. Monderer,et al. Fictitious play and- no-cycling conditions , 1997 .

[23] W. Arthur,et al. The Economy as an Evolving Complex System II , 1988 .

[24] Jie Lin,et al. Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[25] Yoav Shoham,et al. Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[26] J.N. Tsitsiklis,et al. Convergence in Multiagent Coordination, Consensus, and Flocking , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[27] Marios M. Polycarpou,et al. Cooperative Control of Distributed Multi-Agent Systems , 2001 .

[28] Jason R. Marden,et al. Regret based dynamics: convergence in weakly acyclic games , 2007, AAMAS '07.

[29] Stephen P. Boyd,et al. Fast linear iterations for distributed averaging , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[30] Francesco Bullo,et al. Distributed Control of Robotic Networks , 2009 .

[31] Jason R. Marden,et al. Payoff based dynamics for multi-player weakly acyclic games , 2007, 2007 46th IEEE Conference on Decision and Control.