A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

Multi-agent reinforcement learning has recently shown great promise as an approach to networked system control. Arguably, one of the most difficult and important tasks for which large scale networked system control is applicable is common-pool resource management. Crucial common-pool resources include arable land, fresh water, wetlands, wildlife, fish stock, forests and the atmosphere, of which proper management is related to some of society's greatest challenges such as food security, inequality and climate change. Here we take inspiration from a recent research program investigating the game-theoretic incentives of humans in social dilemma situations such as the well-known tragedy of the commons. However, instead of focusing on biologically evolved human-like agents, our concern is rather to better understand the learning and operating behaviour of engineered networked systems comprising general-purpose reinforcement learning agents, subject only to nonbiological constraints such as memory, computation and communication bandwidth. Harnessing tools from empirical game-theoretic analysis, we analyse the differences in resulting solution concepts that stem from employing different information structures in the design of networked multi-agent systems. These information structures pertain to the type of information shared between agents as well as the employed communication protocol and network topology. Our analysis contributes new insights into the consequences associated with certain design choices and provides an additional dimension of comparison between systems beyond efficiency, robustness, scalability and mean control performance.

[1]  Tamer Basar,et al.  Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms , 2019, Handbook of Reinforcement Learning and Control.

[2]  Shimon Whiteson,et al.  Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning , 2017, ICML.

[3]  Elinor Ostrom,et al.  The Nature of Common-Pool Resource Problems , 1990 .

[4]  M. Macy,et al.  Learning dynamics in social dilemmas , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Daniel Arenas,et al.  From Raindrops to a Common Stream: Using the Social-Ecological Systems Framework for Research on Sustainable Water Management , 2018, Organization & Environment.

[6]  Shimon Whiteson,et al.  Learning with Opponent-Learning Awareness , 2017, AAMAS.

[7]  Nando de Freitas,et al.  Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning , 2018, ICML.

[8]  A. Roth,et al.  Game-Theoretic Models and the Role of Information in Bargaining , 1979 .

[9]  Joaquín Izquierdo,et al.  Multi-Agent Systems and Complex Networks: Review and Applications in Systems Engineering , 2020, Processes.

[10]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[11]  Elinor Ostrom,et al.  Crafting Institutions for Self-Governing Irrigation Systems , 1992 .

[12]  A. Sarker,et al.  Design principles in long-enduring institutions of Japanese irrigation common-pool resources , 2001 .

[13]  Sandeep Chinchali,et al.  Multi-agent Reinforcement Learning for Networked System Control , 2020, ICLR.

[14]  J. Neumann,et al.  Theory of games and economic behavior , 1945, 100 Years of Math Milestones.

[15]  Dylan Hadfield-Menell,et al.  Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors , 2020, AAMAS.

[16]  R. Gibbons,et al.  Cheap Talk Can Matter in Bargaining , 1988 .

[17]  Alexander Peysakhovich,et al.  Consequentialist conditional cooperation in social dilemmas with imperfect information , 2017, AAAI Workshops.

[18]  Joel Z. Leibo,et al.  Multi-agent Reinforcement Learning in Sequential Social Dilemmas , 2017, AAMAS.

[19]  Joshua B. Tenenbaum,et al.  Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction , 2016, CogSci.

[20]  G. Hardin,et al.  The Tragedy of the Commons , 1968, Green Planet Blues.

[21]  Michael P. Wellman Methods for Empirical Game-Theoretic Analysis , 2006, AAAI.

[22]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[23]  Tamer Basar,et al.  Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents , 2018, ICML.

[24]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[25]  T. Schelling Hockey Helmets, Concealed Weapons, and Daylight Saving , 1973 .

[26]  E. Rowland Theory of Games and Economic Behavior , 1946, Nature.

[27]  Alexander Peysakhovich,et al.  Maintaining cooperation in complex social dilemmas using deep reinforcement learning , 2017, ArXiv.

[28]  M. Jackson,et al.  Games on Networks , 2014 .

[29]  Elinor Ostrom,et al.  Design principles in long‐enduring irrigation institutions , 1993 .

[30]  Claudia Baldwin,et al.  Managing groundwater as a common-pool resource: an Australian case study , 2009 .

[31]  Joel Z. Leibo,et al.  Inequity aversion improves cooperation in intertemporal social dilemmas , 2018, NeurIPS.

[32]  L. Felkins The Social Dilemmas , 2015 .

[33]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[34]  E. Graffy,et al.  Left High and Dry? Climate Change, Common-Pool Resource Theory, and the Adaptability of Western Water Compacts , 2011 .

[35]  Igor Mordatch,et al.  Emergent Tool Use From Multi-Agent Autocurricula , 2019, ICLR.

[36]  Joel Z. Leibo,et al.  Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research , 2019, ArXiv.

[37]  Rob Fergus,et al.  Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[38]  Joel Z. Leibo,et al.  A multi-agent reinforcement learning model of common-pool resource appropriation , 2017, NIPS.

[39]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[40]  Olivier Compte Communication in repeated games with imperfect private monitoring , 1998 .

[41]  R. Myerson MULTISTAGE GAMES WITH COMMUNICATION , 1984 .

[42]  G. Brady Governing the Commons: The Evolution of Institutions for Collective Action , 1993 .

[43]  Alexander Peysakhovich,et al.  Towards AI that Can Solve Social Dilemmas , 2018, AAAI Spring Symposia.

[44]  Yasin Yilmaz,et al.  Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey , 2020, IEEE Transactions on Intelligent Transportation Systems.

[45]  Joel Z. Leibo,et al.  A Generalised Method for Empirical Game Theoretic Analysis , 2018, AAMAS.

[46]  G. Tesauro,et al.  Analyzing Complex Strategic Interactions in Multi-Agent Systems , 2002 .

[47]  Alexander Peysakhovich,et al.  Prosocial Learning Agents Solve Generalized Stag Hunts Better than Selfish Ones Extended Abstract , 2018 .