Open Problems in Cooperative AI

Problems of cooperation, in which agents seek ways to jointly improve their welfare, are ubiquitous and important. They can be found at scales ranging from our daily routines (such as driving on highways, scheduling meetings, and working collaboratively) to our global challenges (such as peace, commerce, and pandemic preparedness). Arguably, the success of the human species is rooted in our ability to cooperate. Since machines powered by artificial intelligence are playing an ever greater role in our lives, it will be important to equip them with the capabilities necessary to cooperate and to foster cooperation. We see an opportunity for the field of artificial intelligence to explicitly focus effort on this class of problems, which we term Cooperative AI. The objective of this research would be to study the many aspects of the problems of cooperation and to innovate in AI to contribute to solving these problems. Central goals include building machine agents with the capabilities needed for cooperation, building tools to foster cooperation in populations of (machine and/or human) agents, and otherwise conducting AI research for insight relevant to problems of cooperation. This research integrates ongoing work on multi-agent systems, game theory and social choice, human-machine interaction and alignment, natural-language processing, and the construction of social tools and platforms. However, Cooperative AI is not the union of these existing areas, but rather an independent bet about the productivity of specific kinds of conversations that involve these and other areas. We see an opportunity to focus more explicitly on the problem of cooperation, to construct unified theory and vocabulary, and to build bridges with adjacent communities working on cooperation, including in the natural, social, and behavioural sciences.


[300]  Stuart J. Russell,et al.  Research Priorities for Robust and Beneficial Artificial Intelligence , 2015, AI Mag..

[301]  J. Henrich The Secret of Our Success: How Culture Is Driving Human Evolution, Domesticating Our Species, and Making Us Smarter , 2015 .

[302]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[303]  S. Goldin-Meadow,et al.  The influence of communication mode on written language processing and beyond , 2015, Behavioral and Brain Sciences.

[304]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[305]  Shimon Whiteson,et al.  Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[306]  Malte Risto,et al.  The social behavior of autonomous vehicles , 2016, UbiComp Adjunct.

[307]  David C. Parkes,et al.  Automated Mechanism Design without Money via Machine Learning , 2016, IJCAI.

[308]  Anca D. Dragan,et al.  Cooperative Inverse Reinforcement Learning , 2016, NIPS.

[309]  Anca D. Dragan,et al.  Information gathering actions over human internal state , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[310]  Stacy Marsella,et al.  People Do Not Feel Guilty About Exploiting Machines , 2016, ACM Trans. Comput. Hum. Interact..

[311]  Rob Fergus,et al.  Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[312]  Alejandro Lee-Penagos Learning to Coordinate: Co-Evolution and Correlated Equilibrium , 2016 .

[313]  Ariel D. Procaccia,et al.  The Unreasonable Fairness of Maximum Nash Welfare , 2016, EC.

[314]  Prateek Saxena,et al.  Making Smart Contracts Smarter , 2016, IACR Cryptol. ePrint Arch..

[315]  Sven Strauss,et al.  The Logic Of Images In International Relations , 2016 .

[316]  Elaine Shi,et al.  Hawk: The Blockchain Model of Cryptography and Privacy-Preserving Smart Contracts , 2016, 2016 IEEE Symposium on Security and Privacy (SP).

[317]  P. R. Coelho,et al.  The evolution of human cooperation , 2016 .

[318]  Michael Devetsikiotis,et al.  Blockchains and Smart Contracts for the Internet of Things , 2016, IEEE Access.

[319]  Christopher K. Frantz,et al.  From Institutions to Code: Towards Automated Generation of Smart Contracts , 2016, 2016 IEEE 1st International Workshops on Foundations and Applications of Self* Systems (FAS*W).

[320]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[321]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[322]  John Schulman,et al.  Concrete Problems in AI Safety , 2016, ArXiv.

[323]  Haris Aziz,et al.  A Discrete and Bounded Envy-Free Cake Cutting Protocol for Any Number of Agents , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[324]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[325]  Junmo Kim,et al.  A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[326]  Vijay Menon,et al.  Computational aspects of strategic behaviour in elections with top-truncated ballots , 2017, Autonomous Agents and Multi-Agent Systems.

[327]  W. Przepiorka,et al.  Order without law: Reputation promotes cooperation in a cryptomarket for illegal drugs , 2017 .

[328]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[329]  Payman Mohassel,et al.  SecureML: A System for Scalable Privacy-Preserving Machine Learning , 2017, 2017 IEEE Symposium on Security and Privacy (SP).

[330]  José M. F. Moura,et al.  Natural Language Does Not Emerge ‘Naturally’ in Multi-Agent Dialog , 2017, EMNLP.

[331]  Demis Hassabis,et al.  Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm , 2017, ArXiv.

[332]  Shane Legg,et al.  Deep Reinforcement Learning from Human Preferences , 2017, NIPS.

[333]  Pingzhong Tang,et al.  Reinforcement mechanism design , 2017, IJCAI.

[334]  Nicholas A. Christakis,et al.  Locally noisy autonomous agents improve global human coordination in network experiments , 2017, Nature.

[335]  Rachna,et al.  Sapiens: A brief history of humankind , 2017 .

[336]  K. Laland The origins of language in teaching , 2016, Psychonomic bulletin & review.

[337]  Yann Dauphin,et al.  Deal or No Deal? End-to-End Learning of Negotiation Dialogues , 2017, EMNLP.

[338]  Nan Jiang,et al.  Repeated Inverse Reinforcement Learning , 2017, NIPS.

[339]  Massimo Bartoletti,et al.  Financial Cryptography and Data Security , 2017, Lecture Notes in Computer Science.

[340]  Iyad Rahwan,et al.  Cooperating with machines , 2017, Nature Communications.

[341]  Disarmament Games , 2017, AAAI.

[342]  Kevin Waugh,et al.  DeepStack: Expert-level artificial intelligence in heads-up no-limit poker , 2017, Science.

[343]  Jeffrey M. Bradshaw,et al.  Human–Agent Interaction , 2017 .

[344]  Ivan Titov,et al.  Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols , 2017, NIPS.

[345]  M. Pereda,et al.  The emergence of altruism as a social norm , 2017, Scientific Reports.

[346]  Scott S. Hughes,et al.  Extravehicular activity operations concepts under communication latency and bandwidth constraints , 2017, 2017 IEEE Aerospace Conference.

[347]  Trevor J. M. Bench-Capon,et al.  Norms and value based reasoning: justifying compliance and violation , 2017, Artificial Intelligence and Law.

[348]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[349]  Joel Z. Leibo,et al.  Multi-agent Reinforcement Learning in Sequential Social Dilemmas , 2017, AAMAS.

[350]  Felipe Leno da Silva,et al.  Simultaneously Learning and Advising in Multiagent Reinforcement Learning , 2017, AAMAS.

[351]  Alexander Peysakhovich,et al.  Multi-Agent Cooperation and the Emergence of (Natural) Language , 2016, ICLR.

[352]  Mohamed Othman,et al.  Machine-to-Machine Communication: An Overview of Opportunities , 2018, Comput. Networks.

[353]  Stefan Riezler,et al.  Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning , 2018, ACL.

[354]  Shimon Whiteson,et al.  Learning with Opponent-Learning Awareness , 2017, AAMAS.

[355]  Arvind Satyanarayan,et al.  The Building Blocks of Interpretability , 2018 .

[356]  Meng Liu,et al.  Does Machine Translation Affect International Trade? Evidence from a Large Digital Platform , 2018, Manag. Sci..

[357]  Shane Legg,et al.  Reward learning from human preferences and demonstrations in Atari , 2018, NeurIPS.

[358]  Ghislain Fourny,et al.  Perfect Prediction Equilibrium , 2014, The Individual and the Other in Economic Thought.

[359]  Nando de Freitas,et al.  Intrinsic Social Motivation via Causal Influence in Multi-Agent RL , 2018, ArXiv.

[360]  Sigifredo Laengle,et al.  Twenty-Five Years of Group Decision and Negotiation: A Bibliometric Overview , 2018, Group Decision and Negotiation.

[361]  Manuel Bohn,et al.  Common Ground and Development , 2018 .

[362]  Kira Goldner,et al.  Mechanism design for social good , 2018, SIGAI.

[363]  Pieter Abbeel,et al.  Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments , 2017, ICLR.

[364]  Xiwei Xu,et al.  On legal contracts, imperative and declarative smart contracts, and blockchain systems , 2018, Artificial Intelligence and Law.

[365]  Stephen Clark,et al.  Emergent Communication through Negotiation , 2018, ICLR.

[366]  Peter Stone,et al.  Autonomous agents modelling other agents: A comprehensive survey and open problems , 2017, Artif. Intell..

[367]  Joel Z. Leibo,et al.  Inequity aversion resolves intertemporal social dilemmas , 2018, ArXiv.

[368]  F. Warneken,et al.  How Children Solve the Two Challenges of Cooperation , 2018, Annual review of psychology.

[369]  Uwe Zdun,et al.  Design Patterns for Smart Contracts in the Ethereum Ecosystem , 2018, 2018 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[370]  Emilio Calvano,et al.  Artificial Intelligence, Algorithmic Pricing and Collusion , 2018, American Economic Review.

[371]  W. Fitch,et al.  The Biology and Evolution of Speech: A Comparative Analysis , 2018 .

[372]  L. Cong,et al.  Blockchain Disruption and Smart Contracts , 2018, The Review of Financial Studies.

[373]  Stephen Clark,et al.  Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input , 2018, ICLR.

[374]  Gang Qu,et al.  BARS: A Blockchain-Based Anonymous Reputation System for Trust Management in VANETs , 2018, 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/ 12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE).

[375]  Shane Legg,et al.  Scalable agent alignment via reward modeling: a research direction , 2018, ArXiv.

[376]  Stuart Armstrong,et al.  Occam's razor is insufficient to infer the preferences of irrational agents , 2017, NeurIPS.

[377]  Katerina Fragkiadaki,et al.  Reward Learning from Narrated Demonstrations , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[378]  Sergey Levine,et al.  Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations , 2017, Robotics: Science and Systems.

[379]  Andrew Zisserman,et al.  Kickstarting Deep Reinforcement Learning , 2018, ArXiv.

[380]  Demis Hassabis,et al.  A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.

[381]  Mirco Musolesi,et al.  Understanding The Impact of Partner Choice on Cooperation and Social Norms by means of Multi-agent Reinforcement Learning , 2019, ArXiv.

[382]  Joelle Pineau,et al.  No Press Diplomacy: Modeling Multi-Agent Gameplay , 2019, NeurIPS.

[383]  Ming Zhou,et al.  HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization , 2019, ACL.

[384]  Joel Z. Leibo,et al.  Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research , 2019, ArXiv.

[385]  Zhiyong Wu,et al.  A Review of Deep Learning Based Speech Synthesis , 2019, Applied Sciences.

[386]  Khaled Shaalan,et al.  Speech Recognition Using Deep Neural Networks: A Systematic Review , 2019, IEEE Access.

[387]  Eugene Kharitonov,et al.  Anti-efficient encoding in emergent communication , 2019, NeurIPS.

[388]  Noam Brown,et al.  Superhuman AI for multiplayer poker , 2019, Science.

[389]  Prabhat Nagarajan,et al.  Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations , 2019, ICML.

[390]  Wojciech M. Czarnecki,et al.  Grandmaster level in StarCraft II using multi-agent reinforcement learning , 2019, Nature.

[391]  Pushmeet Kohli,et al.  Learning to Understand Goal Specifications by Modelling Reward , 2018, ICLR.

[392]  Shimon Whiteson,et al.  Multi-Agent Common Knowledge Reinforcement Learning , 2018, NeurIPS.

[393]  H. Francis Song,et al.  Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning , 2018, ICML.

[394]  Stuart Russell Human Compatible: Artificial Intelligence and the Problem of Control , 2019 .

[395]  Andrew Critch,et al.  A PARAMETRIC, RESOURCE-BOUNDED GENERALIZATION OF LÖB’S THEOREM, AND A ROBUST COOPERATION CRITERION FOR OPEN-SOURCE GAME THEORY , 2019, The Journal of Symbolic Logic.

[396]  Tom Eccles,et al.  Biases for Emergent Communication in Multi-agent Reinforcement Learning , 2019, NeurIPS.

[397]  Rebecca E. Webber Gaudiosi,et al.  Negotiating at the United Nations , 2019 .

[398]  Thore Graepel,et al.  A Neural Architecture for Designing Truthful and Efficient Auctions , 2019, ArXiv.

[399]  Jakub W. Pachocki,et al.  Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.

[400]  Erol Akçay,et al.  Evolution of social norms and correlated equilibria , 2019, Proceedings of the National Academy of Sciences.

[401]  Paul Dütting,et al.  Optimal auctions through deep learning , 2017, ICML.

[402]  Joshua B. Tenenbaum,et al.  Finding Friend and Foe in Multi-Agent Games , 2019, NeurIPS.

[403]  Jonathan P. How,et al.  Learning to Teach in Cooperative Multiagent Reinforcement Learning , 2018, AAAI.

[404]  Marina De Vos,et al.  Norm emergence in multiagent systems: a viewpoint paper , 2019, Autonomous Agents and Multi-Agent Systems.

[405]  C. Apicella,et al.  The evolution of human cooperation , 2019, Current Biology.

[406]  Pushmeet Kohli,et al.  Adversarial Robustness through Local Linearization , 2019, NeurIPS.

[407]  Nando de Freitas,et al.  Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning , 2018, ICML.

[408]  Michael Cormier,et al.  Trusted AI and the Contribution of Trust Modeling in Multiagent Systems , 2019, AAMAS.

[409]  Guy Lever,et al.  Human-level performance in 3D multiplayer games with population-based reinforcement learning , 2018, Science.

[410]  Anca D. Dragan,et al.  On the Utility of Learning about Humans for Human-AI Coordination , 2019, NeurIPS.

[411]  Lina Yao,et al.  A Survey on Deep Learning based Brain Computer Interface: Recent Advances and New Frontiers , 2019, ArXiv.

[412]  Olivier Pietquin,et al.  Observational Learning by Reinforcement Learning , 2017, AAMAS.

[413]  Daniel E. O'Leary,et al.  GOOGLE'S Duplex: Pretending to be human , 2019, Intell. Syst. Account. Finance Manag..

[414]  Shimon Whiteson,et al.  Stable Opponent Shaping in Differentiable Games , 2018, ICLR.

[415]  Ryan J. Lowe,et al.  Learning to summarize from human feedback , 2020, NeurIPS 2020.

[416]  Elisa Bertino,et al.  Artificial Intelligence & Cooperation , 2020, ArXiv.

[417]  Baiming Chen,et al.  Delay-Aware Multi-Agent Reinforcement Learning , 2020, ArXiv.

[418]  Bowen Baker,et al.  Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences , 2020, NeurIPS.

[419]  Kate Larson,et al.  Testing Axioms Against Human Reward Divisions in Cooperative Games , 2020, AAMAS.

[420]  Chien-Chung Shen,et al.  Proof-of-Event Recording System for Autonomous Vehicles: A Blockchain-Based Solution , 2020, IEEE Access.

[421]  Jakob N. Foerster,et al.  "Other-Play" for Zero-Shot Coordination , 2020, ICML.

[422]  Mirco Musolesi,et al.  Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning , 2019, AAAI.

[423]  Hong Jun Jeon,et al.  Reward-rational (implicit) choice: A unifying formalism for reward learning , 2020, NeurIPS.

[424]  David C. Parkes,et al.  The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies , 2020, ArXiv.

[425]  Igor Mordatch,et al.  Emergent Tool Use From Multi-Agent Autocurricula , 2019, ICLR.

[426]  Terrence J Sejnowski,et al.  The unreasonable effectiveness of deep learning in artificial intelligence , 2020, Proceedings of the National Academy of Sciences.

[427]  R. Powell In the Shadow of Power: States and Strategies in International Politics , 2020 .

[428]  Karol Hausman,et al.  Learning to Interactively Learn and Assist , 2019, AAAI.

[429]  Mehrdad Farajtabar,et al.  Learning to Incentivize Other Learning Agents , 2020, NeurIPS.

[430]  Bo Liu,et al.  Towards Playing Full MOBA Games with Deep Reinforcement Learning , 2020, NeurIPS.

[431]  Stuart J. Russell,et al.  Benefits of Assistance over Reward Learning , 2020 .

[432]  H. Francis Song,et al.  The Hanabi Challenge: A New Frontier for AI Research , 2019, Artif. Intell..

[433]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[434]  Jakob N. Foerster,et al.  Improving Policies via Search in Cooperative Partially Observable Games , 2019, AAAI.

[435]  Tom Eccles,et al.  Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games , 2020, AAMAS.

[436]  Doina Precup,et al.  Gifting in Multi-Agent Reinforcement Learning (Student Abstract) , 2020, AAAI.

[437]  Yoram Bachrach,et al.  Learning to Play No-Press Diplomacy with Best Response Policy Iteration , 2020, NeurIPS.

[438]  Zhen Xiao,et al.  Learning Agent Communication under Limited Bandwidth by Message Pruning , 2019, AAAI.

[439]  Brian Scassellati,et al.  Vulnerable robots positively shape human conversational dynamics in a human–robot team , 2020, Proceedings of the National Academy of Sciences.

[440]  S. Levine,et al.  Learning Social Learning , 2020 .

[441]  Andrew Critch,et al.  AI Research Considerations for Human Existential Safety (ARCHES) , 2020, ArXiv.

[442]  Dylan Hadfield-Menell,et al.  Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors , 2020, AAMAS.

[443]  Steven Weber,et al.  The 2020s political economy of machine translation , 2020, Business and Politics.

[444]  Taxonomy and definitions for terms related to driving automation systems for on-road motor vehicles , 2022 .

[445]  A. Stierle,et al.  Designing Collective Behavior in a Termite-Inspired Robot Construction Team , 2022 .