On Voting Strategies and Emergent Communication

Humans use language not only to refer to physical entities but also to collectively devise and execute complex strategies. While existing approaches study the emergence of language in settings where it mainly acts as a referential tool, in this paper we study the role of emergent languages in discovering and implementing strategies in a multi-agent setting. The agents in our setup are connected via a network and are allowed to exchange messages in the form of sequences of discrete symbols. We formulate the problem as a voting game, in which two candidate agents contest an election and aim to convince the population members (the other agents) in the network to vote for them by sending them messages. We parameterize the policies followed by the agents with neural networks. We investigate the effect of different training objectives and strategies on the agents' behavior and make observations about the emergent language in each case. To the best of our knowledge, this is the first work to explore the emergence of language for discovering and implementing strategies in a setting where agents are connected via an underlying network.
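The abstract gives no pseudocode, so the following is only a minimal toy sketch of the voting-game setup it describes: two candidates broadcast messages built from discrete symbols, and each population member casts a vote based on the message it receives. All names (`voter_policy`, `run_election`), the symbol-overlap scoring rule, and the hand-coded (rather than learned) policies are illustrative assumptions; the underlying network structure and the neural-network training are omitted.

```python
import random

random.seed(0)

VOCAB_SIZE = 10   # number of discrete symbols (assumed)
MSG_LEN = 3       # message length (assumed)
NUM_VOTERS = 20   # population size (assumed)

def random_message():
    """A candidate's message: a fixed-length sequence of discrete symbols."""
    return tuple(random.randrange(VOCAB_SIZE) for _ in range(MSG_LEN))

def voter_policy(preference, msg_a, msg_b):
    """Toy voter: votes for the candidate whose message shares more
    symbols with the voter's preferred set (a stand-in for a learned
    neural policy)."""
    score_a = sum(s in preference for s in msg_a)
    score_b = sum(s in preference for s in msg_b)
    if score_a == score_b:
        return random.choice(("A", "B"))
    return "A" if score_a > score_b else "B"

def run_election():
    """One round: sample voter preferences, let both candidates send a
    message, and tally the votes."""
    prefs = [set(random.sample(range(VOCAB_SIZE), 3))
             for _ in range(NUM_VOTERS)]
    msg_a, msg_b = random_message(), random_message()
    votes = [voter_policy(p, msg_a, msg_b) for p in prefs]
    return votes.count("A"), votes.count("B")

votes_a, votes_b = run_election()
print(votes_a + votes_b)  # every voter casts exactly one vote
```

In the paper's actual setting the candidates' message policies would be trained (e.g., with policy-gradient methods) to maximize their vote share, and messages would propagate over the network rather than being broadcast directly to every voter.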
