Internal and External Pressures on Language Emergence: Least Effort, Object Constancy and Frequency

In previous work, artificial agents were shown to achieve almost perfect accuracy in referential games where they have to communicate to identify images. Nevertheless, the resulting communication protocols rarely display salient features of natural languages, such as compositionality. In this paper, we propose some realistic sources of pressure on communication that avert this outcome. More specifically, we formalise the principle of least effort through an auxiliary objective. Moreover, we explore several game variants, inspired by the principle of object constancy, in which we alter the frequency, position, and luminosity of the objects in the images. We perform an extensive analysis on their effect through compositionality metrics, diagnostic classifiers, and zero-shot evaluation. Our findings reveal that the proposed sources of pressure result in emerging languages with less redundancy, more focus on high-level conceptual information, and better abilities of generalisation. Overall, our contributions reduce the gap between emergent and natural languages.

[1]  Pieter Abbeel,et al.  Emergence of Grounded Compositional Language in Multi-Agent Populations , 2017, AAAI.

[2]  Stephanie M. Stalinski,et al.  Journal of Experimental Psychology: Learning, Memory, and Cognition , 2012 .

[3]  W. Strange Evolution of language. , 1984, JAMA.

[4]  Jonathan D. Cohen,et al.  Perceptual Constancy , 2012 .

[5]  Tomas Mikolov,et al.  A Roadmap Towards Machine Intelligence , 2015, CICLing.

[6]  Jiliang Tang,et al.  A Survey on Dialogue Systems: Recent Advances and New Frontiers , 2017, SKDD.

[7]  S. Levinson Presumptive Meanings: The theory of generalized conversational implicature , 2001 .

[8]  Stephen Clark,et al.  Emergent Communication through Negotiation , 2018, ICLR.

[9]  G. Zipf,et al.  The Psycho-Biology of Language , 1936 .

[10]  付伶俐 打磨Using Language,倡导新理念 , 2014 .

[11]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[12]  William B Estes,et al.  Similarity , Frequency , and Category Representations , 1988 .

[13]  Dan Klein,et al.  Neural Module Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Stephen Clark,et al.  Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input , 2018, ICLR.

[15]  José M. F. Moura,et al.  Natural Language Does Not Emerge ‘Naturally’ in Multi-Agent Dialog , 2017, EMNLP.

[16]  Simon Kirby,et al.  Understanding Linguistic Evolution by Visualizing the Emergence of Topographic Mappings , 2006, Artificial Life.

[17]  Robert L. Goldstone,et al.  The development of features in object concepts , 1998, Behavioral and Brain Sciences.

[18]  Ivan Titov,et al.  Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols , 2017, NIPS.

[19]  K. Lorenz Behind the Mirror: A Search for a Natural History of Human Knowledge , 1973 .

[20]  Nikolaus Kriegeskorte,et al.  Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[21]  Amy Perfors,et al.  Cross-situational learning in a Zipfian environment , 2019, Cognition.

[22]  Erich von Holst Zur Verhaltensphysiologie bei Tieren und Menschen : gesammelte Abhandlungen , 1969 .

[23]  HupkesDieuwke,et al.  Visualisation and ‘diagnostic classifiers’ reveal how recurrent and recursive neural networks process hierarchical structure , 2018 .

[24]  Willem H. Zuidema,et al.  Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure , 2017, J. Artif. Intell. Res..

[25]  Douglas L. Medin,et al.  Context theory of classification learning. , 1978 .

[26]  Elia Bruni,et al.  The Grammar of Emergent Languages , 2020, EMNLP.

[27]  Marco Baroni,et al.  How agents see things: On visual representations in an emergent language game , 2018, EMNLP.

[28]  I. H. Fichte,et al.  Zeitschrift für Philosophie und philosophische Kritik , 2022 .

[29]  Simon Kirby,et al.  Natural Language From Artificial Life , 2002, Artificial Life.

[30]  Siobhan Chapman Logic and Conversation , 2005 .

[31]  Nando de Freitas,et al.  Compositional Obverter Communication Learning From Raw Visual Input , 2018, ICLR.

[32]  John Haiman,et al.  Iconic and Economic Motivation , 1983 .

[33]  G. Āllport The Psycho-Biology of Language. , 1936 .

[34]  Mathijs Mul,et al.  Compositionality Decomposed: How do Neural Networks Generalise? , 2019, J. Artif. Intell. Res..