The classification game: combining supervised learning and language evolution

We study the emergence of shared representations in a population of agents engaged in a supervised classification task, using a model called the classification game. We connect languages with tasks by treating the agents’ classification hypothesis space as an information channel. We show that by learning through the classification game, agents can implicitly perform complexity regularisation, which improves generalisation. Improved generalisation also means that the languages that emerge are well adapted to the given task. The improved language-task fit springs from the interplay of two opposing forces: the dynamics of collective learning impose a preference for simple representations, while the intricacy of the classification task imposes a pressure towards representations that are more complex. The push–pull of these two forces results in the emergence of a shared representation that is simple but not too simple. Our agents use artificial neural networks to solve the classification tasks they face, and a simple counting algorithm to learn a language as a form-meaning mapping. We present several experiments to demonstrate that both compositional and holistic languages can emerge in our system. We also demonstrate that the agents avoid overfitting on noisy data, and can learn some very difficult tasks through interaction, which they are unable to learn individually. Further, when the agents use simple recurrent networks to solve temporal classification tasks, we see the emergence of a rudimentary grammar, which does not have to be explicitly learned.

[1]  Martin A. Nowak,et al.  The evolution of syntactic communication , 2000, Nature.

[2]  L. Steels,et al.  Crucial factors in the origins of word-meaning , 2000 .

[3]  Barak A. Pearlmutter,et al.  Chaitin-Kolmogorov Complexity and Generalization in Neural Networks , 1990, NIPS.

[4]  S. Mufwene Competition and Selection in Language Evolution , 2002 .

[5]  Luc Steels,et al.  The Origins of Ontologies and Communication Conventions in Multi-Agent Systems , 2004, Autonomous Agents and Multi-Agent Systems.

[6]  Michael Oliphant,et al.  Formal approaches to innate and learned communication : laying the foundation for language , 1998 .

[7]  Simon Kirby,et al.  From UG to universals: Linguistic adaptation through iterated learning , 2004 .

[8]  Tommi Kärkkäinen,et al.  Robust Formulations for Training Multilayer Perceptrons , 2004, Neural Computation.

[9]  R. Paget The Origin of Speech , 1927, Nature.

[10]  Angelo Cangelosi,et al.  The Emergence of a 'Language' in an Evolving Population of Neural Networks , 1998, Connect. Sci..

[11]  Leonard Pitt,et al.  On the necessity of Occam algorithms , 1990, STOC '90.

[12]  Kenny Smith,et al.  Cross-Situational Learning: A Mathematical Approach , 2006, EELC.

[13]  S. Kirby The Evolution of Meaning-Space Structure through Iterated Learning , 2007 .

[14]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[15]  Simon Kirby,et al.  Iterated Learning: A Framework for the Emergence of Language , 2003, Artificial Life.

[16]  Ted Briscoe Grammatical Assimilation , 2003 .

[17]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[18]  John Langford,et al.  PAC-MDL Bounds , 2003, COLT.

[19]  C. Lee Giles,et al.  Extraction of rules from discrete-time recurrent neural networks , 1996, Neural Networks.

[20]  R. Millikan Varieties of Meaning: The 2002 Jean Nicod Lectures , 2004 .

[21]  Angelo Cangelosi,et al.  The Adaptive Advantage of Symbolic Theft Over Sensorimotor Toil: Grounding Language in Perceptual Categories , 2001 .

[22]  Luc Steels,et al.  Bootstrapping grounded word semantics , 1999 .

[23]  Paul Vogt,et al.  On the Acquisition and Evolution of Compositional Languages: Sparse Input and the Productive Creativity of Children , 2005, Adapt. Behav..

[24]  Angelo Cangelosi,et al.  Emergence of Communication and Language , 2006 .

[25]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[26]  Ricard Solé,et al.  Language: Syntax for free? , 2005, Nature.

[27]  Paul Vogt,et al.  Investigating social interaction strategies for bootstrapping lexicon development , 2003, J. Artif. Soc. Soc. Simul..

[28]  Janet Wiles,et al.  Context-free and context-sensitive dynamics in recurrent neural networks , 2000, Connect. Sci..

[29]  Gábor Lugosi,et al.  Concept learning using complexity regularization , 1995, IEEE Trans. Inf. Theory.

[30]  Thomas L. Griffiths,et al.  A Bayesian View of Language Evolution by Iterated Learning - eScholarship , 2005 .

[31]  L. Gasser,et al.  The Role of Anticipation in the Emergence of Language , 2007, SAB ABiALS.

[32]  Henry Brighton,et al.  Compositionality, Linguistic Evolution, and Induction by Minimum Description Length , 2005 .

[33]  Angelo Cangelosi,et al.  Modeling the Evolution of Communication: From Stimulus Associations to Grounded Symbolic Associations , 1999, ECAL.

[34]  E. Markman,et al.  Children's use of mutual exclusivity to constrain the meanings of words , 1988, Cognitive Psychology.

[35]  L. Steels Perceptually grounded meaning creation , 1996 .

[36]  Bernhard Schölkopf,et al.  Learning Theory and Kernel Machines , 2003, Lecture Notes in Computer Science.

[37]  Frédéric Kaplan,et al.  Simple models of distributed co-ordination , 2005, Connect. Sci..

[38]  Andrew R. Barron,et al.  Complexity Regularization with Application to Artificial Neural Networks , 1991 .

[39]  Martin V. Butz,et al.  Anticipatory Behavior in Adaptive Learning Systems , 2003, Lecture Notes in Computer Science.

[40]  Temple F. Smith Occam's razor , 1980, Nature.

[41]  Jürgen Schmidhuber,et al.  Flat Minima , 1997, Neural Computation.

[42]  Geoffrey E. Hinton,et al.  Keeping Neural Networks Simple , 1993 .

[43]  Ming Li,et al.  Sharpening Occam's razor , 2002, Inf. Process. Lett..

[44]  Henry Brighton,et al.  Compositional Syntax From Cultural Transmission , 2002, Artificial Life.

[45]  Charles F. Hockett,et al.  A mathematical theory of communication , 1948, MOCO.

[46]  Roger Lass,et al.  On Explaining Language Change , 1980 .

[47]  M. V. Rossum,et al.  In Neural Computation , 2022 .

[48]  Angelo C. Loula,et al.  Language Evolution and Robotics: Issues on Symbol Grounding and Language Acquisition , 2006 .

[49]  Les Gasser,et al.  Noisy Preferential Attachment and Language Evolution , 2006, SAB.

[50]  Guido Boella,et al.  Normative framework for normative system change , 2009, AAMAS 2009.

[51]  Simon Kirby,et al.  Innateness and culture in the evolution of language , 2006, Proceedings of the National Academy of Sciences.