Optimization Models of Natural Communication

Abstract A family of information theoretic models of communication was introduced more than a decade ago to explain the origins of Zipf’s law for word frequencies. The family is a based on a combination of two information theoretic principles: maximization of mutual information between forms and meanings and minimization of form entropy. The family also sheds light on the origins of three other patterns: the principle of contrast; a related vocabulary learning bias; and the meaning-frequency law. Here two important components of the family, namely the information theoretic principles and the energy function that combines them linearly, are reviewed from the perspective of psycholinguistics, language learning, information theory and synergetic linguistics. The minimization of this linear function is linked to the problem of compression of standard information theory and might be tuned by self-organization.

[1]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[2]  Daniel Polani,et al.  Origins of Scaling in Genetic Code , 2009, ECAL.

[3]  Ramon Ferrer-i-Cancho,et al.  Compression and the origins of Zipf's law for word frequencies , 2016, Complex..

[4]  Herbert Rubenstein,et al.  Homographic entries in the internal lexicon: Effects of systematicity and relative frequency of meanings , 1971 .

[5]  Vittorio Loreto,et al.  Semiotic dynamics and collaborative tagging , 2006, Proceedings of the National Academy of Sciences.

[6]  Germinal Cocho,et al.  Fitting Ranked Linguistic Data with Two-Parameter Functions , 2010, Entropy.

[7]  Xin Yao,et al.  Parallel Problem Solving from Nature PPSN VI , 2000, Lecture Notes in Computer Science.

[8]  M. L. Millis,et al.  The effect of polysemy on lexical decision time: Now you see it, now you don’t , 1989, Memory & cognition.

[9]  Eve V. Clark,et al.  The principle of contrast: A constraint on language acquisition. , 1987 .

[10]  Ramon Ferrer-i-Cancho,et al.  The origins of Zipf's meaning‐frequency law , 2017, J. Assoc. Inf. Sci. Technol..

[11]  G Kellas,et al.  Lexical ambiguity and the timecourse of attentional allocation in word recognition. , 1988, Journal of experimental psychology. Human perception and performance.

[12]  Ramon Ferrer-i-Cancho,et al.  The frequency spectrum of finite samples from the intermittent silence process , 2009, J. Assoc. Inf. Sci. Technol..

[13]  Ricard V. Solé,et al.  Emergence of Zipf's Law in the Evolution of Communication , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  N. Chater,et al.  Simplicity: a unifying principle in cognitive science? , 2003, Trends in Cognitive Sciences.

[15]  Junko Kanero,et al.  Six Principles of Language Development: Implications for Second Language Learners , 2014, Developmental neuropsychology.

[16]  S. Hanser,et al.  Using information theory to assess the diversity, complexity, and development of communicative repertoires. , 2002, Journal of comparative psychology.

[17]  Christopher T. Kello,et al.  Scaling laws in cognitive sciences , 2010, Trends in Cognitive Sciences.

[18]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[19]  Russell R. Barton,et al.  Institute for Operations Research and the Management Sciences , 2004, INFORMS J. Comput..

[20]  Ramon Ferrer-i-Cancho,et al.  The Infochemical Core , 2016, J. Quant. Linguistics.

[21]  Eduardo G. Altmann,et al.  Stochastic model for the vocabulary growth in natural languages , 2012, ArXiv.

[22]  Jorge Hidalgo,et al.  Information-based fitness and the emergence of criticality in living systems , 2013, Proceedings of the National Academy of Sciences.

[23]  H. Rubenstein,et al.  Homographic entries in the internal lexicon , 1970 .

[24]  Thomas M. Cover,et al.  Elements of information theory (2. ed.) , 2006 .

[25]  Yasushi Hino,et al.  Effects of Polysemy in Lexical Decision and Naming: An Alternative to Lexical Access Accounts , 1996 .

[26]  Danielle S. McNamara,et al.  The Development of Polysemy and Frequency Use in English Second Language Speakers , 2010 .

[27]  Harry Eugene Stanley,et al.  Languages cool as they expand: Allometric scaling and the decreasing need for new words , 2012, Scientific Reports.

[28]  Gertraud Fenk-Oczlon Familiarity, information flow, and linguistic form , 2001 .

[29]  Daniel Polani,et al.  Phase transitions in least-effort communications , 2010 .

[30]  Dante R Chialvo,et al.  Brain organization into resting state networks emerges at criticality on a model of the human connectome. , 2012, Physical review letters.

[31]  LAURANCE R. DOYLE,et al.  Quantitative tools for comparing animal communication systems: information theory applied to bottlenose dolphin whistle repertoires , 1999, Animal Behaviour.

[32]  Ricard V. Solé,et al.  Least effort and the origins of scaling in human language , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Gabriel Altmann,et al.  Hapax Legomena and Language Typology , 2008, J. Quant. Linguistics.

[34]  Steven T Piantadosi,et al.  Word lengths are optimized for efficient communication , 2011, Proceedings of the National Academy of Sciences.

[35]  E. Morin Introduction à la pensée complexe , 2005 .

[36]  A. Chao,et al.  Nonparametric estimation of Shannon’s index of diversity when there are unseen species in sample , 2004, Environmental and Ecological Statistics.

[37]  Gabriel Altmann,et al.  Quantitative Linguistik / Quantitative Linguistics: Ein internationales Handbuch , 2005 .

[38]  G. Zipf The meaning-frequency relationship of words. , 1945, The Journal of general psychology.

[39]  Ramon Ferrer-i-Cancho,et al.  When is Menzerath-Altmann law mathematically trivial? A new approach , 2014, Statistical applications in genetics and molecular biology.

[40]  Eve V. Clark,et al.  The Lexicon in Acquisition , 1996 .

[41]  Ramon Ferrer i Cancho,et al.  When language breaks into pieces. A conflict between communication through isolated signals and language. , 2006, Bio Systems.

[42]  Michael Cysouw,et al.  The Entropy of Words - Learnability and Expressivity across More than 1000 Languages , 2017, Entropy.

[43]  W J Levelt,et al.  Spoken word production: A theory of lexical access , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Ramon Ferrer-i-Cancho,et al.  Can simple models explain Zipf's law for all exponents? , 2005, Glottometrics.

[45]  S. Piantadosi Zipf’s word frequency law in natural language: A critical review and future directions , 2014, Psychonomic Bulletin & Review.

[46]  Vittorio Loreto,et al.  Collaborative Tagging and Semiotic Dynamics , 2006, ArXiv.

[47]  Anton Benz,et al.  Bidirectional Optimality Theory: An introduction , 2011 .

[48]  Christopher T. Kello,et al.  Scale-Free Networks in Phonological and Orthographic Wordform Lexicons , 2007 .

[49]  Ramon Ferrer-i-Cancho,et al.  The optimality of attaching unlinked labels to unlinked meanings , 2013, Glottometrics.

[50]  E. Markman,et al.  Children's use of mutual exclusivity to constrain the meanings of words , 1988, Cognitive Psychology.

[51]  R. Ferrer-i-Cancho,et al.  The Evolution of the Exponent of Zipf's Law in Language Ontogeny , 2013, PloS one.

[52]  Jonathon Shlens,et al.  Notes on Kullback-Leibler Divergence and Likelihood , 2014, ArXiv.

[53]  M. Peruggia Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach (2nd ed.) , 2003 .

[54]  W. Marslen-Wilson,et al.  Making Sense of Semantic Ambiguity: Semantic Competition in Lexical Access , 2002 .

[55]  W. Levelt,et al.  Word frequency effects in speech production: Retrieval of syntactic information and of phonological form , 1994 .

[56]  Maria Dolores Jiménez-López,et al.  Genetic Code and Verbal Language: Syntactic and Semantic Analogies , 2011, Biology, Computation and Linguistics.

[57]  F. Chung,et al.  Generalizations of Polya's urn Problem , 2003 .

[58]  Ramon Ferrer-i-Cancho,et al.  Random Texts Do Not Exhibit the Real Zipf's Law-Like Rank Distribution , 2010, PloS one.

[59]  R. Mccall,et al.  The Genetic and Environmental Origins of Learning Abilities and Disabilities in the Early School , 2007, Monographs of the Society for Research in Child Development.

[60]  R. Harald Baayen,et al.  Semantic Density and Past-Tense Formation in Three Germanic Languages , 2005 .

[61]  Shari R. Baum,et al.  Disambiguating the ambiguity advantage effect in word recognition: An advantage for polysemous but not homonymous words , 2007, Journal of Neurolinguistics.

[62]  Ramon Ferrer-i-Cancho,et al.  The span of correlations in dolphin whistle sequences , 2012, 1205.0321.

[63]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[64]  Christopher T. Kello,et al.  Critical branching neural networks. , 2013, Psychological review.

[65]  Paula Buttery,et al.  Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms , 2015, PloS one.

[66]  R. Ferrer i Cancho,et al.  The variation of Zipf's law in human language , 2005 .

[67]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[68]  G. A. Miller,et al.  Finitary models of language users , 1963 .

[69]  Walter Willinger,et al.  More "normal" than normal: scaling distributions and complex systems , 2004, Proceedings of the 2004 Winter Simulation Conference, 2004..

[70]  J. Leeuw Nonlinear Regression with R , 2008 .

[71]  Kara D. Federmeier,et al.  Timed picture naming in seven languages , 2003, Psychonomic bulletin & review.

[72]  Jeanette A. Thomas,et al.  Sensory Abilities of Cetaceans , 1990, NATO ASI Series.

[73]  Paula Buttery,et al.  Zipf's law and the grammar of languages: A quantitative study of Old and Modern English parallel texts , 2014 .

[74]  E. Wagenmakers,et al.  AIC model selection using Akaike weights , 2004, Psychonomic bulletin & review.

[75]  REINHARD KÖHLER,et al.  SYSTEM THEORETICAL LINGUISTICS , 1987 .

[76]  B. Karaoglan,et al.  Investigation of Zipf’s ‘law-of-meaning’ on Turkish corpora , 2007, 2007 22nd international symposium on computer and information sciences.

[77]  Vera M. Ostrovskaya,et al.  Organization of Communication System in Tursiops Truncatus Montagu , 1990 .

[78]  Mario Bunge,et al.  La ciencia: su método y su filosofía , 1981 .

[79]  W. Merriman,et al.  The mutual exclusivity bias in children's word learning. , 1989, Monographs of the Society for Research in Child Development.

[80]  Ramon Ferrer-i-Cancho,et al.  A Law of Word Meaning in Dolphin Whistle Types , 2009, Entropy.

[81]  Ramon Ferrer-i-Cancho,et al.  The polysemy of the words that children learn over time , 2016, Interaction Studies.

[82]  Darryl Stewart,et al.  Zipf and Type-Token rules for the English, Spanish, Irish and Latin languages , 2006 .

[83]  Ramon Ferrer i Cancho,et al.  Decoding least effort and scaling in signal frequency distributions , 2005 .

[84]  R. Mishra Pierre-Yves Oudeyer Self-Organization in the Evolution of Speech , 2011, Cognitive Systems Research.

[85]  Jaume Baixeries i Juvillà,et al.  THE EVOLUTION OF POLYSEMY IN CHILD LANGUAGE , 2014 .

[86]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[87]  R. Ferrer i Cancho,et al.  Zipf's law from a communicative phase transition , 2005 .

[88]  Anton Benz,et al.  Bidirectional Optimality Theory , 2011 .

[89]  M. Tanenhaus,et al.  Time Course of Frequency Effects in Spoken-Word Recognition: Evidence from Eye Movements , 2001, Cognitive Psychology.

[90]  David Lusseau,et al.  Compression as a Universal Principle of Animal Behavior , 2013, Cogn. Sci..

[91]  Jean Christophe Verstraeh Frequency and the emergence of linguistic structure , 2005 .

[92]  Ramon Ferrer-i-Cancho,et al.  The meaning-frequency law in Zipfian optimization models of communication , 2014, Glottometrics.

[93]  R. F. Cancho,et al.  The global minima of the communicative energy of natural communication systems , 2007 .

[94]  Gemma Boleda,et al.  Zipf’s Law for Word Frequencies: Word Forms versus Lemmas in Long Texts , 2014, PloS one.

[95]  Luc Steels,et al.  Language as a Complex Adaptive System , 2000, PPSN.