Entropy rate as a measure of animal vocal complexity

Vocal complexity is an important concept for investigating the role and evolution of animal communication and sociality. However, no one definition of ‘complexity’ appears to be appropriate for all uses. Repertoire size has been used to quantify complexity in many bird and some mammalian studies, but is impractical in cases where vocalizations are highly diverse, and repertoire size is essentially non-limited at realistic sample sizes. Some researchers have used information-theoretic measures such as Shannon entropy, to describe vocal complexity, but these techniques are descriptive only, as they do not address hypotheses of the cognitive mechanisms behind vocal signal generation. In addition, it can be shown that simple measures of entropy, in particular, do not capture syntactic structure. In this work, I demonstrate the use of an alternative information-theoretic measure, the Markov entropy rate, which quantifies the diversity of transitions in a vocal sequence, and thus is capable of distinguishing sequences with syntactic structure from those generated by random, statistically independent processes. I use artificial sequences generated from different stochastic mechanisms, as well as real data from the vocalizations of the rock hyrax Procavia capensis, to show how different complexity metrics scale differently with sample size. I show that entropy rate provides a good measure of complexity for Markov processes and converges faster than repertoire size estimates, such as the Lempel–Ziv metric. The commonly used Shannon entropy performs poorly in quantifying complexity.

[1]  T. Ord,et al.  Is sociality required for the evolution of communicative complexity? Evidence weighed against alternative hypotheses in diverse taxonomic groups , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[2]  J. Vielliard,et al.  Using Shannon entropy on measuring the individual variability in the Rufous-bellied thrush Turdus rufiventris vocal communication. , 2000, Journal of theoretical biology.

[3]  Todd M Freeberg,et al.  Information theoretical approaches to chick-a-dee calls of Carolina chickadees (Poecile carolinensis). , 2012, Journal of comparative psychology.

[4]  M. Ryan,et al.  Redefining animal signaling: influence versus information in communication , 2010 .

[5]  Danielle Cholewiak,et al.  Humpback whale song hierarchical structure: Historical context and discussion of current classification issues , 2013 .

[6]  Analysis of the Temporal Structure of Primate Communication , 1979 .

[7]  Robin I. M. Dunbar,et al.  Social complexity as a proximate and ultimate factor in communicative complexity , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[8]  Wojbor A. Woyczyński,et al.  Introductory Statistics and Random Phenomena: Uncertainty, Complexity and Chaotic Behavior in Engineering and Science , 1998 .

[9]  Brenda McCowan,et al.  Applicability of Information Theory to the Quantification of Responses to Anthropogenic Noise by Southeast Alaskan Humpback Whales , 2008, Entropy.

[10]  Wentian Li,et al.  On the Relationship between Complexity and Entropy for Markov Chains and Regular Languages , 1991, Complex Syst..

[11]  Dezhe Z. Jin,et al.  A Compact Statistical Model of the Song Syntax in Bengalese Finch , 2010, PLoS Comput. Biol..

[12]  Faruk Göloglu,et al.  On Lempel-Ziv Complexity of Sequences , 2006, SETA.

[13]  Eduardo Mercado,et al.  Recurring patterns in the songs of humpback whales (Megaptera novaeangliae) , 2011, Behavioural Processes.

[14]  Randolph D. Nelson,et al.  Probability, stochastic processes, and queueing theory - the mathematics of computer performance modeling , 1995 .

[15]  Korbinian Strimmer,et al.  Entropy Inference and the James-Stein Estimator, with Application to Nonlinear Gene Association Networks , 2008, J. Mach. Learn. Res..

[16]  Simon Johnston,et al.  Information theory, animal communication, and the search for extraterrestrial intelligence , 2011 .

[17]  K. Zuberbühler,et al.  The Syntax and Meaning of Wild Gibbon Songs , 2006, PloS one.

[18]  Ricard V. Solé,et al.  Least effort and the origins of scaling in human language , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Ramon Ferrer-i-Cancho,et al.  The span of correlations in dolphin whistle sequences , 2012, 1205.0321.

[20]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[21]  G. Ruxton,et al.  Resolving current disagreements and ambiguities in the terminology of animal communication , 2011, Journal of evolutionary biology.

[22]  Ramon Ferrer-i-Cancho,et al.  A Law of Word Meaning in Dolphin Whistle Types , 2009, Entropy.

[23]  Stephanie Seneff,et al.  Comparing call-based versus subunit-based methods for categorizing Norwegian killer whale, Orcinus orca, vocalizations , 2011, Animal Behaviour.

[24]  R. Berwick,et al.  Songs to syntax: the linguistics of birdsong , 2011, Trends in Cognitive Sciences.

[25]  Haikady N. Nagaraja,et al.  Inference in Hidden Markov Models , 2006, Technometrics.

[26]  Ryuji Suzuki,et al.  Information entropy of humpback whale songs. , 1999, The Journal of the Acoustical Society of America.

[27]  LAURANCE R. DOYLE,et al.  Quantitative tools for comparing animal communication systems: information theory applied to bottlenose dolphin whistle repertoires , 1999, Animal Behaviour.

[28]  J. M. Jenkins,et al.  The appropriate use of Zipf's law in animal communication studies , 2005, Animal Behaviour.

[29]  V. Cane Behaviour Sequences as Semi‐Markov Chains , 1959 .

[30]  Randolph Nelson,et al.  Probability, Stochastic Processes, and Queueing Theory , 1995 .

[31]  P. Slater,et al.  SONG ORGANISATION AND SINGING PATTERNS OF THE WILLOW WARBLER, PHYLLOSCOPUS TROCHILUS , 2000 .

[32]  Masakazu Konishi,et al.  Decrystallization of adult birdsong by perturbation of auditory feedback , 1999, Nature.

[33]  Kimberly A. Pollard,et al.  Evolving communicative complexity: insights from rodents and beyond , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[34]  George D. Pollak,et al.  Versatility and Stereotypy of Free-Tailed Bat Songs , 2009, PloS one.

[35]  P. Slater,et al.  Bird Song: Biological Themes and Variations , 1995 .

[36]  Stephen Handel,et al.  Hierarchical and rhythmic organization in the songs of humpback whales (Megaptera novaeangliae) , 2012 .

[37]  Eric Moulines,et al.  Inference in hidden Markov models , 2010, Springer series in statistics.

[38]  S. H. Hulse,et al.  Perceptual mechanisms for individual vocal recognition in European starlings,Sturnus vulgaris , 1998, Animal Behaviour.

[39]  Yuriy L. Orlov,et al.  Complexity: an internet resource for analysis of DNA sequence complexity , 2004, Nucleic Acids Res..

[40]  D. Jin Generating variable birdsong syllable sequences with branching chain networks in avian premotor nucleus HVC. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  S. C. Evans,et al.  Network security through conservation of complexity , 2002, MILCOM 2002. Proceedings.

[42]  J G Robinson,et al.  An analysis of the organization of vocal communication in the titi monkey Callicebus moloch. , 2010, Zeitschrift fur Tierpsychologie.

[43]  Abraham Lempel,et al.  On the Complexity of Finite Sequences , 1976, IEEE Trans. Inf. Theory.

[44]  L. Blaustein,et al.  Syntactic structure and geographical dialects in the songs of male rock hyraxes , 2012, Proceedings of the Royal Society B: Biological Sciences.

[45]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[46]  Peter L. Tyack,et al.  Repeated call types in short‐finned pilot whales, Globicephala macrorhynchus , 2013 .

[47]  Bruce Edmonds,et al.  What is Complexity? - The philosophy of complexity per se with application to some examples in evolution , 1995 .

[48]  Masato Okada,et al.  Complex Sequencing Rules of Birdsong Can be Explained by Simple Hidden Markov Processes , 2010, PloS one.

[49]  John R. Buck,et al.  The use of Zipf's law in animal communication analysis , 2005, Animal Behaviour.

[50]  Thierry Aubin,et al.  Are bird song complexity and song sharing shaped by habitat structure? An information theory and statistical approach. , 2010, Journal of theoretical biology.

[51]  M. Small Applied Nonlinear Time Series Analysis: Applications in Physics, Physiology and Finance , 2005 .

[52]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[53]  Ramon Ferrer i Cancho,et al.  When language breaks into pieces. A conflict between communication through isolated signals and language. , 2006, Bio Systems.