Monkeys are perceptually tuned to facial expressions that exhibit a theta-like speech rhythm

Human speech universally exhibits a 3- to 8-Hz rhythm, corresponding to the rate of syllable production, which is reflected in both the sound envelope and the visual mouth movements. Artificial perturbation of the speech rhythm outside the natural range reduces speech intelligibility, demonstrating a perceptual tuning to this frequency band. One theory posits that the mouth movements at the core of this speech rhythm evolved through modification of ancestral primate facial expressions. Recent evidence shows that one such communicative gesture in macaque monkeys, lip-smacking, has motor parallels with speech in its rhythmicity, its developmental trajectory, and the coordination of vocal tract structures. Whether monkeys also exhibit a perceptual tuning to the natural rhythms of lip-smacking is unknown. To investigate this, we tested rhesus monkeys in a preferential-looking procedure, measuring the time spent looking at each of two side-by-side computer-generated monkey avatars lip-smacking at natural versus sped-up or slowed-down rhythms. Monkeys showed an overall preference for the natural rhythm compared with the perturbed rhythms. This lends behavioral support for the hypothesis that perceptual processes in monkeys are similarly tuned to the natural frequencies of communication signals as they are in humans. Our data provide perceptual evidence for the theory that speech may have evolved from ancestral primate rhythmic facial expressions.

[1]  K. Saberi,et al.  Cognitive restoration of reversed speech , 1999, Nature.

[2]  B. Karmel,et al.  Correlation of infants' brain and behavior response to temporal changes in visual stimulation. , 1977, Psychophysiology.

[3]  L. Fogassi,et al.  Neonatal Imitation in Rhesus Macaques , 2006, PLoS biology.

[4]  Asif A Ghazanfar,et al.  Monkey lipsmacking develops like the human speech rhythm. , 2012, Developmental science.

[5]  Asif A. Ghazanfar,et al.  The Natural Statistics of Audiovisual Speech , 2009, PLoS Comput. Biol..

[6]  S. Suomi,et al.  Reciprocal Face-to-Face Communication between Rhesus Macaque Mothers and Their Newborn Infants , 2009, Current Biology.

[7]  Steven Greenberg,et al.  Temporal properties of spontaneous speech - a syllable-centric perspective , 2003, J. Phonetics.

[8]  R. A. Hinde,et al.  COMMUNICATION BY POSTURES AND FACIAL EXPRESSIONS IN THE RHESUS MONKEY (MACACA MULATTA) , 2009 .

[9]  D. Lieberman,et al.  Hyoid and tongue surface movements in speaking and eating. , 2002, Archives of oral biology.

[10]  C A Moore,et al.  Does speech emerge from earlier appearing oral motor behaviors? , 1996, Journal of speech and hearing research.

[11]  D. Lewkowicz Developmental changes in infants' visual response to temporal frequency. , 1985 .

[12]  R. Ringel,et al.  Task-specific organization of activity in human jaw muscles. , 1988, Journal of speech and hearing research.

[13]  Richard S. J. Frackowiak,et al.  Endogenous Cortical Rhythms Determine Cerebral Specialization for Speech Perception and Production , 2007, Neuron.

[14]  Oded Ghitza,et al.  On the Role of Theta-Driven Syllabic Parsing in Decoding Speech: Intelligibility of Speech with a Manipulated Modulation Spectrum , 2012, Front. Psychology.

[15]  D. Poeppel,et al.  Auditory Cortex Tracks Both Auditory and Visual Stimulus Dynamics Using Low-Frequency Neuronal Phase Modulation , 2010, PLoS biology.

[16]  A. Ghazanfar,et al.  Cineradiography of Monkey Lip-Smacking Reveals Putative Precursors of Speech Dynamics , 2012, Current Biology.

[17]  Marcelo A. Montemurro,et al.  Spike-Phase Coding Boosts and Stabilizes Information Carried by Spatial and Temporal Spike Patterns , 2009, Neuron.

[18]  Ankoor S. Shah,et al.  An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. , 2005, Journal of neurophysiology.

[19]  R. Desimone,et al.  Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque. , 1981, Journal of neurophysiology.

[20]  D. Lewkowicz,et al.  Heterochrony and Cross-Species Intersensory Matching by Infant Vervet Monkeys , 2009, PloS one.

[21]  Leslie B. Cohen,et al.  Infant perception: From sensation to cognition , 1975 .

[22]  Stefano Panzeri,et al.  Visual Enhancement of the Information Representation in Auditory Cortex , 2010, Current Biology.

[23]  L. Rosenblum Primate Behavior: Developments in Field and Laboratory Research , 1970 .

[24]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[25]  Asif A Ghazanfar,et al.  Different neural frequency bands integrate faces and voices differently in the superior temporal sulcus. , 2009, Journal of neurophysiology.

[26]  Steven Greenberg,et al.  On the Possible Role of Brain Rhythms in Speech Perception: Intelligibility of Time-Compressed Speech with Periodic and Aperiodic Insertions of Silence , 2009, Phonetica.

[27]  P. MacNeilage,et al.  The frame/content theory of evolution of speech production , 1998, Behavioral and Brain Sciences.

[28]  Ana B. Chica,et al.  Attentional Routes to Conscious Perception , 2012, Front. Psychology.

[29]  Koichiro Matsuo,et al.  Kinematic linkage of the tongue, jaw, and hyoid during eating and speech. , 2010, Archives of oral biology.

[30]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  N. Logothetis,et al.  Neuroperception: Facial expressions linked to monkey calls , 2003, Nature.

[32]  K. Hiiemae,et al.  Tongue movements in feeding and speech. , 2003, Critical reviews in oral biology and medicine : an official publication of the American Association of Oral Biologists.

[33]  T H Crystal,et al.  Segmental durations in connected speech signals: preliminary results. , 1982, The Journal of the Acoustical Society of America.

[34]  Christophe Boesch,et al.  Buttress drumming by wild chimpanzees: Temporal patterning, phrase integration into loud calls, and preliminary evidence for individual distinctiveness , 1998, Primates.

[35]  Melanie Vitkovitch,et al.  Visible Speech as a Function of Image Quality: Effects of Display Parameters on Lipreading Ability , 1996 .

[36]  Asif A Ghazanfar,et al.  Monkey visual behavior falls into the uncanny valley , 2009, Proceedings of the National Academy of Sciences.

[37]  Ammie K. Kalan,et al.  Hand-clapping as a communicative gesture by wild female swamp gorillas , 2009, Primates.

[38]  Christoph Kayser,et al.  Monkey drumming reveals common networks for perceiving vocal and nonvocal communication sounds , 2009, Proceedings of the National Academy of Sciences.

[39]  Benedict Shien Wei Ng,et al.  EEG phase patterns reflect the selectivity of neural firing. , 2013, Cerebral cortex.

[40]  D. Ostry,et al.  Control of jaw orientation and position in mastication and speech. , 1994, Journal of neurophysiology.

[41]  L. Parr,et al.  Influence of Social Context on the Use of Blended and Graded Facial Displays in Chimpanzees , 2005, International Journal of Primatology.

[42]  Frédéric E. Theunissen,et al.  The Modulation Transfer Function for Speech Intelligibility , 2009, PLoS Comput. Biol..

[43]  Elizabeth M. Brannon,et al.  Monkeys Match the Number of Voices They Hear to the Number of Faces They See , 2005, Current Biology.

[44]  Oded Ghitza,et al.  Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm , 2011, Front. Psychology.

[45]  A. Woods,et al.  Context Modulates the Contribution of Time and Space in Causal Inference , 2012, Front. Psychology.

[46]  Jordan R. Green,et al.  Babbling, chewing, and sucking: oromandibular coordination at 9 months. , 2008, Journal of speech, language, and hearing research : JSLHR.

[47]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[48]  Asif A Ghazanfar,et al.  Interactions between the Superior Temporal Sulcus and Auditory Cortex Mediate Dynamic Face/Voice Integration in Rhesus Monkeys , 2008, The Journal of Neuroscience.

[49]  Zachary M. Smith,et al.  Chimaeric sounds reveal dichotomies in auditory perception , 2002, Nature.

[50]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[51]  David Poeppel,et al.  The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time' , 2003, Speech Commun..

[52]  William K. Redican,et al.  Facial Expressions in Nonhuman Primates , 1975 .

[53]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[54]  Robin I. M. Dunbar,et al.  Evolution and ecology of macaque societies , 1996 .

[55]  A. Malécot,et al.  Syllabic Rate and Utterance Length in French , 1972, Phonetica.

[56]  G. Buzsáki,et al.  Neuronal Oscillations in Cortical Networks , 2004, Science.

[57]  Asif A Ghazanfar,et al.  Dynamic, rhythmic facial expressions and the superior temporal sulcus of macaque monkeys: implications for the evolution of audiovisual speech , 2010, The European journal of neuroscience.

[58]  Asif A Ghazanfar,et al.  Multisensory Integration of Looming Signals by Rhesus Monkeys , 2004, Neuron.

[59]  Asif A. Ghazanfar,et al.  Monkeys and Humans Share a Common Computation for Face/Voice Integration , 2011, PLoS Comput. Biol..

[60]  Joost X. Maier,et al.  Multisensory Integration of Dynamic Faces and Voices in Rhesus Monkey Auditory Cortex , 2005 .

[61]  Roy D. Patterson,et al.  Vocal-Tract Resonances as Indexical Cues in Rhesus Monkeys , 2007, Current Biology.

[62]  E. Visalberghi,et al.  Facial Displays in Young Tufted Capuchin Monkeys (Cebus apella): Appearance, Meaning, Context and Target , 2007, Folia Primatologica.

[63]  Roger W. Steeve,et al.  Babbling and chewing: Jaw kinematics from 8 to 22 months , 2010, J. Phonetics.