Action to Language via the Mirror Neuron System: The role of vocal tract gestural action units in understanding the evolution of phonology

Language can be viewed as a structuring of cognitive units that can be transmitted among individuals for the purpose of communicating information. Cognitive units stand in specific and systematic relationships with one another, and linguists are interested in the characterization of these units and the nature of these relationships. Both can be examined at various levels of granularity. It has long been observed that languages exhibit distinct patterning of units in syntax and in phonology. This distinction, a universal characteristic of language, is termed duality of patterning (Hockett, 1960). Syntax refers to the structuring of words in sequence via hierarchical organization, where words are meaningful units belonging to an infinitely expandable set. But words also are composed of structured cognitive units. Phonology structures a small, closed set of recombinable, non-meaningful units that compose words (or signs, in the case of signed languages). It is precisely the use of a set of non-meaningful arbitrary discrete units that allows word creation to be productive. 1 In this chapter we outline a proposal that views the evolution of syntax and of phonology as arising from different sources and ultimately converging in a symbiotic relationship. Duality of patterning forms the intellectual basis for this proposal. Grasp and other manual gestures in early hominids are, as Arbib (Chapter 1, this volume) notes, well suited to provide a link from the iconic to the symbolic. Critically, the iconic aspects of manual gestures lend them a meaningful aspect that is critical to evolution of a system of symbolic units. However, we will argue that, given duality of patterning, phonological evolution crucially requires the emergence of effectively non-meaningful combinatorial units. We suggest that vocal tract action gestures are well suited to play a direct role in phonological evolution because, as argued by Studdert-Kennedy (2002a), they are

[1]  W. Prinz Perception and Action Planning , 1997 .

[2]  G. Rizzolatti,et al.  ß Federation of European Neuroscience Societies Mirror , 2003 .

[3]  L. Goldstein Emergence of discrete gestures , 2003 .

[4]  F H Guenther,et al.  Speech sound acquisition, coarticulation, and rate effects in a neural network model of speech production. , 1995, Psychological review.

[5]  A. Liberman,et al.  Minimal Rules for Synthesizing Speech , 1959 .

[6]  P. Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar , 2004 .

[7]  E. Thelen The (re)discovery of motor development: Learning new things from an old field. , 1989 .

[8]  I. Mattingly Phonetic Representation and Speech Synthesis by Rule , 1981 .

[9]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[10]  Elliot Saltzman,et al.  Task Dynamic Coordination of the Speech Articulators: A Preliminary Model , 1986 .

[11]  Adamantios I. Gafos,et al.  A Grammar of Gestural Coordination , 2002 .

[12]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[13]  Bart de Boer,et al.  Self-organization in vowel systems , 2000, J. Phonetics.

[14]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[15]  Z. Harris,et al.  Methods in structural linguistics. , 1952 .

[16]  Brett R Fajen,et al.  Behavioral dynamics of steering, obstacle avoidance, and route selection. , 2003, Journal of experimental psychology. Human perception and performance.

[17]  Kenneth N. Stevens,et al.  On the quantal nature of speech , 1972 .

[18]  S. Shen-Orr,et al.  Superfamilies of Evolved and Designed Networks , 2004, Science.

[19]  Abigail C. Cohn THE STATUS OF NASALIZED CONTINUANTS , 1993 .

[20]  Andrew Butcher,et al.  VC vs. CV syllables: a comparison of Aboriginal languages with English , 2004, Journal of the International Phonetic Association.

[21]  Pierre Delattre CONSONANT GEMINATION IN FOUR LANGUAGES: AN ACOUSTIC, PERCEPTUAL, AND RADIOGRAPHIC STUDY PART I , 1971 .

[22]  Louis Goldstein,et al.  Dynamics and articulatory phonology , 1996 .

[23]  L. Rosenblum,et al.  An audiovisual test of kinematic primitives for visual speech perception. , 1996, Journal of experimental psychology. Human perception and performance.

[24]  Pierre Delattre CONSONANT GEMINATION IN FOUR LANGUAGES: AN ACOUSTIC, PERCEPTUAL, AND RADIOGRAPHIC STUDY PART II , 1971 .

[25]  Michael I. Jordan Motor Learning and the Degrees of Freedom Problem , 2018, Attention and Performance XIII.

[26]  G. Rizzolatti,et al.  Hearing Sounds, Understanding Actions: Action Representation in Mirror Neurons , 2002, Science.

[27]  Reza Shadmehr,et al.  Equilibrium point hypothesis , 1998 .

[28]  R. Krakow Physiological organization of syllables: a review , 1999 .

[29]  Michael C. Corballis,et al.  From Hand to Mouth: The Origins of Language , 2002 .

[30]  H. Haken,et al.  A theoretical model of phase transitions in human hand movements , 2004, Biological Cybernetics.

[31]  Frank H. Guenther,et al.  A neural network model of speech acquisition and motor equivalent speech production , 2004, Biological Cybernetics.

[32]  J. Doyne Farmer,et al.  A Rosetta stone for connectionism , 1990 .

[33]  M. Arbib From monkey-like action recognition to human language: An evolutionary framework for neurolinguistics , 2005, Behavioral and Brain Sciences.

[34]  C. Browman,et al.  Articulatory Phonology: An Overview , 1992, Phonetica.

[35]  A cross-language study of vowel nasalization , 1972 .

[36]  D W Massaro,et al.  Perception of asynchronous and conflicting visual and auditory speech. , 1996, The Journal of the Acoustical Society of America.

[37]  G. Aschersleben,et al.  The Theory of Event Coding (TEC): a framework for perception and action planning. , 2001, The Behavioral and brain sciences.

[38]  Antje S. Meyer,et al.  Phonetics and Phonology in Language Comprehension and Production , 2003 .

[39]  Tamar Flash,et al.  Computational approaches to motor control , 2001, Current Opinion in Neurobiology.

[40]  M. Studdert-Kennedy Hand and Mind: What Gestures Reveal About Thought. , 1994 .

[41]  Dani Byrd,et al.  The Elastic Phrase: Dynamics of Boundary-Adjacent Lengthening , 2003 .

[42]  Amanda Lathroum,et al.  Feature encoding by neural nets , 1989, Phonology.

[43]  Pierre-Yves Oudeyer,et al.  The Self-Organization of Speech Sounds , 2005, Journal of theoretical biology.

[44]  T. Gelder,et al.  Mind as Motion: Explorations in the Dynamics of Cognition , 1995 .

[45]  J. Kelso,et al.  Skilled actions: a task-dynamic approach. , 1987, Psychological review.

[46]  K G Munhall,et al.  Skill acquisition and development: the roles of state-, parameter, and graph dynamics. , 1992, Journal of motor behavior.

[47]  William L. Abler,et al.  On the particulate principle of self-diversifying systems , 1989 .

[48]  C. Browman,et al.  Competing constraints on intergestural coordination and self-organization of phonological structures , 2000 .

[49]  C. Fowler,et al.  Listening with eye and hand: cross-modal contributions to speech perception. , 1991, Journal of experimental psychology. Human perception and performance.

[50]  Gérard Bailly,et al.  Formant trajectories as audible gestures: An alternative for speech synthesis , 1991 .

[51]  Alvin M. Liberman,et al.  Speech: A Special Code , 1996 .

[52]  C. Stoel-Gammon,et al.  Phonetic inventories, 15-24 months: a longitudinal study. , 1985, Journal of speech and hearing research.

[53]  Anders Löfqvist,et al.  Dynamics of intergestural timing: a perturbation study of lip-larynx coordination , 1998, Experimental Brain Research.

[54]  Peter F. MacNeilage,et al.  The frame/content theory of evolution of speech: A comparison with a gestural-origins alternative , 2005 .

[55]  F. Dell,et al.  Syllables In Tashlhiyt Berber And In Moroccan Arabic , 2002 .

[56]  D. Falk Prelinguistic evolution in early hominins: Whence motherese? , 2004, Behavioral and Brain Sciences.

[57]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[58]  M. Studdert-Kennedy,et al.  Approaches to the Evolution of Language , 1999 .

[59]  A M Surprenant,et al.  The perception of speech gestures. , 1993, The Journal of the Acoustical Society of America.

[60]  R. Krakow NONSEGMENTAL INFLUENCES ON VELUM MOVEMENT PATTERNS: SYLLABLES, SENTENCES, STRESS, AND SPEAKING RATE , 1993 .

[61]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .

[62]  J. M. Pickett,et al.  Producing Speech: Contemporary Issues, for Katherine Safford Harris , 1996 .

[63]  C. Browman,et al.  Papers in Laboratory Phonology: Tiers in articulatory phonology, with some implications for casual speech , 1990 .

[64]  F. Dell,et al.  Syllabic Consonants and Syllabification in Imdlawn Tashlhiyt Berber , 1985 .

[65]  Dani Byrd,et al.  The Distinctions Between State, Parameter and Graph Dynamics in Sensorimotor Control and Coordination , 2006 .

[66]  Peter F. MacNeilage,et al.  Acquisition of Serial Complexity in Speech Production: A Comparison of Phonetic and Phonological Approaches to First Word Production , 2002, Phonetica.

[67]  Elliot Saltzman,et al.  Dynamics and coordinate systems in skilled sensorimotor activity , 1996 .

[68]  Dani Byrd,et al.  Influences on articulatory timing in consonant sequences , 1996 .

[69]  J A Kelso,et al.  The production and perception of syllable structure. , 1991, Journal of speech and hearing research.

[70]  Andrew Butcher,et al.  WHAT SPEAKERS OF AUSTRALIAN ABORIGINAL LANGUAGES DO WITH THEIR VELUMS AND WHY : THE PHONETICS OF THE NASAL / ORAL CONTRAST , 1999 .

[71]  Michael I. Jordan Constrained supervised learning , 1992 .

[72]  G. Rizzolatti,et al.  Response properties and behavioral modulation of ‘mouth’ neurons of the postarcuate cortex (area 6) in macaque monkeys , 1981, Brain Research.

[73]  P. Goldman-Rakic,et al.  An auditory domain in primate prefrontal cortex , 2002, Nature Neuroscience.

[74]  P. MacNeilage,et al.  The articulatory basis of babbling. , 1995, Journal of speech and hearing research.

[75]  L Polka,et al.  A cross-language comparison of /d/-/th/ perception: evidence for a new developmental pattern. , 2001, The Journal of the Acoustical Society of America.

[76]  Dani Byrd,et al.  Task-dynamics of gestural timing: Phase windows and multifrequency rhythms , 2000 .

[77]  P. MacNeilage,et al.  The motor core of speech: a comparison of serial organization patterns in infants and languages. , 2000, Child development.

[78]  Pierre-Yves Oudeyer L' auto-organisation de la parole , 2003 .

[79]  G. McRoberts,et al.  Infant Perception of Non-Native Consonant Contrasts that Adults Assimilate in Different Ways , 2003, Language and speech.

[80]  Bart de Boer,et al.  Emergence of Vowel Systems Through Self-Organisation , 2000, AI Commun..

[81]  Zellig S. Harris,et al.  Methods in structural linguistics. , 1952 .

[82]  Marianne Pouplier,et al.  The Dynamics of Error , 2003 .

[83]  P. MacNeilage,et al.  The frame/content theory of evolution of speech production , 1998, Behavioral and Brain Sciences.

[84]  M. Turvey,et al.  The motor theory of speech perception reviewed , 2006, Psychonomic bulletin & review.

[85]  Dani Byrd,et al.  A Phase Window Framework for Articulatory Timing , 1996, Phonology.

[86]  Michael Studdert-Kennedy,et al.  The Evolutionary Emergence of Language: Evolutionary Implications of the Particulate Principle: Imitation and the Dissociation of Phonetic Form from Semantic Function , 2000 .

[87]  L. Goldstein,et al.  Articulatory Phonology: A phonology for public language use , 2003 .

[88]  M. Arbib,et al.  Language within our grasp , 1998, Trends in Neurosciences.

[89]  Nan Bernstein Ratner,et al.  Phonological rule usage in mother-child speech , 1984 .

[90]  Marilyn M. Vihman,et al.  Phonological Development , 2014 .

[91]  Ferdinando A. Mussa-Ivaldi Geometrical principles in motor control , 1998 .

[92]  Michael Studdert-Kennedy,et al.  Launching Language: The Gestural Origin of Discrete Infinity , 2002 .

[93]  Stephen Grossberg,et al.  A neural network model for cursive script production , 1993, Biological Cybernetics.

[94]  L. Barsalou,et al.  Whither structured representation? , 1999, Behavioral and Brain Sciences.

[95]  Peter F. MacNeilage,et al.  Acquisition of Speech Production: The Achievement of Segmental Independence , 1990 .

[96]  A. Meltzoff,et al.  Explaining Facial Imitation: A Theoretical Model. , 1997, Early development & parenting.

[97]  Michael Studdert-Kennedy Mirror neurons, vocal imitation, and the evolution of particulate speech , 2002 .

[98]  K. Stevens,et al.  Linguistic experience alters phonetic perception in infants by 6 months of age. , 1992, Science.

[99]  Osamu Fujimura,et al.  Allophonic variation in English /l/ and its implications for phonetic implementation , 1993 .