Linguistic category structure influences early auditory processing: Converging evidence from mismatch responses and cortical oscillations

While previous research has established that language-specific knowledge influences early auditory processing, it is still controversial as to what aspects of speech sound representations determine early speech perception. Here, we propose that early processing primarily depends on information propagated top-down from abstractly represented speech sound categories. In particular, we assume that mid-vowels (as in 'bet') exert less top-down effects than the high-vowels (as in 'bit') because of their less specific (default) tongue height position as compared to either high- or low-vowels (as in 'bat'). We tested this assumption in a magnetoencephalography (MEG) study where we contrasted mid- and high-vowels, as well as the low- and high-vowels in a passive oddball paradigm. Overall, significant differences between deviants and standards indexed reliable mismatch negativity (MMN) responses between 200 and 300ms post-stimulus onset. MMN amplitudes differed in the mid/high-vowel contrasts and were significantly reduced when a mid-vowel standard was followed by a high-vowel deviant, extending previous findings. Furthermore, mid-vowel standards showed reduced oscillatory power in the pre-stimulus beta-frequency band (18-26Hz), compared to high-vowel standards. We take this as converging evidence for linguistic category structure to exert top-down influences on auditory processing. The findings are interpreted within the linguistic model of underspecification and the neuropsychological predictive coding framework.

[1]  A. Lahiri,et al.  “What you encode is not necessarily what you store”: Evidence for sparse feature representations from mismatch negativity , 2011, Brain Research.

[2]  T. Baldeweg Repetition effects to sounds: evidence for predictive coding in the auditory system , 2006, Trends in Cognitive Sciences.

[3]  Luc H. Arnal,et al.  Transitions in neural oscillations reflect prediction errors generated in audiovisual speech , 2011, Nature Neuroscience.

[4]  G. Bidelman Induced neural beta oscillations predict categorical speech perception abilities , 2015, Brain and Language.

[5]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[6]  Nelson J. Trujillo-Barreto,et al.  A Sparse Neural Code for Some Speech Sounds but Not for Others , 2012, PloS one.

[7]  Philip J. Monahan,et al.  Asymmetries in the processing of vowel height. , 2012, Journal of speech, language, and hearing research : JSLHR.

[8]  J. Perkell Properties of the tongue help to define vowel categories: hypotheses based on physiologically-oriented modeling , 1996 .

[9]  Bernhard Ross,et al.  The Neurotopography of Vowels as Mirrored by Evoked Magnetic Field Measurements , 1996, Brain and Language.

[10]  A. Engel,et al.  Beta-band oscillations—signalling the status quo? , 2010, Current Opinion in Neurobiology.

[11]  Karl J. Friston,et al.  The mismatch negativity: A review of underlying mechanisms , 2009, Clinical Neurophysiology.

[12]  Xiao-Jing Wang Neurophysiological and computational principles of cortical rhythms in cognition. , 2010, Physiological reviews.

[13]  E. Miller,et al.  Response to Comment on "Top-Down Versus Bottom-Up Control of Attention in the Prefrontal and Posterior Parietal Cortices" , 2007, Science.

[14]  N. Cowan,et al.  Interactions between Transient and Long-Term Auditory Memory as Reflected by the Mismatch Negativity , 1996, Journal of Cognitive Neuroscience.

[15]  Holger Mitterer,et al.  Auditory cortical tuning to statistical regularities in phonology , 2005, Clinical Neurophysiology.

[16]  A. Lahiri,et al.  Neurobiological Evidence for Abstract Phonological Representations in the Mental Lexicon during Speech Recognition , 2004, Journal of Cognitive Neuroscience.

[17]  P. Zwitserlood,et al.  Processing Nasals with and without Consecutive Context Phonemes: Evidence from Explicit Categorization and the N100 , 2013, Front. Psychology.

[18]  David Poeppel,et al.  Acoustic landmarks drive delta–theta oscillations to enable speech comprehension by facilitating perceptual parsing , 2014, NeuroImage.

[19]  C. Pantev,et al.  A Beamformer Analysis of MEG Data Reveals Frontal Generators of the Musically Elicited Mismatch Negativity , 2013, PloS one.

[20]  Aditi Lahiri,et al.  Inequality across consonantal contrasts in speech perception: evidence from mismatch negativity. , 2013, Journal of experimental psychology. Human perception and performance.

[21]  Y. Prut,et al.  Do sensorimotor β-oscillations maintain muscle synergy representations in primary motor cortex? , 2015, Trends in Neurosciences.

[22]  G. Karmos,et al.  Adaptive modeling of the unattended acoustic environment reflected in the mismatch negativity event-related potential , 1996, Brain Research.

[23]  Jonathan Z. Simon,et al.  Abstract Journal of Neuroscience Methods 165 (2007) 297–305 Denoising based on time-shift PCA , 2007 .

[24]  Mathias Scharinger,et al.  Minimal representations of alternating vowels , 2009 .

[25]  Claudia K. Friedrich,et al.  Neurophysiological evidence for underspecified lexical representations: asymmetries with word initial variations. , 2008, Journal of experimental psychology. Human perception and performance.

[26]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[27]  Friedemann Pulvermüller,et al.  Strength of Word-Specific Neural Memory Traces Assessed Electrophysiologically , 2011, PloS one.

[28]  M. Dorman,et al.  Neurophysiologic correlates of cross-language phonetic perception. , 2000, The Journal of the Acoustical Society of America.

[29]  P. Brockhoff,et al.  lmerTest: Tests for random and fixed effects for linear mixed effect models (lmer objects of lme4 package) , 2014 .

[30]  Jonas Obleser,et al.  Suppressed alpha oscillations predict intelligibility of speech and its acoustic details. , 2012, Cerebral cortex.

[31]  J. Obleser,et al.  Frequency modulation entrains slow neural oscillations and optimizes human listening behavior , 2012, Proceedings of the National Academy of Sciences.

[32]  R. Ilmoniemi,et al.  Language-specific phoneme representations revealed by electric and magnetic brain responses , 1997, Nature.

[33]  I. Winkler,et al.  The concept of auditory stimulus representation in cognitive neuroscience. , 1999, Psychological bulletin.

[34]  W. Idsardi,et al.  Sparseness of vowel category structure: Evidence from English dialect comparison. , 2014, Lingua. International review of general linguistics. Revue internationale de linguistique generale.

[35]  R. Näätänen,et al.  The mismatch negativity (MMN) in basic research of central auditory processing: A review , 2007, Clinical Neurophysiology.

[36]  D. Poeppel,et al.  Auditory Cortex Accesses Phonological Categories: An MEG Mismatch Study , 2000, Journal of Cognitive Neuroscience.

[37]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[38]  R. C. Oldfield The assessment and analysis of handedness: the Edinburgh inventory. , 1971, Neuropsychologia.

[39]  F. Pulvermüller,et al.  Language outside the focus of attention: The mismatch negativity as a tool for studying higher cognitive processes , 2006, Progress in Neurobiology.

[40]  G. Buzsáki Rhythms of the brain , 2006 .

[41]  Risto Näätänen,et al.  Native and foreign vowel discrimination as indexed by the mismatch negativity (MMN) response , 2003, Neuroscience Letters.

[42]  M. Tervaniemi,et al.  Pre-Attentive Discriminability of Sound Order as a Function of Tone Duration and Interstimulus Interval: A Mismatch Negativity Study , 1999, Audiology and Neurotology.

[43]  Emmanuel Dupoux,et al.  Electrophysiological Correlates of Phonological Processing: A Cross-linguistic Study , 2000, Journal of Cognitive Neuroscience.

[44]  M. Hallett,et al.  Integrative visuomotor behavior is associated with interregionally coherent oscillations in the human brain. , 1998, Journal of neurophysiology.

[45]  R. Näätänen,et al.  Mismatch negativity--the measure for central sound representation accuracy. , 1997, Audiology & neuro-otology.

[46]  O. Bertrand,et al.  Stimulus Frequency Dependence of the Transient Oscillatory Auditory Evoked Responses (40 Hz) Studied by Electric and Magnetic Recordings in Human , 1994 .

[47]  D. Bates,et al.  Linear Mixed-Effects Models using 'Eigen' and S4 , 2015 .

[48]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[49]  C. Schroeder,et al.  Neuronal oscillations as a mechanistic substrate of auditory temporal prediction , 2015, Annals of the New York Academy of Sciences.

[50]  Roshan Cools,et al.  Region-specific modulations in oscillatory alpha activity serve to facilitate processing in the visual and auditory modalities , 2014, NeuroImage.

[51]  Jonas Obleser,et al.  Magnetic Brain Response Mirrors Extraction of Phonological Features from Spoken Vowels , 2004, Journal of Cognitive Neuroscience.

[52]  E. Schröger The mismatch negativity as a tool to study auditory processing , 2005 .

[53]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[54]  Sibylle C. Herholz,et al.  Looking for a pattern: An MEG study on the abstract mismatch negativity in musicians and nonmusicians , 2009, BMC Neuroscience.

[55]  Anne-Lise Giraud,et al.  The contribution of frequency-specific activity to hierarchical information processing in the human auditory cortex , 2014, Nature Communications.

[56]  Paavo Alku,et al.  Automatic Auditory Processing of English Words as Indexed by the Mismatch Negativity, Using a Multiple Deviant Paradigm , 2004, Ear and hearing.

[57]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[58]  R. Näätänen The perception of speech sounds by the human brain as reflected by the mismatch negativity (MMN) and its magnetic equivalent (MMNm). , 2001, Psychophysiology.

[59]  I. Winkler Interpreting the Mismatch Negativity , 2007 .

[60]  G. Buzsáki,et al.  Neuronal Oscillations in Cortical Networks , 2004, Science.

[61]  Robert Oostenveld,et al.  FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data , 2010, Comput. Intell. Neurosci..

[62]  Matthew H. Davis,et al.  Neural Oscillations Carry Speech Rhythm through to Comprehension , 2012, Front. Psychology.

[63]  Luc H. Arnal,et al.  Cortical oscillations and sensory predictions , 2012, Trends in Cognitive Sciences.

[64]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[65]  Henning Reetz,et al.  Distinctive features: Phonological underspecification in representation and processing , 2010, J. Phonetics.

[66]  M. Bastiaansen,et al.  Fast oscillatory dynamics during language comprehension: Unification versus maintenance and prediction? , 2015, Brain and Language.

[67]  E. Schröger,et al.  Early electrophysiological indicators for predictive processing in audition: a review. , 2012, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[68]  R Näätänen,et al.  Pre-attentive detection of vowel contrasts utilizes both phonetic and auditory memory representations. , 1999, Brain research. Cognitive brain research.

[69]  B. Hangya,et al.  Phase Entrainment of Human Delta Oscillations Can Mediate the Effects of Expectation on Reaction Speed , 2010, The Journal of Neuroscience.

[70]  T. Hothorn,et al.  Multiple Comparisons Using R , 2010 .

[71]  Mathias Scharinger,et al.  A Comprehensive Three-dimensional Cortical Map of Vowel Space , 2011, Journal of Cognitive Neuroscience.

[72]  William D. Marslen-Wilson,et al.  Tracking speech comprehension in space and time , 2006, NeuroImage.

[73]  Alain de Cheveigné,et al.  Sensor noise suppression , 2008, Journal of Neuroscience Methods.

[74]  Sonja A. Kotz,et al.  Alpha and theta brain oscillations index dissociable processes in spoken word recognition , 2014, NeuroImage.