Focal versus distributed temporal cortex activity for speech sound category assignment

Percepts and words can be decoded from largely distributed neural activity measures. The existence of widespread representations might, however, conflict with the fundamental notions of hierarchical processing and efficient coding. Using fMRI and MEG during syllable identification, we first show that sensory and decisional activity co-localize to a restricted part of the posterior superior temporal cortex. Next, using intracortical recordings we demonstrate that early and focal neural activity in this region distinguishes correct from incorrect decisions and can be machine-decoded to classify syllables. Crucially, significant machine-decoding was possible from neuronal activity sampled across widespread regions, despite weak or absent sensory or decision-related responses. These findings show that a complex behavior like speech sound categorization relies on an efficient readout of focal neural activity, while distributed activity, although decodable by machine-learning, reflects collateral processes of sensory perception and decision.

[1]  Rainer Goebel,et al.  "Who" Is Saying "What"? Brain-Based Decoding of Human Voice and Speech , 2008, Science.

[2]  Matthew H. Davis,et al.  Prediction Errors but Not Sharpened Signals Simulate Multivoxel fMRI Patterns during Speech Perception , 2016, PLoS biology.

[3]  J. Snyder,et al.  Changes in auditory cortex parallel rapid perceptual learning. , 2006, Cerebral cortex.

[4]  Shihab Shamma,et al.  On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system. , 2013, The Journal of the Acoustical Society of America.

[5]  Jonas Obleser,et al.  Bilateral Speech Comprehension Reflects Differential Sensitivity to Spectral and Temporal Features , 2008, The Journal of Neuroscience.

[6]  R. Ilmoniemi,et al.  Interpreting magnetic fields of the brain: minimum norm estimates , 2006, Medical and Biological Engineering and Computing.

[7]  Edward F Chang,et al.  The auditory representation of speech sounds in human motor cortex , 2016, eLife.

[8]  S. Dehaene,et al.  Characterizing the dynamics of mental representations: the temporal generalization method , 2014, Trends in Cognitive Sciences.

[9]  Luc H. Arnal,et al.  Cortical oscillations and sensory predictions , 2012, Trends in Cognitive Sciences.

[10]  Einat Liebenthal,et al.  Neural Dynamics of Phonological Processing in the Dorsal Auditory Stream , 2013, The Journal of Neuroscience.

[11]  Si Wu,et al.  Perceptual training continuously refines neuronal population codes in primary visual cortex , 2014, Nature Neuroscience.

[12]  D. T. Ives,et al.  Optimal Combination of Neural Temporal Envelope and Fine Structure Cues to Explain Speech Identification in Background Noise , 2014, The Journal of Neuroscience.

[13]  陈琦 Neural Organization of Spoken Language Revealed by Lesion-Symptom Mapping. , 2014 .

[14]  N. Kriegeskorte,et al.  Author ' s personal copy Representational geometry : integrating cognition , computation , and the brain , 2013 .

[15]  David A. Medler,et al.  Neural correlates of sensory and decision processes in auditory object identification , 2004, Nature Neuroscience.

[16]  Daniel D. Dilks,et al.  The Occipital Place Area Is Causally and Selectively Involved in Scene Perception , 2013, The Journal of Neuroscience.

[17]  Anne-Lise Giraud,et al.  The contribution of frequency-specific activity to hierarchical information processing in the human auditory cortex , 2014, Nature Communications.

[18]  R. Oostenveld,et al.  Nonparametric statistical testing of EEG- and MEG-data , 2007, Journal of Neuroscience Methods.

[19]  P. Fries Rhythms for Cognition: Communication through Coherence , 2015, Neuron.

[20]  Charles B. Mikell,et al.  Categorical speech representation in human superior temporal gyrus. , 2010, Neurosurgery.

[21]  D. Mirman,et al.  Neural Organization of Spoken Language Revealed by Lesion-Symptom Mapping , 2015, Nature Communications.

[22]  Jia-Hong Gao,et al.  Predictive Brain Mechanisms in Sound-to-Meaning Mapping during Speech Processing , 2016, The Journal of Neuroscience.

[23]  E. Formisano,et al.  How learning to abstract shapes neural sound representations , 2014, Front. Neurosci..

[24]  W. Fitch,et al.  Computational Constraints on Syntactic Processing in a Nonhuman Primate , 2004, Science.

[25]  W. Bastiaan Kleijn,et al.  Linear Predictive Analysis by Synthesis Coding , 1995 .

[26]  R. Zatorre,et al.  Spectral and temporal processing in human auditory cortex. , 2001, Cerebral cortex.

[27]  Vladimir Goncharoff,et al.  Interpolation of LPC spectra via pole shifting , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[28]  H. Sompolinsky,et al.  Population coding in neuronal systems with correlated noise. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Yinjuan Du,et al.  Noise differentially impacts phoneme representations in the auditory and speech motor systems , 2014, Proceedings of the National Academy of Sciences.

[30]  Richard Granger,et al.  Categorical Speech Processing in Broca's Area: An fMRI Study Using Multivariate Pattern-Based Analysis , 2012, The Journal of Neuroscience.

[31]  Giancarlo Valente,et al.  Task-Dependent Decoding of Speaker and Vowel Identity from Auditory Cortical Response Patterns , 2014, The Journal of Neuroscience.

[32]  Matthew H. Davis,et al.  Inferior Frontal Cortex Contributions to the Recognition of Spoken Words and Their Constituent Speech Sounds , 2017, Journal of Cognitive Neuroscience.

[33]  Giancarlo Valente,et al.  Brain-Based Translation: fMRI Decoding of Spoken Words in Bilinguals Reveals Language-Independent Semantic Representations in Anterior Temporal Lobe , 2014, The Journal of Neuroscience.

[34]  Tim Oates,et al.  A critique of multi-voxel pattern analysis , 2010 .

[35]  Jessica S. Arsenault,et al.  Distributed Neural Representations of Phonological Features during Speech Perception , 2015, The Journal of Neuroscience.

[36]  Janneke F. M. Jehee,et al.  Less Is More: Expectation Sharpens Representations in the Primary Visual Cortex , 2012, Neuron.

[37]  Friedemann Pulvermüller,et al.  Representational Similarity Mapping of Distributional Semantics in Left Inferior Frontal, Middle Temporal, and Motor Cortex , 2017, Cerebral cortex.

[38]  K. H. Britten,et al.  Neuronal plasticity that underlies improvement in perceptual performance. , 1994, Science.

[39]  Jonathan Grainger,et al.  Early Activity in Broca's Area During Reading Reflects Fast Access to Articulatory Codes From Print. , 2015, Cerebral cortex.

[40]  Thomas L. Griffiths,et al.  Supplementary Information for Natural Speech Reveals the Semantic Maps That Tile Human Cerebral Cortex , 2022 .

[41]  Anders M. Dale,et al.  Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature , 2010, NeuroImage.

[42]  Raymond J. Dolan,et al.  fMRI Activity Patterns in Human LOC Carry Information about Object Exemplars within Category , 2008, Journal of Cognitive Neuroscience.

[43]  Richard M. Leahy,et al.  Brainstorm: A User-Friendly Application for MEG/EEG Analysis , 2011, Comput. Intell. Neurosci..

[44]  J. Snyder,et al.  Age-related differences in auditory evoked responses during rapid perceptual learning , 2008, Clinical Neurophysiology.

[45]  S. Dehaene,et al.  Time-Resolved Decoding of Two Processing Chains during Dual-Task Interference , 2015, Neuron.

[46]  Robert T Knight,et al.  Decoding speech for understanding and treating aphasia. , 2013, Progress in brain research.

[47]  Mark A. Chevillet,et al.  Automatic Phoneme Category Selectivity in the Dorsal Auditory Stream , 2013, The Journal of Neuroscience.

[48]  E. Chang,et al.  Categorical Speech Representation in Human Superior Temporal Gyrus , 2010, Nature Neuroscience.

[49]  Tonio Ball,et al.  Causal interpretation rules for encoding and decoding models in neuroimaging , 2015, NeuroImage.

[50]  Robert Oostenveld,et al.  FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data , 2010, Comput. Intell. Neurosci..

[51]  Emily B. Myers,et al.  The Perception of Voice Onset Time: An fMRI Investigation of Phonetic Category Structure , 2005, Journal of Cognitive Neuroscience.

[52]  P. Marquis,et al.  Intracerebral Evoked Potentials in Pitch Perception Reveal a Functional Asymmetry of the Human Auditory Cortex , 2001, Annals of the New York Academy of Sciences.

[53]  Andrew S. Liu,et al.  Causal contribution of primate auditory cortex to auditory perceptual decision-making , 2015, Nature Neuroscience.

[54]  Mingzhou Ding,et al.  Estimating Granger causality from fourier and wavelet transforms of time series data. , 2007, Physical review letters.

[55]  Tomaso Poggio,et al.  Fast Readout of Object Identity from Macaque Inferior Temporal Cortex , 2005, Science.

[56]  Shreyas Mandre,et al.  Short-time dynamics of partial wetting. , 2008, Physical review letters.

[57]  R. Dolan,et al.  Fmri activity patterns in human loc carry information about object exemplars within category , 2008 .

[58]  Joel Z. Leibo,et al.  The dynamics of invariant object recognition in the human visual system. , 2014, Journal of neurophysiology.

[59]  Mingzhou Ding,et al.  Analyzing information flow in brain networks with nonparametric Granger causality , 2008, NeuroImage.

[60]  G. DeAngelis,et al.  How Can Single Sensory Neurons Predict Behavior? , 2015, Neuron.

[61]  Alexander G. Huth,et al.  Attention During Natural Vision Warps Semantic Representation Across the Human Brain , 2013, Nature Neuroscience.

[62]  Hideki Kawahara,et al.  Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[63]  Etienne Koechlin,et al.  Foundations of human reasoning in the prefrontal cortex , 2014, Science.

[64]  Sejin Yoo,et al.  Articulation-based sound perception in verbal repetition: a functional NIRS study , 2013, Front. Hum. Neurosci..

[65]  David A. Medler,et al.  Cerebral Cortex doi:10.1093/cercor/bhi040 Cerebral Cortex Advance Access published February 9, 2005 , 2022 .

[66]  Michael L. Anderson Précis of After Phrenology: Neural Reuse and the Interactive Brain , 2014, Behavioral and Brain Sciences.

[67]  Matthew H. Davis,et al.  Predictive Top-Down Integration of Prior Knowledge during Speech Perception , 2012, The Journal of Neuroscience.

[68]  P Sterzer,et al.  Contributions of sensory input, auditory search and verbal comprehension to cortical activity during speech processing. , 2004, Cerebral cortex.

[69]  Vocal tract motor patterns and resonance during constant frequency song: the white-throated sparrow , 2009, Journal of Comparative Physiology A.

[70]  Keith Johnson,et al.  Phonetic Feature Encoding in Human Superior Temporal Gyrus , 2014, Science.