Neural Representation of Concurrent Vowels in Macaque Primary Auditory Cortex123

Abstract Successful speech perception in real-world environments requires that the auditory system segregate competing voices that overlap in frequency and time into separate streams. Vowels are major constituents of speech and are comprised of frequencies (harmonics) that are integer multiples of a common fundamental frequency (F0). The pitch and identity of a vowel are determined by its F0 and spectral envelope (formant structure), respectively. When two spectrally overlapping vowels differing in F0 are presented concurrently, they can be readily perceived as two separate “auditory objects” with pitches at their respective F0s. A difference in pitch between two simultaneous vowels provides a powerful cue for their segregation, which in turn, facilitates their individual identification. The neural mechanisms underlying the segregation of concurrent vowels based on pitch differences are poorly understood. Here, we examine neural population responses in macaque primary auditory cortex (A1) to single and double concurrent vowels (/a/ and /i/) that differ in F0 such that they are heard as two separate auditory objects with distinct pitches. We find that neural population responses in A1 can resolve, via a rate-place code, lower harmonics of both single and double concurrent vowels. Furthermore, we show that the formant structures, and hence the identities, of single vowels can be reliably recovered from the neural representation of double concurrent vowels. We conclude that A1 contains sufficient spectral information to enable concurrent vowel segregation and identification by downstream cortical areas.

[1]  C. Schroeder,et al.  A spatiotemporal profile of visual system activation revealed by current source density analysis in the awake macaque. , 1998, Cerebral cortex.

[2]  M. Steinschneider,et al.  Spectral resolution of monkey primary auditory cortex (A1) revealed with two-noise masking. , 2006, Journal of neurophysiology.

[3]  M. Sachs,et al.  Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate. , 1979, The Journal of the Acoustical Society of America.

[4]  B. Delgutte,et al.  Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. , 1996, Journal of neurophysiology.

[5]  J. Hillenbrand,et al.  A narrow band pattern-matching model of vowel perception. , 2003, The Journal of the Acoustical Society of America.

[6]  Charles H. Brown,et al.  A multidimensional scaling analysis of vowel discrimination in humans and monkeys , 1997 .

[7]  B. Shinn-Cunningham,et al.  Selective Attention in Normal and Impaired Hearing , 2008, Trends in amplification.

[8]  J. L. Goldstein An optimum processor theory for the central formation of the pitch of complex tones. , 1973, The Journal of the Acoustical Society of America.

[9]  Christoph Kayser,et al.  Tuning to sound frequency in auditory field potentials. , 2007, Journal of neurophysiology.

[10]  Jing Yu Wang,et al.  Representations of cat meows and human vowels in the primary auditory cortex of awake cats. , 2008, Journal of neurophysiology.

[11]  S. David,et al.  Rapid Synaptic Depression Explains Nonlinear Modulation of Spectro-Temporal Tuning in Primary Auditory Cortex by Natural Stimuli , 2009, The Journal of Neuroscience.

[12]  J. Culling,et al.  Perceptual and computational separation of simultaneous vowels: cues arising from low-frequency beating. , 1994, The Journal of the Acoustical Society of America.

[13]  J. Kaas,et al.  Tonotopic organization, architectonic fields, and connections of auditory cortex in macaque monkeys , 1993, The Journal of comparative neurology.

[14]  M Steinschneider,et al.  Consonance and dissonance of musical chords: neural correlates in auditory cortex of monkeys and humans. , 2001, Journal of neurophysiology.

[15]  M M Merzenich,et al.  Representation of a species-specific vocalization in the primary auditory cortex of the common marmoset: temporal and spectral characteristics. , 1995, Journal of neurophysiology.

[16]  Andrey A. Ptitsyn,et al.  Permutation test for periodicity in short time series data , 2006, BMC Bioinformatics.

[17]  R. Eckhorn,et al.  Stimulus-dependent modulations of correlated high-frequency oscillations in cat visual cortex. , 1997, Cerebral cortex.

[18]  J. Eggermont,et al.  The Neurophysiology of Auditory Perception: From Single Units to Evoked Potentials , 2002, Audiology and Neurotology.

[19]  C. Darwin,et al.  Perceptual separation of simultaneous vowels: within and across-formant grouping by F0. , 1993, The Journal of the Acoustical Society of America.

[20]  Xiaoqin Wang,et al.  Contribution of Inhibition to Stimulus Selectivity in Primary Auditory Cortex of Awake Primates , 2010, The Journal of Neuroscience.

[21]  N. Mesgarani,et al.  Selective cortical representation of attended speaker in multi-talker speech perception , 2012, Nature.

[22]  D. Bendor,et al.  Neural coding of temporal information in auditory thalamus and cortex , 2008, Neuroscience.

[23]  S McAdams,et al.  Identification of concurrent harmonic and inharmonic vowels: a test of the theory of harmonic cancellation and enhancement. , 1995, The Journal of the Acoustical Society of America.

[24]  G. Recanzone,et al.  Frequency and intensity response properties of single neurons in the auditory cortex of the behaving macaque monkey. , 2000, Journal of neurophysiology.

[25]  Gerald Langner,et al.  Periodicity coding in the auditory system , 1992, Hearing Research.

[26]  Kerry M. M. Walker,et al.  Neural Ensemble Codes for Stimulus Periodicity in Auditory Cortex , 2010, The Journal of Neuroscience.

[27]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[28]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[29]  R. Christopher deCharms,et al.  Primary cortical representation of sounds by the coordination of action-potential timing , 1996, Nature.

[30]  S. Cruikshank,et al.  Thalamocortical inputs trigger a propagating envelope of gamma-band activity in auditory cortex in vitro , 1999, Experimental Brain Research.

[31]  Q. Summerfield,et al.  Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. , 1990, The Journal of the Acoustical Society of America.

[32]  A. Oxenham Pitch Perception and Auditory Stream Segregation: Implications for Hearing Loss and Cochlear Implants , 2008, Trends in amplification.

[33]  Nima Mesgarani,et al.  Phoneme representation and classification in primary auditory cortex. , 2008, The Journal of the Acoustical Society of America.

[34]  J. Edeline,et al.  How do auditory cortex neurons represent communication sounds? , 2013, Hearing Research.

[35]  J. Kaas,et al.  Subdivisions of AuditoryCortex and Levels of Processing in Primates , 1998, Audiology and Neurotology.

[36]  Mitchell Steinschneider,et al.  Neural Correlates of Auditory Scene Analysis Based on Inharmonicity in Monkey Primary Auditory Cortex , 2010, The Journal of Neuroscience.

[37]  Christoph E Schreiner,et al.  Auditory Cortical Local Subnetworks Are Characterized by Sharply Synchronous Activity , 2013, The Journal of Neuroscience.

[38]  C. Nicholson,et al.  Theory of current source-density analysis and determination of conductivity tensor for anuran cerebellum. , 1975, Journal of neurophysiology.

[39]  C. Micheyl,et al.  Neural Representation of Harmonic Complex Tones in Primary Auditory Cortex of the Awake Monkey , 2013, The Journal of Neuroscience.

[40]  D B Moody,et al.  Formant frequency discrimination by Japanese macaques (Macaca fuscata). , 1992, The Journal of the Acoustical Society of America.

[41]  Olivier Bertrand,et al.  Neural Substrate of Concurrent Sound Perception: Direct Electrophysiological Recordings from Human Auditory Cortex , 2007, Frontiers in human neuroscience.

[42]  D. Bendor,et al.  Neural coding of temporal information in auditory thalamus and cortex , 2008, Neuroscience.

[43]  M. Merzenich,et al.  Representation of the cochlear partition of the superior temporal plane of the macaque monkey. , 1973, Brain research.

[44]  P. Roelfsema,et al.  Chronic multiunit recordings in behaving animals: advantages and limitations. , 2005, Progress in brain research.

[45]  Mitchell Steinschneider,et al.  Temporally dynamic frequency tuning of population responses in monkey primary auditory cortex , 2009, Hearing Research.

[46]  B. Delgutte,et al.  Pitch Representations in the Auditory Nerve: Two Concurrent Complex Tones Chair, Department Committee on Graduate Students , 2022 .

[47]  Mitchell Steinschneider,et al.  Coding of repetitive transients by auditory cortex on Heschl's gyrus. , 2009, Journal of neurophysiology.

[48]  Claude Alain,et al.  Age-related changes in neural activity associated with concurrent vowel segregation. , 2005, Brain research. Cognitive brain research.

[49]  Claude Alain Breaking the wave: Effects of attention and learning on concurrent sound perception , 2007, Hearing Research.

[50]  Kirill V. Nourski,et al.  Representation of speech in human auditory cortex: Is it special? , 2013, Hearing Research.

[51]  U. Mitzdorf,et al.  Functional anatomy of the inferior colliculus and the auditory cortex: current source density analyses of click-evoked potentials , 1984, Hearing Research.

[52]  Keith Johnson,et al.  Phonetic Feature Encoding in Human Superior Temporal Gyrus , 2014, Science.

[53]  J. Arezzo,et al.  Representation of the voice onset time (VOT) speech parameter in population responses within primary auditory cortex of the awake monkey. , 2003, The Journal of the Acoustical Society of America.

[54]  P F Assmann,et al.  Pitches of concurrent vowels. , 1994, The Journal of the Acoustical Society of America.

[55]  Michael K. Qin,et al.  Effects of Envelope-Vocoder Processing on F0 Discrimination and Concurrent-Vowel Identification , 2005, Ear and hearing.

[56]  D. Bendor,et al.  The neuronal representation of pitch in primate auditory cortex , 2005, Nature.

[57]  C. H. Brown,et al.  A multidimensional scaling analysis of vowel discrimination in humans and monkeys. , 1993, Perception & psychophysics.

[58]  Ian M. Winter,et al.  Reverberation impairs brainstem temporal representations of voiced vowel sounds: challenging “periodicity-tagged” segregation of competing speech in rooms , 2015, Front. Syst. Neurosci..

[59]  Mitchell Steinschneider,et al.  Neural Representation of Concurrent Harmonic Sounds in Monkey Primary Auditory Cortex: Implications for Models of Auditory Scene Analysis , 2014, The Journal of Neuroscience.

[60]  Stephen V. David,et al.  Mechanisms of noise robust representation of speech in primary auditory cortex , 2014, Proceedings of the National Academy of Sciences.

[61]  Roy D. Patterson,et al.  Locating the initial stages of speech–sound processing in human temporal cortex , 2006, NeuroImage.

[62]  Charles E. Schroeder,et al.  Dual Mechanism of Neuronal Ensemble Inhibition in Primary Auditory Cortex , 2011, Neuron.

[63]  G. V. Simpson,et al.  Cellular generators of the cortical auditory evoked potential initial component. , 1992, Electroencephalography and clinical neurophysiology.

[64]  Michael V Keebler,et al.  Pitch perception for mixtures of spectrally overlapping harmonic complex tones. , 2010, The Journal of the Acoustical Society of America.

[65]  A R Palmer,et al.  The representation of the spectra and fundamental frequencies of steady-state single- and double-vowel sounds in the temporal discharge patterns of guinea pig cochlear-nerve fibers. , 1990, The Journal of the Acoustical Society of America.

[66]  J. Sinnott,et al.  Detection and discrimination of synthetic English vowels by Old World monkeys (Cercopithecus, Macaca) and humans. , 1989, The Journal of the Acoustical Society of America.

[67]  Mitchell Steinschneider,et al.  Neural mechanisms of rhythmic masking release in monkey primary auditory cortex: implications for models of auditory scene analysis. , 2012, Journal of neurophysiology.

[68]  M. Sahani,et al.  The Consequences of Response Nonlinearities for Interpretation of Spectrotemporal Receptive Fields , 2008, The Journal of Neuroscience.

[69]  Ying-Yee Kong,et al.  Effects of Spectral Degradation on Attentional Modulation of Cortical Auditory Responses to Continuous Speech , 2015, Journal of the Association for Research in Otolaryngology.

[70]  D. Barth,et al.  Three-dimensional analysis of spontaneous and thalamically evoked gamma oscillations in auditory cortex. , 1998, Journal of neurophysiology.

[71]  Bertrand Delgutte,et al.  Behavioral / Systems / Cognitive Spatiotemporal Representation of the Pitch of Harmonic Complex Tones in the Auditory Nerve , 2010 .

[72]  E D Young,et al.  The representation of concurrent vowels in the cat anesthetized ventral cochlear nucleus: evidence for a periodicity-tagged spectral representation. , 1997, The Journal of the Acoustical Society of America.

[73]  T. Sejnowski,et al.  Regulation of spike timing in visual cortical circuits , 2008, Nature Reviews Neuroscience.

[74]  M M Merzenich,et al.  Temporal information transformed into a spatial code by a neural network with realistic properties , 1995, Science.

[75]  R. Fay,et al.  Pitch : neural coding and perception , 2005 .

[76]  W. Singer,et al.  Modulation of Neuronal Interactions Through Neuronal Synchronization , 2007, Science.

[77]  Hideki Kawahara,et al.  Missing-data model of vowel identification. , 1999, The Journal of the Acoustical Society of America.

[78]  J. Rauschecker,et al.  Vowel sound extraction in anterior superior temporal cortex , 2006, Human brain mapping.

[79]  J. E. Rose,et al.  Phase-locked response to low-frequency tones in single auditory nerve fibers of the squirrel monkey. , 1967, Journal of neurophysiology.

[80]  T. Sejnowski,et al.  Synchrony of Thalamocortical Inputs Maximizes Cortical Reliability , 2010, Science.

[81]  Kerry M. M. Walker,et al.  Multiplexed and Robust Representations of Sound Features in Auditory Cortex , 2011, The Journal of Neuroscience.

[82]  Wilkin Chau,et al.  Left thalamo-cortical network implicated in successful speech separation and identification , 2005, NeuroImage.

[83]  A S Bregman,et al.  The perceptual segregation of simultaneous vowels with harmonic, shifted, or random components , 1993, Perception & psychophysics.

[84]  A. de Cheveigné,et al.  Vowel-specific effects in concurrent vowel identification. , 1999, The Journal of the Acoustical Society of America.

[85]  C. Schroeder,et al.  Speech-evoked activity in primary auditory cortex: effects of voice onset time. , 1994, Electroencephalography and clinical neurophysiology.

[86]  B. Delgutte,et al.  Pitch of complex tones: rate-place and interspike interval representations in the auditory nerve. , 2005, Journal of neurophysiology.

[87]  C. Schreiner,et al.  Spectral envelope coding in cat primary auditory cortex: linear and non‐linear effects of stimulus characteristics , 1998, The European journal of neuroscience.

[88]  C. Nicholson,et al.  Experimental optimization of current source-density technique for anuran cerebellum. , 1975, Journal of neurophysiology.

[89]  M Steinschneider,et al.  Click train encoding in primary auditory cortex of the awake monkey: evidence for two mechanisms subserving pitch perception. , 1998, The Journal of the Acoustical Society of America.

[90]  O. Bertrand,et al.  Effects of Selective Attention on the Electrophysiological Representation of Concurrent Sounds in the Human Auditory Cortex , 2007, The Journal of Neuroscience.

[91]  J J Eggermont,et al.  Neural interaction in cat primary auditory cortex II. Effects of sound stimulation. , 1994, Journal of neurophysiology.

[92]  A. Oxenham,et al.  Pitch, harmonicity and concurrent sound segregation: Psychoacoustical and neurophysiological findings , 2010, Hearing Research.

[93]  Shihab A. Shamma,et al.  Patterns of inhibition in auditory cortical cells in awake squirrel monkeys , 1985, Hearing Research.