Hearing Two Things at Once: Neurophysiological Indices of Speech Segregation and Identification

The discrimination of concurrent sounds is paramount to speech perception. During social gatherings, listeners must extract information from a composite acoustic wave, which sums multiple individual voices that are simultaneously active. The observers' ability to identify two simultaneously presented vowels improves with increasing separation between the fundamental frequencies (f0) of the two vowels. Event-related potentials to stimuli presented during attend and ignore conditions revealed activity between 130 and 170 msec after sound onset that reflected the f0 differences between the two vowels. Another, more posterior and right-lateralized, negative wave maximal at 250 msec, and a central-parietal slow negativity were observed only during vowel identification and may index stimulus categorization. This sequence of neural events supports a multistage model of auditory scene analysis in which the spectral pattern of each vowel constituent is automatically extracted and then matched against representations of those vowels in working memory.

[1]  E. Donchin,et al.  Is the P300 component a manifestation of context updating? , 1988, Behavioral and Brain Sciences.

[2]  L Collet,et al.  Perceptual auditory stream segregation of sequences of complex sounds in subjects with normal and impaired hearing , 2001, British journal of audiology.

[3]  T. Elbert,et al.  Cortical representation of vowels reflects acoustic dissimilarity determined by formant frequencies. , 2003, Brain research. Cognitive brain research.

[4]  Claude Alain,et al.  Neural activity associated with distinguishing concurrent auditory objects. , 2002, The Journal of the Acoustical Society of America.

[5]  C L Mackersie,et al.  The role of sequential stream segregation and frequency selectivity in the perception of simultaneous sentences by listeners with sensorineural hearing loss. , 2001, Journal of speech, language, and hearing research : JSLHR.

[6]  R Plomp,et al.  Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. , 1992, The Journal of the Acoustical Society of America.

[7]  Claude Alain,et al.  Perceptual learning modulates sensory evoked response during vowel segregation. , 2003, Brain research. Cognitive brain research.

[8]  H Scheich,et al.  Orderly cortical representation of vowels based on formant interaction. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[9]  A R Palmer,et al.  The representation of the spectra and fundamental frequencies of steady-state single- and double-vowel sounds in the temporal discharge patterns of guinea pig cochlear-nerve fibers. , 1990, The Journal of the Acoustical Society of America.

[10]  R Plomp,et al.  The effect of head-induced interaural time and level differences on speech intelligibility in noise. , 1987, The Journal of the Acoustical Society of America.

[11]  A. Treisman THE EFFECT OF IRRELEVANT MATERIAL ON THE EFFICIENCY OF SELECTIVE LISTENING. , 1964, The American journal of psychology.

[12]  W M Hartmann,et al.  Pitch, periodicity, and auditory organization. , 1996, The Journal of the Acoustical Society of America.

[13]  R Hari,et al.  Auditory stream segregation in dyslexic adults. , 1999, Brain : a journal of neurology.

[14]  B C Moore,et al.  Perceptual grouping of tone sequences by normally hearing and hearing-impaired listeners. , 1997, The Journal of the Acoustical Society of America.

[15]  R D Patterson,et al.  The time course of auditory segregation: concurrent vowels that vary in duration. , 1995, The Journal of the Acoustical Society of America.

[16]  G. Kidd,et al.  The effect of spatial separation on informational and energetic masking of speech. , 2002, The Journal of the Acoustical Society of America.

[17]  Claude Alain,et al.  Effects of Attentional Load on Auditory Scene Analysis , 2003, Journal of Cognitive Neuroscience.

[18]  S. Hillyard,et al.  Electrical Signs of Selective Attention in the Human Brain , 1973, Science.

[19]  Claude Alain,et al.  Representation of concurrent acoustic objects in primary auditory cortex. , 2004, The Journal of the Acoustical Society of America.

[20]  Antoine J. Shahin,et al.  Enhancement of Neuroplastic P2 and N1c Auditory Evoked Potentials in Musicians , 2003, The Journal of Neuroscience.

[21]  D Friedman,et al.  A brain event related to the making of a sensory discrimination. , 1979, Science.

[22]  Paavo Alku,et al.  The auditory N1m reveals the left-hemispheric representation of vowel identity in humans , 2003, Neuroscience Letters.

[23]  C. Darwin,et al.  Perceptual separation of simultaneous vowels: within and across-formant grouping by F0. , 1993, The Journal of the Acoustical Society of America.

[24]  J. L. Goldstein,et al.  Mechanisms of signal analysis and pattern perception in periodicity pitch. , 1978, Audiology : official organ of the International Society of Audiology.

[25]  S. G. Nooteboom,et al.  Intonation and the perceptual separation of simultaneous voices , 1982 .

[26]  John C. Webster,et al.  Responding to One of Two Simultaneous Messages , 1954 .

[27]  A. Bregman,et al.  The perceptual segregation of simultaneous auditory signals: Pulse train segregation and vowel segregation , 1989, Perception & psychophysics.

[28]  T. Carrell,et al.  Central auditory system plasticity: generalization to novel stimuli following listening training. , 1997, The Journal of the Acoustical Society of America.

[29]  Stuart Gatehouse,et al.  Perceptual segregation of competing speech sounds: the role of spatial location. , 1999, The Journal of the Acoustical Society of America.

[30]  D. Kurtzberg,et al.  The effects of decreased audibility produced by high-pass noise masking on cortical event-related potentials to speech sounds/ba/and/da. , 1997, The Journal of the Acoustical Society of America.

[31]  E D Young,et al.  The representation of concurrent vowels in the cat anesthetized ventral cochlear nucleus: evidence for a periodicity-tagged spectral representation. , 1997, The Journal of the Acoustical Society of America.

[32]  P. Berg,et al.  The correction of ocular artifacts: a topographic perspective , 2000, Clinical Neurophysiology.

[33]  Q. Summerfield,et al.  Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. , 1990, The Journal of the Acoustical Society of America.

[34]  T. Griffiths,et al.  The planum temporale as a computational hub , 2002, Trends in Neurosciences.

[35]  Q Summerfield,et al.  The contribution of waveform interactions to the perception of concurrent vowels. , 1994, The Journal of the Acoustical Society of America.

[36]  P F Assmann,et al.  Pitches of concurrent vowels. , 1994, The Journal of the Acoustical Society of America.

[37]  Alan R. Palmer,et al.  Psychophysical and Physiological Advances in Hearing , 1998 .

[38]  C Alain,et al.  Age-related changes in detecting a mistuned harmonic. , 2001, The Journal of the Acoustical Society of America.

[39]  W M Hartmann,et al.  The pitch of a mistuned harmonic: evidence for a template model. , 1998, The Journal of the Acoustical Society of America.

[40]  R. Verleger On the utility of P3 latency as an index of mental chronometry. , 1997, Psychophysiology.

[41]  H G Vaughan,et al.  Manipulation of event-related potential manifestations of information processing stages. , 1982, Science.

[42]  P Berg,et al.  A multiple source approach to the correction of eye artifacts. , 1994, Electroencephalography and clinical neurophysiology.

[43]  T. Picton,et al.  Bottom-up and top-down influences on auditory scene analysis: evidence from event-related brain potentials. , 2001, Journal of experimental psychology. Human perception and performance.

[44]  C. Petkov,et al.  Auditory scene analysis in dyslexics , 2000, Neuroreport.

[45]  J. Bird Effects of a difference in fundamental frequency in separating two sentences. , 1997 .

[46]  B. Delgutte,et al.  Neural correlates of the pitch of complex tones. II. Pitch shift, pitch ambiguity, phase invariance, pitch circularity, rate pitch, and the dominance region for pitch. , 1996, Journal of neurophysiology.

[47]  D A Bodnar,et al.  Midbrain combinatorial code for temporal and spectral information in concurrent acoustic signals. , 1999, Journal of neurophysiology.

[48]  B. Delgutte,et al.  Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. , 1996, Journal of neurophysiology.