Listening for the Norm: Adaptive Coding in Speech Categorization

Perceptual aftereffects have been referred to as “the psychologist’s microelectrode” because they can expose dimensions of representation through the residual effect of a context stimulus upon perception of a subsequent target. The present study uses such context-dependence to examine the dimensions of representation involved in a classic demonstration of “talker normalization” in speech perception. Whereas most accounts of talker normalization have emphasized talker-, speech-, or articulatory-specific dimensions’ significance, the present work tests an alternative hypothesis: that the long-term average spectrum (LTAS) of speech context is responsible for patterns of context-dependent perception considered to be evidence for talker normalization. In support of this hypothesis, listeners’ vowel categorization was equivalently influenced by speech contexts manipulated to sound as though they were spoken by different talkers and non-speech analogs matched in LTAS to the speech contexts. Since the non-speech contexts did not possess talker, speech, or articulatory information, general perceptual mechanisms are implicated. Results are described in terms of adaptive perceptual coding.

[1]  A. Liberman,et al.  Tempo of frequency change as a cue for distinguishing classes of speech sounds. , 1956, Journal of experimental psychology.

[2]  D. Broadbent,et al.  Information Conveyed by Vowels , 1957 .

[3]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[4]  Kenneth N. Stevens,et al.  Speech recognition: A model and a program for research , 1962, IRE Trans. Inf. Theory.

[5]  J. Leather,et al.  Speaker normalization in perception of lexical tone , 1983 .

[6]  Virginia A. Mann,et al.  Distinguishing universal and language-dependent levels of speech perception: Evidence from Japanese listeners' perception of English “l” and “r” , 1986, Cognition.

[7]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[8]  Joanne L. Miller,et al.  Speech Perception , 1990, Springer Handbook of Auditory Research.

[9]  H. Barlow Vision: A theory about the functional role and synaptic mechanism of visual after-effects , 1991 .

[10]  A. J. Watkins,et al.  Perceptual compensation for speaker differences and for spectral-envelope distortion. , 1994, The Journal of the Acoustical Society of America.

[11]  A. J. Watkins,et al.  Effects of spectral contrast on perceptual compensation for spectral-envelope distortion. , 1996, The Journal of the Acoustical Society of America.

[12]  Richard S. McGowan Normalization for articulatory recovery , 1997 .

[13]  Corinne B. Moore,et al.  Speaker normalization in the perception of Mandarin Chinese tones. , 1997, The Journal of the Acoustical Society of America.

[14]  R S McGowan,et al.  Vocal tract normalization for midsagittal articulatory recovery with analysis-by-synthesis. , 1999, The Journal of the Acoustical Society of America.

[15]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[16]  P. Iverson,et al.  Vowel normalization for accent: an investigation of best exemplar locations in northern and southern British English sentences. , 2004, The Journal of the Acoustical Society of America.

[17]  L. Holt Temporally Nonadjacent Nonlinguistic Sounds Affect Speech Categorization , 2005, Psychological science.

[18]  B. Story A parametric model of the vocal tract area function for vowel and consonant simulation. , 2005, The Journal of the Acoustical Society of America.

[19]  G. Rhodes,et al.  Fitting the Mind to the World: Adaptation and after-effects in high-level vision , 2005 .

[20]  L. Holt,et al.  Perceptual effects of preceding nonspeech rate on temporal properties of speech categories , 2005, Perception & psychophysics.

[21]  Rachel A Robbins,et al.  Adaptation and Face Perception - How Aftereffects Implicate Norm-Based Coding of Faces , 2005 .

[22]  L. Holt Speech categorization in context: joint effects of nonspeech and speech precursors. , 2006, The Journal of the Acoustical Society of America.

[23]  Kenneth D. Miller,et al.  Adaptive filtering enhances information transmission in visual cortex , 2006, Nature.

[24]  L. Holt The mean matters: effects of statistically defined nonspeech spectral distributions on speech categorization. , 2006, The Journal of the Acoustical Society of America.

[25]  H. Nusbaum,et al.  Acoustic differences, listener expectations, and the perceptual accommodation of talker variability. , 2007, Journal of experimental psychology. Human perception and performance.

[26]  D. Poeppel,et al.  Speech perception at the interface of neurobiology and linguistics , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[27]  Valentin Dragoi,et al.  Adaptive coding of visual information in neural populations , 2008, Nature.

[28]  L. Holt,et al.  General perceptual contributions to lexical tone normalization. , 2009, The Journal of the Acoustical Society of America.