A critical examination of the spectral contrast account of compensation for coarticulation

Vocal tract gestures for adjacent phones overlap temporally, rendering the acoustic speech signal highly context dependent. For example, following a segment with an anterior place of articulation, a posterior segment’s place of articulation is pulled frontward, and listeners’ category boundaries shift appropriately. Some theories assume that listeners perceptually attune to or compensate for coarticulatory context. An alternative is that the boundary shifts result from spectral contrast. Indeed, shifts occur when speech precursors are replaced by pure tones that are frequency-matched to the formant offset at the assumed locus of contrast (Lotto & Kluender, 1998). However, tone analogues differ from natural formants in several ways, raising the possibility that the conditions for contrast may not exist in natural speech. When we matched tones to natural formant intensities and trajectories, boundary shifts diminished. When we presented only the critical spectral region of natural speech tokens, no compensation was observed. These results suggest that the conditions for spectral contrast do not exist in typical speech.

[1]  V. Mann, et al. Contrast effects do not underlie effects of preceding liquids on stop-consonant identification by humans, 2000, Journal of Experimental Psychology: Human Perception and Performance.

[2]  V. Mann. Influence of preceding liquid on stop-consonant perception, 1980, Perception & Psychophysics.

[3]  Lori L. Holt, et al. A critical evaluation of visually moderated phonetic context effects, 2005, Perception & Psychophysics.

[4]  Joanne L. Miller, et al. Speech Perception, 1990, Springer Handbook of Auditory Research.

[5]  Joseph D. W. Stephens, et al. Preceding phonetic context affects perception of nonspeech, 2003, The Journal of the Acoustical Society of America.

[6]  C. Fowler. Compensation for coarticulation reflects gesture perception, not spectral contrast, 2006, Perception & Psychophysics.

[7]  J. Perkell, et al. Invariance and variability in speech processes, 1987.

[8]  C. A. Fowler, et al. Sound-producing sources as objects of perception: rate normalization and nonspeech perception, 1990, The Journal of the Acoustical Society of America.

[9]  Holger Mitterer, et al. On the causes of compensation for coarticulation: Evidence for phonological mediation, 2006, Perception & Psychophysics.

[10]  V. Mann, et al. Influence of preceding fricative on stop consonant perception, 1981, The Journal of the Acoustical Society of America.

[11]  A. Lotto, et al. General contrast effects in speech perception: Effect of preceding liquid on stop consonant identification, 1998, Perception & Psychophysics.

[12]  C. Fowler, et al. Compensation for coarticulation: disentangling auditory and gestural theories of perception of coarticulatory effects in speech, 2010, Journal of Experimental Psychology: Human Perception and Performance.

[13]  L. Holt, et al. Effects of later-occurring nonlinguistic sounds on speech categorization, 2005, The Journal of the Acoustical Society of America.

[14]  Kim E. A. Silverman, et al. F₀ Segmental Cues Depend on Intonation: The Case of the Rise after Voiced Stops, 1986.

[15]  Joseph D. W. Stephens, et al. Preceding phonetic context affects perception of nonspeech (L), 2003.

[16]  Paul Boersma, et al. Praat, a system for doing phonetics by computer, 2002.

[17]  A. Lotto, et al. Putting phonetic context effects into context: A commentary on Fowler (2006), 2006, Perception & Psychophysics.

[18]  A. Lotto, et al. Neighboring spectral content influences vowel identification, 2000, The Journal of the Acoustical Society of America.