On the internal perceptual structure of distinctive features: The [voice] contrast

Several fixed classification experiments test the hypothesis that F(1), f(0), and closure voicing covary between intervocalic stops contrasting for [voice] because they integrate perceptually. The perceptual property produced by the integration of these acoustic properties was at first predicted to be the presence of low frequency energy in the vicinity of the stop, which is considerable in [+voice] stops but slight in [-voice] stops. Both F(1) and f(0) at the edges of vowels flanking the stop were found to integrate perceptually with the continuation of voicing into the stop, but not to integrate with one another. These results indicate that the perceptually relevant property is instead the continuation of low frequency energy across the vowel-consonant border and not merely the amount of low frequency energy present near the stop. Other experiments establish that neither F(1) nor f(0) at vowel edge integrate perceptually with closure duration, which shows that only auditorily similar properties integrate and not any two properties that reliably covary. Finally, the experiments show that these acoustic properties integrate perceptually (or fail to) in the same way in non-speech analogues as in the original speech. This result indicates that integration arises from the auditory similarity of certain acoustic correlates of the [voice] contrast.

[1]  T. M. Nearey,et al.  Speech perception as pattern recognition. , 1997, The Journal of the Acoustical Society of America.

[2]  Mark Haggard,et al.  Psychoacoustical and cultural determinants of phoneme boundaries: evidence from trading F0 cues in the voiced–voiceless distinction , 1981 .

[3]  Gerard James Docherty An experimental phonetic study of the timing of voicing in English obstruents. , 1989 .

[4]  D H Whalen,et al.  Gradient Effects of Fundamental Frequency on Stop Consonant Voicing Judgments , 1990, Phonetica.

[5]  S. Blumstein,et al.  Perceptual invariance and onset spectra for stop consonants in different vowel environments , 1976 .

[6]  Randy L. Diehl,et al.  First formant spectral properties and initial stop–consonant [voice] judgments. , 1996 .

[7]  José Benkí,et al.  Place of articulation and first formant transition pattern both affect perception of voicing in English , 2001, J. Phonetics.

[8]  P. Denes Effect of Duration on the Perception of Voicing , 1955 .

[9]  Terrance M. Nearey,et al.  The segment as a unit of speech perception , 1990 .

[10]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[11]  Catherine T. Best,et al.  Perceptual equivalence of acoustic cues in speech and nonspeech perception , 1981, Perception & psychophysics.

[12]  Ashby Fg,et al.  Integrating information from separable psychological dimensions. , 1990 .

[13]  S. Blumstein,et al.  A reconsideration of acoustic invariance for place of articulation in diffuse stop consonants: evidence from a cross-language study. , 1981, The Journal of the Acoustical Society of America.

[14]  R. Port,et al.  Consonant/vowel ratio as a cue for voicing in English , 1982, Perception & psychophysics.

[15]  R. Diehl,et al.  An auditory basis for the stimulus-length effect in the perception of stops and glides. , 1989, The Journal of the Acoustical Society of America.

[16]  S. Blumstein,et al.  Perceptual invariance and onset spectra for stop consonants in different vowel environments. , 1980, The Journal of the Acoustical Society of America.

[17]  A M Liberman,et al.  Perceptual equivalence of two acoustic cues for stop-consonant manner , 1980, Perception & psychophysics.

[18]  D. Massaro,et al.  The contribution of fundamental frequency and voice onset time to the /zi/-/si/ distinction. , 1976, The Journal of the Acoustical Society of America.

[19]  R. Diehl,et al.  Phonology and Phonetic Evidence: Intermediate properties in the perception of distinctive feature values , 1995 .

[20]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[21]  Michelle Caisse Cross‐linguistic differences in fundamental frequency perturbation induced by voiceless unaspirated stops , 1981 .

[22]  R L Diehl,et al.  On the interpretability of speech/nonspeech comparisons: a reply to Fowler. , 1991, The Journal of the Acoustical Society of America.

[23]  R L Diehl,et al.  Effect of Fundamental Frequency on Medial [+Voice] / [–Voice] Judgments , 1995, Phonetica.

[24]  N. Perrin,et al.  Varieties of perceptual independence. , 1986, Psychological review.

[25]  Dennis H. Klatt,et al.  Software for a cascade/parallel formant synthesizer , 1980 .

[26]  P. Denes On the Motor Theory of Speech Perception , 1965 .

[27]  J Kingston,et al.  Integrality in the perception of tongue root position and voice quality in vowels. , 1997, The Journal of the Acoustical Society of America.

[28]  C A Fowler,et al.  Auditory perception is not special: we see the world, we feel the world, we hear the world. , 1991, The Journal of the Acoustical Society of America.

[29]  D B Pisoni,et al.  Perception of static and dynamic acoustic cues to place of articulation in initial stop consonants. , 1983, The Journal of the Acoustical Society of America.

[30]  K. Kohler,et al.  Dimensions in the Perception of Fortis and Lenis Plosives , 1979, Phonetica.

[31]  Carol A. Fowler,et al.  Vowel duration and closure duration in voiced and unvoiced stops: there are no contrast effects here , 1992 .

[32]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[33]  Keith R. Kluender,et al.  Speech perception as a tractable problem in cognitive science. , 1994 .

[34]  S. Blumstein,et al.  Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants. , 1979, The Journal of the Acoustical Society of America.

[35]  L. Lisker,et al.  Letter: Is it VOT or a first-formant transition detector? , 1975, The Journal of the Acoustical Society of America.

[36]  Peter D. Eimas,et al.  Perspectives on the study of speech , 1981 .

[37]  J Kingston,et al.  Integrality of nasalization and F1. II. Basic sensitivity and phonetic labeling measure distinct sensory and decision-rule interactions. , 1999, The Journal of the Acoustical Society of America.

[38]  M. Haggard,et al.  Pitch as a voicing cue. , 1970, The Journal of the Acoustical Society of America.

[39]  K. Stevens,et al.  Role of formant transitions in the voiced-voiceless distinction for stops. , 1974, The Journal of the Acoustical Society of America.

[40]  John Kingston,et al.  On the internal perceptual structure of phonological features: The [voice] distinction , 1995 .

[41]  John Kingston,et al.  Integrality of nasalization and F1 in vowels in isolation and before oral and nasal consonants: A detection‐theoretic application of the Garner paradigm , 1995 .

[42]  W. R. Garner The Processing of Information and Structure , 1974 .

[43]  B. Repp Phonetic trading relations and context effects: new experimental evidence for a speech mode of perception. , 1982, Psychological bulletin.

[44]  R. Nosofsky,et al.  Integrating information from separable psychological dimensions. , 1990, Journal of experimental psychology. Human perception and performance.

[45]  Kim E. A. Silverman,et al.  F₀ Segmental Cues Depend on Intonation: The Case of the Rise after Voiced Stops , 1986 .

[46]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[47]  L. Lisker On buzzing the English /b/ , 1978 .

[48]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[49]  John Kingston,et al.  Papers in Laboratory Phonology: Index of names , 1990 .

[50]  C A Fowler,et al.  Sound-producing sources as objects of perception: rate normalization and nonspeech perception. , 1990, The Journal of the Acoustical Society of America.

[51]  A. Lotto,et al.  Influence of fundamental frequency on stop-consonant voicing perception: a case of learned covariation or auditory enhancement? , 1999, The Journal of the Acoustical Society of America.

[52]  L. Lisker,et al.  A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements , 1964 .

[53]  W. V. Summers,et al.  F1 structure provides information for final-consonant voicing. , 1988, The Journal of the Acoustical Society of America.

[54]  K. Kluender,et al.  Effects of first formant onset properties on voicing judgments result from processes not specific to humans. , 1991, The Journal of the Acoustical Society of America.

[55]  Randy L. Diehl,et al.  Effects of fundamental frequency on medial and final [voice] judgments , 1996 .

[56]  L. Raphael Preceding vowel duration as a cue to the perception of the voicing characteristic of word-final consonants in American English. , 1972, The Journal of the Acoustical Society of America.

[57]  W. Todd Maddox,et al.  Perceptual and decisional separability. , 1992 .

[58]  L. Lisker Closure Duration and the Intervocalic Voiced-Voiceless Distinction in English , 1957 .

[59]  C A Fowler,et al.  Listeners do hear sounds, not tongues. , 1996, The Journal of the Acoustical Society of America.

[60]  S. Blumstein,et al.  Invariant cues for place of articulation in stop consonants. , 1978, The Journal of the Acoustical Society of America.

[61]  John Kingston,et al.  Resonance versus source characteristics in perceiving spectral continuity between vowels and consonants , 1990 .

[62]  C. Fowler An event approach to the study of speech perception from a direct realist perspective , 1986 .

[63]  L. Lisker “Voicing” in English: A Catalogue of Acoustic Features Signaling /b/ Versus /p/ in Trochees , 1986, Language and speech.

[64]  Q. Summerfield,et al.  On the dissociation of spectral and temporal cues to the voicing distinction in initial stop consonants. , 1977, The Journal of the Acoustical Society of America.

[65]  R. Diehl,et al.  Trading relations in speech and nonspeech , 1986, Perception & psychophysics.

[66]  R. N. Ohde,et al.  Spectral and duration properties of front vowels as cues to final stop-consonant voicing. , 1990, The Journal of the Acoustical Society of America.