Categorization and discrimination of nonspeech sounds: differences between steady-state and rapidly-changing acoustic cues.

Different patterns of performance across vowels and consonants in tests of categorization and discrimination indicate that vowels tend to be perceived more continuously, or less categorically, than consonants. The present experiments examined whether analogous differences in perception would arise in nonspeech sounds that share critical transient acoustic cues of consonants and steady-state spectral cues of simplified synthetic vowels. Listeners were trained to categorize novel nonspeech sounds varying along a continuum defined by a steady-state cue, a rapidly-changing cue, or both cues. Listeners' categorization of stimuli varying on the rapidly changing cue showed a sharp category boundary and posttraining discrimination was well predicted from the assumption of categorical perception. Listeners more accurately discriminated but less accurately categorized steady-state nonspeech stimuli. When listeners categorized stimuli defined by both rapidly-changing and steady-state cues, discrimination performance was accurate and the categorization function exhibited a sharp boundary. These data are similar to those found in experiments with dynamic vowels, which are defined by both steady-state and rapidly-changing acoustic cues. A general account for the speech and nonspeech patterns is proposed based on the supposition that the perceptual trace of rapidly-changing sounds decays faster than the trace of steady-state sounds.

[1]  D J Van Tasell,et al.  Speech waveform envelope cues for consonant recognition. , 1987, The Journal of the Acoustical Society of America.

[2]  A. Ades,et al.  Vowels, Consonants, Speech, and Nonspeech. , 1977 .

[3]  W T Maddox,et al.  Response time distributions in multidimensional perceptual categorization , 1998, Perception & psychophysics.

[4]  B. Auditory and phonetic memory codes in the discrimination of consonants and vowels * , 2022 .

[5]  Thresholds for formant-frequency discrimination of vowels in consonantal context. , 1991, The Journal of the Acoustical Society of America.

[6]  S Fusi,et al.  Forming classes by stimulus frequency: Behavior and theory , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[8]  D. Pisoni,et al.  Identification and discrimination of rise time: is it categorical or noncategorical? , 1984, The Journal of the Acoustical Society of America.

[9]  N. Cowan,et al.  The use of auditory and phonetic memory in vowel discrimination. , 1986, The Journal of the Acoustical Society of America.

[10]  A. Jongman,et al.  Acoustic characteristics of English fricatives. , 2000, The Journal of the Acoustical Society of America.

[11]  B H Repp,et al.  Context independence and phonetic mediation in categorical perception. , 1982, Journal of experimental psychology. Human perception and performance.

[12]  Neil A. Macmillan,et al.  Beyond the categorical/continuous distinction: A psychophysical approach to processing modes. , 1987 .

[13]  S E Blumstein,et al.  Further evidence of acoustic invariance in speech production: the stop-glide contrast. , 1983, The Journal of the Acoustical Society of America.

[14]  James E. Cutting,et al.  Two left-hemisphere mechanisms in speech perception , 1974 .

[15]  B. C. Griffith,et al.  The discrimination of speech sounds within and across phoneme boundaries. , 1957, Journal of experimental psychology.

[16]  J E Cutting,et al.  Plucks and bows are categorically perceived, sometimes , 1982, Perception & psychophysics.

[17]  C. Watson,et al.  Formant-frequency discrimination for isolated English vowels. , 1994, The Journal of the Acoustical Society of America.

[18]  T. M. Nearey,et al.  Identification of resynthesized /hVd/ utterances: effects of formant contour. , 1999, The Journal of the Acoustical Society of America.

[19]  Y. Samson,et al.  Lateralization of Speech and Auditory Temporal Processing , 1998, Journal of Cognitive Neuroscience.

[20]  D. Pisoni,et al.  Reaction times to comparisons within and across phonetic categories , 1974, Perception & psychophysics.

[21]  P. Tallal,et al.  Rate of acoustic change may underlie hemispheric specialization for speech perception , 1980, Science.

[22]  H. Heffner,et al.  Temporal lobe lesions and perception of species-specific vocalizations by macaques. , 1984, Science.

[23]  T. M. Nearey,et al.  Effects of consonant environment on vowel formant patterns. , 1997, The Journal of the Acoustical Society of America.

[24]  P. D. Eimas,et al.  The Relation between Identification and Discrimination along Speech and Non-Speech Continua , 1963 .

[25]  R. Zatorre,et al.  Structure and function of auditory cortex: music and speech , 2002, Trends in Cognitive Sciences.

[26]  M. Hauser,et al.  Left hemisphere dominance for processing vocalizations in adult, but not infant, rhesus monkeys: field experiments. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[27]  D Kewley-Port,et al.  Vowel formant discrimination: towards more ordinary listening conditions. , 1999, The Journal of the Acoustical Society of America.

[28]  Robert J Zatorre,et al.  Anatomical Correlates of Learning Novel Speech Sounds , 2002, Neuron.

[29]  David Poeppel,et al.  The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time' , 2003, Speech Commun..

[30]  R L Diehl,et al.  Formant Transition Duration and Amplitude Rise Time as Cues to the Stop/Glide Distinction , 1991, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[31]  W. Strange,et al.  Identification of coarticulated vowels. , 1980, The Journal of the Acoustical Society of America.

[32]  Brian L. Scott,et al.  Burst Cues, Transition Cues, and Hemispheric Specialization with Real Speech Sounds , 1975 .

[33]  Jessica Maye,et al.  Infant sensitivity to distributional information can affect phonetic discrimination , 2002, Cognition.

[34]  L. Lisker,et al.  A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements , 1964 .

[35]  K. Armitage,et al.  Genetic Variation in Social Mammals: The Marmot Model , 1980, Science.

[36]  M. Hauser,et al.  Orienting asymmetries in rhesus monkeys: the effect of time-domain changes on acoustic perception , 1998, Animal Behaviour.

[37]  R L Diehl,et al.  Perception of vowel height: the role of F1-F0 distance. , 1994, The Journal of the Acoustical Society of America.

[38]  Neil A. Macmillan,et al.  Resolution for speech sounds: basic sensitivity and context memory on vowel and consonant continua , 1988 .

[39]  B. Repp Categorical Perception: Issues, Methods, Findings , 1984 .

[40]  Winifred Strange,et al.  Consonant environment specifies vowel identity. , 1974, The Journal of the Acoustical Society of America.

[41]  A J van Hessen,et al.  Modeling phoneme perception. I: Categorical perception. , 1992, The Journal of the Acoustical Society of America.

[42]  R. Miller Auditory Tests with Synthetic Vowels , 1951 .

[43]  C. C. Wood Discriminability, response bias, and phoneme categories in discrimination of voice onset time. , 1976, The Journal of the Acoustical Society of America.

[44]  D. Shankweiler,et al.  Consonant environment specifies vowel identity. , 1976, The Journal of the Acoustical Society of America.