On the perception of speech from time-varying acoustic information: Contributions of amplitude variation

The cyclic variation in the energy envelope of the speech signal results from the production of speech in syllables. This acoustic property is often identified as a source of information in the perception of syllable attributes, though spectral variation can also provide this information reliably. In the present study of the relative contributions of the energy and spectral envelopes in speech perception, we employed sinusoidal replicas of utterances, which permitted us to examine the roles of these acoustic properties in establishing or maintaining time-varying perceptual coherence. Three experiments were carried out to assess the independent perceptual effects of variation in sinusoidal amplitude and frequency, using sentence-length signals. In Experiment 1, we found that the fine grain of amplitude variation was not necessary for the perception of segmental and suprasegmental linguistic attributes; in Experiment 2, we found that amplitude variation was nonetheless effective in influencing syllable perception, and that in some circumstances it was crucial to segmental perception; in Experiment 3, we observed that coarse-grain amplitude variation, above all, proved to be extremely important in phonetic perception. We conclude that in perceiving sinusoidal replicas, the perceiver derives much from following the coherent pattern of frequency variation and gross signal energy, but probably derives rather little from tracking the precise details of the energy envelope. These findings encourage the view that the perceiver uses time-varying acoustic properties selectively in understanding speech.

[1]  J. P. Egan Articulation testing methods , 1948, The Laryngoscope.

[2]  D. J. Foss,et al.  On the Role of Sentence Stress in Sentence Processing , 1977, Language and speech.

[3]  R E Remez,et al.  Perceptual normalization of vowels produced by sinusoidal voices. , 1987, Journal of experimental psychology. Human perception and performance.

[4]  Paul A. Luce,et al.  Time-varying features of initial stop consonants in auditory running spectra: A first report , 1984, Perception & psychophysics.

[5]  Christopher J. Darwin,et al.  Pitch continuity and speech source attribution , 1977 .

[6]  D. Whalen Subcategorical phonetic mismatches slow phonetic judgments , 1984, Perception & psychophysics.

[7]  A shift in formant frequencies is not the same as a shift in the center of gravity of a multiformant energy concentration , 1985 .

[8]  P. Price,et al.  Sonority and Syllabicity: Acoustic Correlates of Perception , 1980, Phonetica.

[9]  Anne Cutler,et al.  The role of strong syllables in segmentation for lexical access , 1988 .

[10]  J. C. R. Licklider,et al.  Effects of Amplitude Distortion upon the Intelligibility of Speech , 1946 .

[11]  P E Rubin,et al.  On the perception of intonation from sinusoidal sentences , 1984, Perception & psychophysics.

[12]  L. Nakatani,et al.  Hearing "words" without words: prosodic cues for word perception. , 1978, The Journal of the Acoustical Society of America.

[13]  Fant Cg Descriptive analysis of the acoustic aspects of speech. , 1962 .

[14]  Robert E. Remez,et al.  Units of Organization and Analysis in the Perception of Speech , 1987 .

[15]  D. Pisoni,et al.  Speech perception without traditional speech cues. , 1981, Science.

[16]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[17]  Franklin S. Cooper,et al.  IN SEARCH OF THE ACOUSTIC CUES , 1972 .

[18]  Catherine T. Best,et al.  Discovering phonetic coherence in acoustic patterns , 1989, Perception & psychophysics.

[19]  C. Fant,et al.  Descriptive analysis of the acoustic aspects of speech. , 1962, Logos.

[20]  Jean‐Pierre A. Radley,et al.  Acoustic Properties of Stop Consonants , 1957 .

[21]  A M Liberman,et al.  Perceptual equivalence of two acoustic cues for stop-consonant manner , 1980, Perception & psychophysics.

[22]  P. Rubin,et al.  The stream of speech. , 1983 .

[23]  J. Jenkins,et al.  Identification of vowels in “vowelless” syllables , 1983, Perception & psychophysics.

[24]  Q Summerfield,et al.  Information in speech: observations on the perception of [s]-stop clusters. , 1980, Journal of experimental psychology. Human perception and performance.