Auditory accounts of temporal factors in the perception of Norwegian disyllables and speech analogs

Abstract The first part of this study investigates temporal factors in the perception of V:C vs. VC: rhymes in Norwegian. To that end, a listening test was performed with stimuli of the form /mV:Ce/ vs. /mVC:e/ with varying vowel, consonant closure, and schwa durations. For a group of native speakers, boundaries in the perception of /V:/-/V/ were established. Both longer consonantal closure and longer schwa duration appeared to cause a perceptual shortening of the vowel. These results were interpreted by appealing to Kingston & Diehl's (1994) duration ratio hypothesis, which states that the durations of a vowel and a following consonant are mutually enhancing acoustic cues. In two listening tests on speech analogs, listeners judged the duration of the first tone in tone-gap-tone sequences. Mimicking the speech stimuli, these sequences featured varying gap and second-tone durations. Longer durations of these two signal parts turned out to perceptually lengthen the first tone. The divergent results for the speech as opposed to the non-speech stimuli were explained by assuming different perceptual strategies for the two types of signals: while in speech listeners can rely on well-established temporal patterns, such a framework is absent in the case of non-speech.
