Studies on fast speech have shown that word-level timing of fast speech differs from that of normal rate speech in that unstressed syllables are shortened more than stressed syllables as speech rate increases. An earlier experiment showed that the intelligibility of time-compressed speech could not be improved by making its temporal organisation closer to natural fast speech. To test the hypothesis that segmental intelligibility is more important than prosodic timing in listening to timecompressed speech, the intelligibility of bisyllabic words was tested in three time-compression conditions: either stressed and unstressed syllable were compressed to the same degree, or the stressed syllable was compressed more than the unstressed syllable, or the reverse. As was found before, imitating wordlevel timing of fast speech did not improve intelligibility over linear compression. However, the results did not confirm the hypothesis either: there was no difference in intelligibility between the three compression conditions. We conclude that segmental intelligibility plays an important role, but further research is necessary to decide between the contributions of prosody and segmental intelligibility to the word-level intelligibility of time-compressed speech.
[1]
Jan P. H. van Santen,et al.
Assignment of segmental duration in text-to-speech synthesis
,
1994,
Comput. Speech Lang..
[2]
Esther Janse,et al.
Fast speech timing in Dutch: durational correlates of lexical stress and pitch accent
,
2000,
INTERSPEECH.
[3]
Malcolm Slaney,et al.
MACH 1 FOR NONUNIFORM TIME-SCALE MODIFICATION OF SPEECH : THEORY , TECHNIQUE , AND COMPARISONS
,
1998
.
[4]
Ann Cutler,et al.
Prosody in the Comprehension of Spoken Language: A Literature Review
,
1997,
Language and speech.
[5]
J. Klein,et al.
Syntactic structure and acoustic pattern in speech perception Arthur Wingfield
,
1971
.
[6]
A Wingfield,et al.
Prosodic features and the intelligibility of accelerated speech: syntactic versus periodic segmentation.
,
1984,
Journal of speech and hearing research.
[7]
S G Nooteboom,et al.
Production and perception of vowel length in spoken sentences.
,
1980,
Journal of the Acoustical Society of America.
[8]
D. Pisoni,et al.
Recognizing Spoken Words: The Neighborhood Activation Model
,
1998,
Ear and hearing.