Perceiving speech rate differences between natural and time-scale modified utterances

The effect of time-compression and -expansion on the perception of speech rate differences is investigated. Natural utterances were compared with modified versions time-scaled to the same duration. A set of ten German sentences was produced by one native speaker at slow and fast speed. In a forced choice discrimination task 15 participants were asked to select the faster one of two versions of the same sentence. In the case of low speech rate, versions that had been slowed down were perceived as slower than the corresponding natural utterances, whereas at high speech rates, stimuli with increased speed were judged as relatively faster. The effect turned out to be stronger for the slow stimuli. These findings suggest that the underlying articulatory effort plays an important role in the perception of speech rate.

[1]  W. L. Nelson Physical principles for economies of skilled movements , 1983, Biological Cybernetics.

[2]  Sumio Ohno,et al.  A method for quantitative analysis of the local speech rate , 1995, EUROSPEECH.

[3]  K. Moll,et al.  A cineradiographic study of VC and CV articulatory velocities , 1976 .

[4]  Malcolm Slaney,et al.  MACH1: nonuniform time-scale modification of speech , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  Hansjörg Mixdorff,et al.  Analysing fundamental frequency contours and local speech rate in map task dialogs , 2005, Speech Commun..

[6]  Hartmut R. Pfitzinger,et al.  Acoustic correlates of the IPA vowel diagram , 2003 .

[7]  H. Pfitzinger Segmental effects on the prosody of voice quality , 2008 .

[8]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[9]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[10]  K. Kohler,et al.  Parameters of Speech Rate Perception in German Words and Sentences: Duration, F o Movement, and F o Level , 1986, Language and speech.

[11]  H. Pfitzinger Towards functional modelling of relationships between the acoustics and perception of vowels , 2005 .

[12]  Hartmut R. Pfitzinger Dynamic vowel quality: a new determination formalism based on perceptual experiments , 1995, EUROSPEECH.

[13]  Hans G. Tillmann,et al.  Local Speech Rate: Relationships between Articulation and Speech Acoustics , 2003 .

[14]  N. Campbell,et al.  Voice Quality : the 4 th Prosodic Dimension , 2004 .

[15]  Esther Janse,et al.  Production and perception of fast speech , 2003 .

[16]  Hansjörg Mixdorff,et al.  Unresolved Anger : Prosodic analysis and classification of speech from a therapeutic setting , 2010 .

[17]  Esther Janse,et al.  Intelligibility of time-compressed speech: three ways of time-compression , 2000, INTERSPEECH.

[18]  Hartmut R. Pfitzinger,et al.  Local speech rate as a combination of syllable and phone rate , 1998, ICSLP.

[19]  H. Pfitzinger,et al.  Comparing perceptual local speech rate of German and Japanese speech , 2006, Speech Prosody 2006.