Paper information - Vowel quality in spontaneous speech: what makes a good vowel?


Clear speech is characterised by longer segmental durations and less target undershoot [9], which results in more extreme spectral features. This paper deals with the clarity of vowels produced in spontaneous speech in a large corpus of task-oriented dialogues. We present an automatic technique for measuring vowel clarity on the basis of a vowel’s spectral characteristics. This technique was evaluated using a perceptual test. Subjects rated the ‘goodness’ of vowels that differed in spectral characteristics but had controlled duration and amplitude, and these ratings were compared with an automatic rating. Results indicated that although agreement between subjects and the automatic measurement was poor, it was as poor as the agreement between subjects. On the basis of these results we address the following questions: (1) Can subjects reliably judge the clarity of vowels excerpted from spontaneous speech without duration cues? (2) Can a statistical model [3] reliably predict the subjects’ response to such vowels?

Matthew P. Aylett | Alice Turk

[1] B. Lindblom, et al. Interaction between duration, context, and speaking style in English stressed vowels, 1994.

[2] Anne H. Anderson, et al. The HCRC Map Task Corpus, 1991.

[3] John A. Hartigan, et al. Clustering Algorithms, 1975.

[4] R. H. Baayen, et al. The CELEX Lexical Database (CD-ROM), 1996.

[5] P. Lieberman. Some Effects of Semantic and Grammatical Context on the Production and Perception of Speech, 1963.

[6] N. I. Durlach, et al. Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech, 1985, Journal of Speech and Hearing Research.

[7] Steve Young, et al. The HTK Book, 1995.

[8] Sharon Hunnicutt, et al. Intelligibility Versus Redundancy - Conditions of Dependency, 1985.

[9] L. D. Braida, et al. Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing, 1994, The Journal of the Acoustical Society of America.

[10] Matthew. Using statistics to model the vowel space, 1996.

[11] E. Zwicker, et al. Analytical expressions for critical-band rate and critical bandwidth as a function of frequency, 1980.

[12] Dick R. van Bergem, et al. Acoustic vowel reduction as a function of sentence accent, word stress, and word class, 1993, Speech Commun.

[13] Matthew Aylett. Modelling Clarity Change in Spontaneous Speech, 1999.