Acoustic and articulatory features of diphthong production: a speech clarity study.

PURPOSE The purpose of this study was to evaluate how speaking clearly influences selected acoustic and orofacial kinematic measures associated with diphthong production.

METHOD Forty-nine speakers, drawn from the University of Wisconsin X-Ray Microbeam Speech Production Database (J. R. Westbury, 1994), served as participants. Samples of clear and conversational productions of the word "combine" were extracted for analysis. Analyses included listener ratings of speech clarity and a number of acoustic and articulatory kinematic measures associated with production of the diphthong /aI/.

RESULTS Key results indicate that speaking clearly is associated with (a) increased duration of diphthong-related acoustic and kinematic events, (b) larger F1 and F2 excursions and associated tongue and mandible movements, and (c) minimal evidence of change in formant transition rate.

CONCLUSIONS Overall, the results suggest that clarity-related changes in diphthong production are accomplished through larger, longer, but not necessarily faster diphthong-related transitions. The clarity-related adjustments in diphthong production observed in this study conform to a simple model that assumes speech clarity arises out of reduced overlap of articulatory gestures.
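The relationship among the three reported measures can be illustrated with a minimal sketch. It assumes that transition rate is defined as formant excursion divided by transition duration; the abstract does not state the study's exact measurement definitions, so that definition, the `Transition` class, and all numeric values below are hypothetical and chosen only to show how an excursion can grow while the rate stays unchanged.

```python
from dataclasses import dataclass


@dataclass
class Transition:
    """A single formant transition (hypothetical representation)."""
    onset_hz: float      # formant frequency at transition onset
    offset_hz: float     # formant frequency at transition offset
    duration_ms: float   # transition duration

    @property
    def excursion_hz(self) -> float:
        # Size of the formant movement, ignoring direction.
        return abs(self.offset_hz - self.onset_hz)

    @property
    def rate_hz_per_ms(self) -> float:
        # Assumed definition: excursion divided by duration.
        return self.excursion_hz / self.duration_ms


# Invented F2 values for illustration only; these are not data from the study.
conversational = Transition(onset_hz=1200.0, offset_hz=1800.0, duration_ms=100.0)
clear = Transition(onset_hz=1150.0, offset_hz=1900.0, duration_ms=125.0)

for label, t in (("conversational", conversational), ("clear", clear)):
    print(label, t.excursion_hz, t.duration_ms, t.rate_hz_per_ms)
```

With these invented values the clear token has a larger excursion and a longer duration, yet the two rates are equal, which is the pattern described as "larger, longer, but not necessarily faster".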
