Temporal organization of English clear and conversational speech.

This study investigated the effect of hyperarticulated, intelligibility-enhancing clear speech on temporal characteristics, as reflected in the number, duration, and variability of consonantal and vocalic intervals in sentence- and paragraph-length utterances. Sentence-in-noise listening tests showed a consistent clear speech intelligibility gain across utterances of varying complexity, indicating that the talkers successfully maintained clear speech articulatory modifications throughout longer stretches of speech. The acoustic analysis revealed that some temporal restructuring accompanied changes in speaking style. This restructuring took two forms: the insertion of consonant and vowel segments that were dropped or coarticulated in conversational speech, and an increase in the number of prosodic phrases in clear speech. Importantly, the coefficients of variation (the variability of interval durations normalized for changes in speaking rate) for both consonantal and vocalic intervals remained stable across the two speaking styles. Overall, these results suggest that the increased intelligibility of clear speech may be attributed to prosodic structure enhancement (increased phrasing and enhanced segmentability) and stable global temporal properties.
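
The rate-normalized variability measure mentioned above can be illustrated with a minimal sketch. It assumes the coefficient of variation is computed as the standard deviation of interval durations divided by their mean; the choice of population standard deviation, the function name, and the duration values are all hypothetical and are not taken from the study.

```python
from statistics import mean, pstdev


def coefficient_of_variation(durations_ms):
    """Return SD / mean for a list of interval durations.

    Assumption: population SD is used; the study may have used a
    different estimator.
    """
    if len(durations_ms) < 2:
        raise ValueError("need at least two intervals")
    return pstdev(durations_ms) / mean(durations_ms)


# Hypothetical vocalic interval durations in ms (illustrative only).
conversational = [60.0, 90.0, 120.0, 75.0]
# The same intervals stretched uniformly by 1.5x, mimicking a slower rate.
clear = [d * 1.5 for d in conversational]

cv_conversational = coefficient_of_variation(conversational)
cv_clear = coefficient_of_variation(clear)

# Raw variability grows with the slower rate, but the normalized measure
# is unchanged (up to floating-point rounding).
print(pstdev(conversational), pstdev(clear))
print(cv_conversational, cv_clear)
```

Under a uniform change in rate, the raw standard deviation scales with the durations while the ratio stays the same, which is why a stable coefficient of variation across styles points to preserved relative timing rather than merely slower speech.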
