Exaggeration of featural contrasts in clarifications of misheard speech in English

Abstract This study investigates the extent to which speakers manipulate featural distinctions when trying to clarify misunderstood speech, focusing on voicing contrasts in stops and height and backness (represented by F 1 and F 2) and durational contrasts in vowels. Participants interacted with a simulated speech recognizer, repeating words when they were “guessed” incorrectly. Both phonemically voiced and voiceless stops showed more extreme VOT values when elicited by an incorrect guess in which the consonant was a minimal pair in voicing with the target consonant (e.g. subject reads “bit”, computer guesses “pit”), but not when elicited by an open-ended request for repetition (e.g. subject reads “bit”, computer guesses “What did you say?”). A follow-up study showed that the change in VOT between the two repetitions was only present when the incorrect guess contrasted in voicing, but not when it contrasted in place or manner. In contrast, for vowels, the amount and direction of formant change in the F 1– F 2 space was not significantly different from zero for either type of incorrect guess. However, when there was a durational component to the vowel contrast (/i/ vs. / ɪ / ), speakers exaggerated the durational differences between the segments, as opposed to when there was not a durational contrast (e.g. /i/ vs. /u/). The results show that speakers perform local, systematic, and phonologically informed manipulations of temporal contrasts online when clarifying phonetic segments.

[1]  J. Perkell,et al.  Economy of effort in different speaking conditions. I. A preliminary study of intersubject differences and modeling issues. , 2002, The Journal of the Acoustical Society of America.

[2]  Ann R. Bradlow,et al.  Stability of temporal contrasts across speaking styles in English and Croatian , 2008, J. Phonetics.

[3]  Jean C. Krause,et al.  Acoustic properties of naturally produced clear speech at normal speaking rates. , 1996, The Journal of the Acoustical Society of America.

[4]  W. A. Ainsworth,et al.  Duration as a Cue in the Recognition of Synthetic Vowels , 1972 .

[5]  Nina Kraus,et al.  Speaking clearly for children with learning disabilities: sentence perception in noise. , 2003, Journal of speech, language, and hearing research : JSLHR.

[6]  Richard A. Wright,et al.  Intraspeaker variability in vowel production: An investigation of motherese, hyperspeech, and Lombard speech in Jamaican speakers , 2007, J. Phonetics.

[7]  M. Picheny,et al.  Speaking clearly for the hard of hearing. II: Acoustic characteristics of clear and conversational speech. , 1986, Journal of speech and hearing research.

[8]  Matthew Goldrick,et al.  Mechanisms of interaction in speech production , 2009, Language and cognitive processes.

[9]  S Oviatt,et al.  Modeling global and focal hyperarticulation during human-computer error resolution. , 1998, The Journal of the Acoustical Society of America.

[10]  Ann R Bradlow,et al.  Temporal organization of English clear and conversational speech. , 2008, The Journal of the Acoustical Society of America.

[11]  D J Schum,et al.  Intelligibility of clear and conversational speech of young and elderly talkers. , 1996, Journal of the American Academy of Audiology.

[12]  Ann R. Bradlow,et al.  Speaking and Hearing Clearly: Talker and Listener Factors in Speaking Style Changes , 2009, Lang. Linguistics Compass.

[13]  D. Dubois,et al.  Influence of sound immersion and communicative interaction on the Lombard effect. , 2010, Journal of speech, language, and hearing research : JSLHR.

[14]  Ann R Bradlow,et al.  Production and perception of clear speech in Croatian and English. , 2004, The Journal of the Acoustical Society of America.

[15]  Carlos Gussenhoven,et al.  Confluent talker- and listener-oriented forces in clear speech production , 2002 .

[16]  A. Jongman,et al.  Acoustic characteristics of clearly spoken English fricatives. , 2009, The Journal of the Acoustical Society of America.

[17]  Martin Cooke,et al.  Spectral and temporal changes to speech produced in the presence of energetic and informational maskers. , 2010, The Journal of the Acoustical Society of America.

[18]  David B. Pisoni,et al.  Intelligibility of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics , 1996, Speech Commun..

[19]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[20]  Khalil Iskarous,et al.  Vowel Production and Perception: Hyperarticulation without a Hyperspace Effect , 2004, Language and speech.

[21]  Valérie Hazan,et al.  An acoustic-phonetic comparison of the clear speaking styles of Finnish-English late bilinguals , 2012, J. Phonetics.

[22]  Taehong Cho,et al.  Effects of initial position versus prominence in English , 2007, J. Phonetics.

[23]  B. Lindblom,et al.  Interaction between duration, context, and speaking style in English stressed vowels , 1994 .

[24]  E BrennanSusan,et al.  Adapting speaking after evidence of misrecognition , 2008 .

[25]  Donna Erickson,et al.  Articulation of Extreme Formant Patterns for Emphasized Vowels , 2002, Phonetica.

[26]  Taehong Cho,et al.  Articulatory and acoustic studies on domain-initial strengthening in Korean , 2001, J. Phonetics.

[27]  J Vincent,et al.  What is the smallest prosodic domain , 1994 .

[28]  Kenneth de Jong,et al.  Comparing stress, lexical focus, and segmental focus: patterns of variation in Arabic vowel duration , 2002, J. Phonetics.

[29]  Stefanie Shattuck-Hufnagel,et al.  A prosody tutorial for investigators of auditory sentence processing , 1996, Journal of psycholinguistic research.

[30]  P. Boersma Praat : doing phonetics by computer (version 5.1.05) , 2009 .

[31]  S Oviatt,et al.  Linguistic Adaptations During Spoken and Multimodal Error Resolution , 1998, Language and speech.

[32]  S. G. Guion,et al.  Clear speech production of Korean stops: changing phonetic targets and enhancement strategies. , 2008, The Journal of the Acoustical Society of America.

[33]  Amanda Stent,et al.  Adapting speaking after evidence of misrecognition: Local and global hyperarticulation , 2008, Speech Commun..

[34]  Keren Rice,et al.  The Blackwell Companion to Phonology , 2011 .

[35]  Taehong Cho,et al.  Prosodic influences on consonant production in Dutch: Effects of prosodic boundaries, phrasal accent and lexical stress , 2005, J. Phonetics.

[36]  Maria Uther,et al.  Do you speak E-NG-L-I-SH? A comparison of foreigner- and infant-directed speech , 2007, Speech Commun..

[37]  L. Lisker,et al.  A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements , 1964 .

[38]  Diane Kewley-Port,et al.  Talker differences in clear and conversational speech: acoustic characteristics of vowels. , 2007, Journal of speech, language, and hearing research : JSLHR.

[39]  Ann R. Bradlow Confluent talker- and listener-oriented forces in clear speech production , 2002 .

[40]  Taehong Cho Prosodic strengthening and featural enhancement: evidence from acoustic and articulatory realizations of /a,i/ in English. , 2005, The Journal of the Acoustical Society of America.

[41]  Kuniko Y. Nielsen Specificity and abstractness of VOT imitation , 2011, J. Phonetics.

[42]  S. Blumstein,et al.  Effects of speaking rate on voice-onset time in Thai, French, and English , 1997 .

[43]  Mirjam Ernestus,et al.  Prosodic conditioning of phonetic detail in German plosives , 2011, J. Phonetics.

[44]  Koopmans-Van Beinum,et al.  Vowel contrast reduction : an acoustic and perceptual study of Dutch vowels in various speech conditions , 1980 .

[45]  J. Hillenbrand,et al.  Some effects of duration on vowel recognition. , 2000, The Journal of the Acoustical Society of America.

[46]  Paul Boersma,et al.  UvA-DARE (Digital Academic Repository) A cross-dialect acoustic description of vowels: Brazilian and European Portuguese , 2009 .

[47]  Kenneth de Jong,et al.  Stress, lexical focus, and segmental focus in English: patterns of variation in vowel duration , 2004, J. Phonetics.

[48]  Kyoung-Ho Kang,et al.  Clear Speech Production and Perception of Korean Stops and the Sound Change in Korean Stops. , 2009 .

[49]  Sheng Liu,et al.  Temporal properties in clear speech perception. , 2006, The Journal of the Acoustical Society of America.

[50]  V. Hazan,et al.  Acoustic-phonetic characteristics of speech produced with communicative intent to counter adverse listening conditions. , 2011, The Journal of the Acoustical Society of America.

[51]  Sahyang Kim,et al.  Communicatively driven versus prosodically driven hyper-articulation in Korean , 2011, J. Phonetics.