A CUE-BASED APPROACH TO PROSODIC DISFLUENCY ANNOTATION

This paper elaborates a proposal for labelling prosodic disfluencies in American English, in conjunction with the ToBI framework for prosodic labelling. Incorporating disfluency annotation ideas developed for other languages, and for stuttered speech, the proposal introduces explicit disfluencyrelated labels into the Break Index tier, providing a more fine-grained categorization of the prosodic disfluency type than established ToBI disfluency labels. In addition, it explicitly labels 'speech errors' (e.g. word and sound sequencing errors) even when not disfluent, enabling separate study of these phenomena, and of their interaction. The paper further explores data labelled using this system by two independent labellers (recordings of spontaneous speech by 5 female speakers of Mainstream American English). The relative frequency of specific disfluency types, alone or in combination with others, reflects a high degree of diversity. Differences in disfluency patterns used by individual speakers are discussed.

[1]  Noam Amir,et al.  Do social anxiety individuals hesitate more? The prosodic profile of hesitation disfluencies in Social Anxiety Disorder individuals , 2016 .

[2]  J. Pierrehumbert The phonology and phonetics of English intonation , 1987 .

[3]  Sun-Ah Jun,et al.  A comparison of disfluency patterns in normal and stuttered speech , 2005, DiSS.

[4]  Timothy Arbisi-Kelm,et al.  Intonation Structure and Disfluency Detection in Stuttering , 2007 .

[5]  Stefanie Shattuck-Hufnagel,et al.  Cue-based annotation and analysis of prosodic boundary events , 2018, Speech Prosody 2018.

[6]  Hideaki Kikuchi,et al.  X-JToBI: an extended j-toBI for spontaneous speech , 2002, INTERSPEECH.

[7]  Mari Ostendorf,et al.  Multi-domain disfluency and repair detection , 2014, INTERSPEECH.

[8]  Stefanie Shattuck-Hufnagel,et al.  DISTRIBUTION OF DISFLUENCIES AND ERRORS IN ENGLISH DISCOURSE , 2007 .

[9]  D. Donaldson,et al.  Listening to the sound of silence: disfluent silent pauses in speech have consequences for listeners , 2010, Neuropsychologia.

[10]  J. Pierrehumbert,et al.  Intonational structure in Japanese and English , 1986, Phonology.

[11]  D. Donaldson,et al.  Not all disfluencies are are equal: The effects of disfluent repetitions on language comprehension , 2009, Brain and Language.

[12]  Laura C. Dilley,et al.  An enhanced autosegmental-metrical theory (AM+) facilitates phonetically transparent prosodic annotation , 2018, 6th International Symposium on Tonal Aspects of Languages (TAL 2018).

[13]  Stefanie Shattuck-Hufnagel,et al.  The Use of Prosody in Syntactic Disambiguation , 1991, HLT.

[14]  Mari Ostendorf,et al.  TOBI: a standard for labeling English prosody , 1992, ICSLP.

[15]  M. Inés Torres,et al.  Annotation and analysis of disfluencies in a spontaneous speech corpus in Spanish , 2001, DiSS.

[16]  Stefanie Shattuck-Hufnagel,et al.  The Prosodic Characteristics of Non-referential Co-speech Gestures in a Sample of Academic-Lecture-Style Speech , 2018, Front. Psychol..

[17]  Stefanie Shattuck-Hufnagel,et al.  The original ToBI system and the evolution of the ToBI framework , 2003 .

[18]  Stefanie Shattuck-Hufnagel,et al.  The alternatives (alt) tier for toBI: advantages of capturing prosodic ambiguity , 2008, Speech Prosody 2008.

[19]  Kenneth N Stevens,et al.  Toward a model for lexical access based on acoustic landmarks and distinctive features. , 2002, The Journal of the Acoustical Society of America.

[20]  Duane G. Watson,et al.  The disfluent discourse: Effects of filled pauses on recall. , 2011, Journal of memory and language.

[21]  H. H. Clark,et al.  Using uh and um in spontaneous speaking , 2002, Cognition.

[22]  Elisabeth Schriberg,et al.  Preliminaries to a Theory of Speech Disfluencies , 1994 .

[23]  S. Brennan,et al.  Disfluency Rates in Conversation: Effects of Age, Relationship, Topic, Role, and Gender , 2001, Language and speech.

[24]  Kirsty McDougall,et al.  Profiling fluency: An analysis of individual variation in disfluencies in adult males , 2017, Speech Commun..

[25]  J. E. Tree Listeners' uses of um and uh in speech comprehension. , 2001 .

[26]  Byron T. Ahn,et al.  ANNOTATING PROSODYWITH POLAR: CONVENTIONS FOR A DECOMPOSITIONAL ANNOTATION SYSTEM , 2019 .

[27]  Alejna Mari Brugos The interaction of pitch and timing in the perception of prosodic grouping , 2015 .