Modelling Expectation in the Self-Repair Processing of Annotat-, um, Listeners

This paper describes a statistical corpus study of self-repairs in the disfluencyannotated Switchboard corpus which examines the time-linear nature of self-repair processing for annotators and listeners in dialogue. The study suggests a strictly local detection and processing mechanism for self-repairs is sufficient, an advantage currently not used effectively under the bonnet of state-of-the-art automatic disfluency processing. We then show how simple local fluency measures using modified language models can be strongly indicative of repair onset detection, and how simple information theoretic measures could characterize different classes of repairs.

[1]  W. Levelt,et al.  Monitoring and self-repair in speech , 1983, Cognition.

[2]  李幼升,et al.  Ph , 1989 .

[3]  Elisabeth Schriberg,et al.  Preliminaries to a Theory of Speech Disfluencies , 1994 .

[4]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Elizabeth Shriberg DISFLUENCIES IN SWITCHBOARD , 1996 .

[6]  Andreas Stolcke,et al.  How far do speakers back up in repairs? a quantitatve model , 1998, ICSLP.

[7]  James F. Allen,et al.  Speech repains, intonational phrases, and discourse markers: modeling speakers’ utterances in spoken dialogue , 1999, CL.

[8]  S. Brennan,et al.  How Listeners Compensate for Disfluencies in Spontaneous Speech , 2001 .

[9]  Eugene Charniak,et al.  Edit Detection and Parsing for Transcribed Speech , 2001, NAACL.

[10]  H. H. Clark,et al.  Using uh and um in spontaneous speaking , 2002, Cognition.

[11]  Eugene Charniak,et al.  A TAG-based noisy-channel model of speech repairs , 2004, ACL.

[12]  Kallirroi Georgila Using Integer Linear Programming for Detecting Speech Disfluencies , 2009, HLT-NAACL.

[13]  Mark Johnson,et al.  Detecting Speech Repairs Incrementally Using a Noisy Channel Approach , 2010, COLING.

[14]  Jonathan Ginzburg,et al.  The interactive stance : meaning for conversation , 2012 .

[15]  Yang Liu,et al.  Disfluency Detection Using Multi-step Stacked Learning , 2013, NAACL.

[16]  Arash Eshghi,et al.  Incremental Grammar Induction from Child-Directed Dialogue Utterances , 2013, CMCL.

[17]  Frank Keller,et al.  Incremental, Predictive Parsing with Psycholinguistically Motivated Tree-Adjoining Grammar , 2013, CL.

[18]  Alexander Clark,et al.  Statistical Representation of Grammaticality Judgements: the Limits of N-Gram Models , 2013, CMCL.

[19]  Jonathan Ginzburg,et al.  Disfluencies as intra-utterance dialogue moves , 2014 .