Joint Incremental Disfluency Detection and Dependency Parsing

We present an incremental dependency parsing model that jointly performs disfluency detection. The model handles speech repairs using a novel non-monotonic transition system, and includes several novel classes of features. For comparison, we evaluated two pipeline systems, using state-of-the-art disfluency detectors. The joint model performed better on both tasks, with a parse accuracy of 90.5% and 84.0% accuracy at disfluency detection. The model runs in expected linear time, and processes over 550 tokens a second.

[1]  Joakim Nivre,et al.  Squibs: Going to the Roots of Dependency Parsing , 2013, CL.

[2]  Yang Guo,et al.  Structured Perceptron with Inexact Search , 2012, NAACL.

[3]  Yue Zhang,et al.  Fast and Accurate Shift-Reduce Constituent Parsing , 2013, ACL.

[4]  Mark Johnson,et al.  Detecting Speech Repairs Incrementally Using a Noisy Channel Approach , 2010, COLING.

[5]  Percy Liang,et al.  Semi-Supervised Learning for Natural Language , 2005 .

[6]  Joakim Nivre,et al.  Transition-based Dependency Parsing with Rich Non-local Features , 2011, ACL.

[7]  Joakim Nivre,et al.  Algorithms for Deterministic Incremental Dependency Parsing , 2008, CL.

[8]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[9]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[10]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[11]  Michael Collins,et al.  Efficient Third-Order Dependency Parsers , 2010, ACL.

[12]  Douglas A. Reynolds,et al.  Measuring the readability of automatic speech-to-text transcripts , 2003, INTERSPEECH.

[13]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[14]  Joakim Nivre,et al.  A Dynamic Oracle for Arc-Eager Dependency Parsing , 2012, COLING.

[15]  K. Rayner,et al.  Making and correcting errors during sentence comprehension: Eye movements in the analysis of structurally ambiguous sentences , 1982, Cognitive Psychology.

[16]  Yang Liu,et al.  Disfluency Detection Using Multi-step Stacked Learning , 2013, NAACL.

[17]  Christiane Fellbaum,et al.  Obituary: George A. Miller , 2013, CL.

[18]  Mark Johnson,et al.  A Non-Monotonic Arc-Eager Transition System for Dependency Parsing , 2013, CoNLL.

[19]  Shuangzhi Wu,et al.  Punctuation Prediction with Transition-based Parsing , 2013, ACL.

[20]  Mark Johnson,et al.  The impact of language models and loss functions on repair disfluency detection , 2011, ACL.

[21]  Fredrik Jørgensen The Effects of Disfluency Detection in Parsing Spoken Language , 2007, NODALIDA.

[22]  Eugene Charniak,et al.  Edit Detection and Parsing for Transcribed Speech , 2001, NAACL.

[23]  Eugene Charniak,et al.  A TAG-based noisy-channel model of speech repairs , 2004, ACL.

[24]  Giorgio Satta,et al.  A Transition-Based Dependency Parser Using a Dynamic Parsing Strategy , 2013, ACL.

[25]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[26]  Alexander Gelbukh,et al.  Computational Linguistics and Intelligent Text Processing , 2015, Lecture Notes in Computer Science.

[27]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[28]  Eugene Charniak,et al.  Immediate-Head Parsing for Language Models , 2001, ACL.

[29]  Elisabeth Schriberg,et al.  Preliminaries to a Theory of Speech Disfluencies , 1994 .

[30]  Stephen Clark,et al.  Syntactic Processing Using the Generalized Perceptron and Beam Search , 2011, CL.

[31]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[32]  Mohammad Sadegh Rasooli,et al.  Joint Parsing and Disfluency Detection in Linear Time , 2013, EMNLP.

[33]  Xu Sun,et al.  Latent Variable Perceptron Algorithm for Structured Classification , 2009, IJCAI.

[34]  Mary P. Harper,et al.  SParseval: Evaluation Metrics for Parsing Speech , 2006, LREC.