Paper Information - Global Features for Shallow Discourse Parsing

Global Features for Shallow Discourse Parsing

A coherently related group of sentences may be referred to as a discourse. In this paper we address the problem of parsing coherence relations as defined in the Penn Discourse TreeBank (PDTB). A good model for discourse structure analysis needs to account both for local dependencies at the token level and for global dependencies and statistics. We present techniques for using inter-sentential or sentence-level (global), data-driven, non-grammatical features in the task of discourse parsing. The parser model builds on a previous approach to shallow discourse parsing that uses token-level (local) features with conditional random fields and lacks structural knowledge of discourse. The parser adopts a two-stage approach: local constraints are applied first, and global constraints are then applied to a reduced, weighted search space (the n-best list). In the second stage we experiment with different rerankers trained on the first-stage n-best parses, which are generated using lexico-syntactic local features. The two-stage parser yields significant improvements over the best-performing discourse parser model on the PDTB corpus.
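The two-stage idea described in the abstract can be illustrated with a minimal sketch. It assumes the first stage has already produced an n-best list of label sequences with scores; the second stage rescores each candidate with sentence-level features and a linear model. The feature names, the `ARG1`/`ARG2`/`O` label set, and the plain perceptron-style update are all assumptions made for illustration, not the paper's actual feature set or reranker.

```python
from typing import Dict, List, Sequence, Tuple

# One first-stage candidate: (token-level label sequence, first-stage score).
Candidate = Tuple[List[str], float]
Features = Dict[str, float]


def global_features(labels: Sequence[str], local_score: float) -> Features:
    """Hypothetical sentence-level features computed over a whole candidate."""
    n_arg1 = sum(1 for label in labels if label == "ARG1")
    n_arg2 = sum(1 for label in labels if label == "ARG2")
    return {
        "local_score": local_score,          # score from the first stage
        "num_arg1_tokens": float(n_arg1),
        "num_arg2_tokens": float(n_arg2),
        "has_both_args": 1.0 if n_arg1 and n_arg2 else 0.0,
    }


def score(weights: Features, feats: Features) -> float:
    """Linear score: dot product of weights and features."""
    return sum(weights.get(name, 0.0) * value for name, value in feats.items())


def rerank(nbest: List[Candidate], weights: Features) -> List[str]:
    """Return the label sequence of the highest-scoring candidate."""
    best = max(nbest, key=lambda cand: score(weights, global_features(*cand)))
    return best[0]


def perceptron_update(weights: Features, nbest: List[Candidate],
                      gold_index: int, lr: float = 1.0) -> int:
    """One perceptron-style step: move weights toward the gold candidate."""
    feats = [global_features(labels, s) for labels, s in nbest]
    pred = max(range(len(nbest)), key=lambda i: score(weights, feats[i]))
    if pred != gold_index:
        for name, value in feats[gold_index].items():
            weights[name] = weights.get(name, 0.0) + lr * value
        for name, value in feats[pred].items():
            weights[name] = weights.get(name, 0.0) - lr * value
    return pred


if __name__ == "__main__":
    # Toy n-best list: the first stage prefers a parse that is missing ARG2.
    nbest = [(["ARG1", "O", "O"], 0.6), (["ARG1", "O", "ARG2"], 0.5)]
    weights = {"local_score": 1.0}
    perceptron_update(weights, nbest, gold_index=1)
    print(rerank(nbest, weights))
```

In this toy run the first-stage ranking initially wins, and a single update is enough for the global features to promote the candidate that contains both arguments. A real reranker would be trained over many n-best lists and would use far richer features.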

Richard Johansson | Giuseppe Riccardi | Sucheta Ghosh
