Paper Information - Constituency Parsing with a Self-Attentive Encoder

We demonstrate that replacing an LSTM encoder with a self-attentive architecture can lead to improvements to a state-of-the-art discriminative constituency parser. The use of attention makes explicit the manner in which information is propagated between different locations in the sentence, which we use to both analyze our model and propose potential improvements. For example, we find that separating positional and content information in the encoder can lead to improved parsing accuracy. Additionally, we evaluate different approaches for lexical representation. Our parser achieves new state-of-the-art results for single models trained on the Penn Treebank: 93.55 F1 without the use of any external data, and 95.13 F1 when using pre-trained word representations. Our parser also outperforms the previous best-published accuracy figures on 8 of the 9 languages in the SPMRL dataset.
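One way to picture "separating positional and content information" is a self-attention step in which content and position vectors are never added together: the attention logit is a content-content term plus a position-position term, and the two parts of the output are mixed separately. The sketch below is an illustration under stated assumptions, not the authors' implementation. It uses a single head, omits the learned query/key/value projections (the raw vectors stand in for all three), and the function names `factored_attention` and `mix` are hypothetical.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix(weights, vectors):
    """Weighted average of `vectors` for each row of attention weights."""
    width = len(vectors[0])
    return [
        [sum(w * v[k] for w, v in zip(row, vectors)) for k in range(width)]
        for row in weights
    ]


def factored_attention(content, position):
    """Single-head self-attention with separate content and position parts.

    Assumption: no learned projections; each vector acts as its own
    query, key, and value. The logit for (i, j) has no content-position
    cross terms.
    """
    n = len(content)
    # Scale by the square root of the combined dimensionality.
    scale = math.sqrt(len(content[0]) + len(position[0]))
    weights = []
    for i in range(n):
        logits = [
            (dot(content[i], content[j]) + dot(position[i], position[j])) / scale
            for j in range(n)
        ]
        weights.append(softmax(logits))
    # Content and position values are mixed separately and stay separate.
    return weights, mix(weights, content), mix(weights, position)


# Two tokens with 2-dimensional content and 1-dimensional position vectors.
content = [[1.0, 0.0], [0.0, 1.0]]
position = [[1.0], [0.0]]
weights, out_content, out_position = factored_attention(content, position)
```

Because the two terms are summed only at the logit level, each attention weight can be attributed to a content match, a position match, or both, which is the kind of inspection the abstract alludes to.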

Dan Klein | Nikita Kitaev
