Enhancing Unsupervised Generative Dependency Parser with Contextual Information

Most unsupervised dependency parsers are based on probabilistic generative models that learn the joint distribution of a sentence and its parse. These models typically decompose the dependency tree explicitly into factorized grammar rules, which lack global features of the entire sentence. In this paper, we propose a novel probabilistic model, the discriminative neural dependency model with valence (D-NDMV), which generates a sentence and its parse from a continuous latent representation that encodes global contextual information about the generated sentence. We propose two approaches to modeling the latent representation: the first deterministically summarizes the representation from the sentence, while the second probabilistically models the representation conditioned on the sentence. Our approach can be regarded as a new type of autoencoder model for unsupervised dependency parsing that combines the benefits of both generative and discriminative techniques. In particular, it breaks the context-free independence assumption made by previous generative approaches and is therefore more expressive. Extensive experiments on seventeen datasets from various sources show that our approach achieves competitive accuracy compared with state-of-the-art generative and discriminative unsupervised dependency parsers.
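The two latent-representation strategies in the abstract can be sketched as follows. This is a minimal illustrative toy, not the paper's actual architecture: the names (`encode_deterministic`, `encode_probabilistic`, `rule_probs`), the mean-pooling summarizer, and the single linear scoring layer are all assumptions made for exposition. The key idea shown is that grammar-rule probabilities become a function of a per-sentence representation `z`, so the same rule can receive different probabilities in different sentences, breaking the context-free independence assumption.

```python
# Illustrative sketch only: function names and the toy pooling/scoring
# layers are hypothetical stand-ins for the model's learned networks.
import numpy as np

rng = np.random.default_rng(0)

def encode_deterministic(word_vecs):
    """Deterministically summarize the sentence into one vector
    (here: mean pooling over word embeddings)."""
    return word_vecs.mean(axis=0)

def encode_probabilistic(word_vecs):
    """Model a distribution over the representation and sample from it
    via the reparameterization trick (z = mu + sigma * eps), in the
    style of a VAE encoder."""
    mu = word_vecs.mean(axis=0)
    log_sigma = word_vecs.std(axis=0)  # toy stand-in for a learned network
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(log_sigma) * eps

def rule_probs(z, weights):
    """Condition grammar-rule probabilities on the sentence
    representation z via a softmax over linear scores."""
    scores = weights @ z
    exp = np.exp(scores - scores.max())
    return exp / exp.sum()

# Toy sentence: 4 words with 8-dimensional embeddings; 5 candidate rules.
words = rng.standard_normal((4, 8))
W = rng.standard_normal((5, 8))

p_det = rule_probs(encode_deterministic(words), W)   # deterministic variant
p_sto = rule_probs(encode_probabilistic(words), W)   # probabilistic variant
```

In the deterministic variant the representation is a fixed function of the sentence; in the probabilistic variant it is sampled, which is what makes the model trainable as a variational autoencoder-style objective.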