Paper Information - Improving NLP through Marginalization of Hidden Syntactic Structure

Many NLP tasks make predictions that are inherently coupled to syntactic relations, but for many languages the resources required to provide such syntactic annotations are unavailable. For others, it is unclear exactly how much of the syntactic annotation can be effectively leveraged with current models, and which structures in the syntactic trees are most relevant to the task at hand. We propose a novel method that avoids the need for any syntactically annotated data when making predictions for a related NLP task. Our method couples latent syntactic representations, constrained to form valid dependency graphs or constituency parses, with the prediction task via specialized factors in a Markov random field. At both training and test time we marginalize over this hidden structure, learning the optimal latent representations for the problem. Results show that this approach provides significant gains over a syntactically uninformed baseline, outperforming models that observe syntax on an English relation extraction task and performing comparably to them in semantic role labeling.
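As a rough illustration of what marginalizing over hidden syntactic structure means, the toy sketch below sums over every valid dependency tree of a three-word sentence to obtain the probability of a task label. It is not the authors' implementation: the abstract does not say how inference is carried out, and brute-force enumeration is only feasible for very short sentences. The names `is_tree`, `valid_trees`, `label_marginals`, and `toy_coupling`, the label set, and all scores are hypothetical choices made for this example.

```python
import itertools
import math


def is_tree(heads):
    """heads[m - 1] is the head of word m (1-based); 0 is an artificial root."""
    for start in range(1, len(heads) + 1):
        seen = set()
        node = start
        while node != 0:  # follow head pointers until the root is reached
            if node in seen:  # revisiting a word means there is a cycle
                return False
            seen.add(node)
            node = heads[node - 1]
    return True


def valid_trees(n):
    """Enumerate every head assignment over n words that forms a rooted tree."""
    for heads in itertools.product(range(n + 1), repeat=n):
        if is_tree(heads):
            yield heads


def label_marginals(n, labels, arc_score, coupling):
    """Return p(label) with the latent tree summed out.

    arc_score(head, modifier) scores a single latent arc (assumed form).
    coupling(label, heads) scores how the label interacts with the tree.
    """
    unnormalized = {label: 0.0 for label in labels}
    for heads in valid_trees(n):
        tree_score = sum(arc_score(h, m + 1) for m, h in enumerate(heads))
        for label in labels:
            unnormalized[label] += math.exp(tree_score + coupling(label, heads))
    total = sum(unnormalized.values())
    return {label: value / total for label, value in unnormalized.items()}


def toy_coupling(label, heads):
    """Reward the 'related' label when words 1 and 3 are directly linked."""
    linked = heads[2] == 1 or heads[0] == 3
    return 1.0 if (label == "related" and linked) else 0.0


# Uniform arc scores: no syntactic annotation is used anywhere.
marginals = label_marginals(
    3, ["related", "unrelated"], lambda head, modifier: 0.0, toy_coupling
)
```

Because the tree is never observed, the label probability reflects every tree consistent with the constraints, weighted by its score. In this toy setup the trees containing an arc between words 1 and 3 shift probability mass toward the "related" label.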

David A. Smith | Sebastian Riedel | Jason Naradowsky
