论文信息 - Learning Greedy Policies for the Easy-First Framework - 字舞流文

Learning Greedy Policies for the Easy-First Framework

Easy-first, a search-based structured prediction approach, has been applied to many NLP tasks including dependency parsing and coreference resolution. This approach employs a learned greedy policy (action scoring function) to make easy decisions first, which constrains the remaining decisions and makes them easier. We formulate greedy policy learning in the Easy-first approach as a novel non-convex optimization problem and solve it via an efficient Majorization Minimization (MM) algorithm. Results on within-document coreference and cross-document joint entity and event coreference tasks demonstrate that the proposed approach achieves statistically significant performance improvement over existing training regimes for Easy-first and is less susceptible to overfitting.

Thomas G. Dietterich | Xiaoli Z. Fern | Chao Ma | Janardhan Rao Doppa | Prasad Tadepalli | Jun Xie | Prashanth Mannem

[1] Yuchen Zhang,et al. CoNLL-2012 Shared Task: Modeling Multilingual Unrestricted Coreference in OntoNotes , 2012, EMNLP-CoNLL Shared Task.

[2] Alan Fern,et al. HC-Search: A Learning Framework for Search-based Structured Prediction , 2014, J. Artif. Intell. Res..

[3] Richard Johansson,et al. Dependency-based Syntactic–Semantic Analysis with PropBank and NomBank , 2008, CoNLL.

[4] Koby Crammer,et al. Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[5] Alan Fern,et al. Structured prediction via output space search , 2014, J. Mach. Learn. Res..

[6] D. Hunter,et al. A Tutorial on MM Algorithms , 2004 .

[7] Philipp Koehn,et al. Statistical Significance Tests for Machine Translation Evaluation , 2004, EMNLP.

[8] Thomas G. Dietterich,et al. Prune-and-Score: Learning for Greedy Coreference Resolution , 2014, EMNLP.

[9] Andrew McCallum,et al. First-Order Probabilistic Models for Coreference Resolution , 2007, NAACL.

[10] Dan Klein,et al. Easy Victories and Uphill Battles in Coreference Resolution , 2013, EMNLP.

[11] Vincent Ng,et al. Supervised Noun Phrase Coreference Research: The First Fifteen Years , 2010, ACL.

[12] Heeyoung Lee,et al. A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[13] Heeyoung Lee,et al. Joint Entity and Event Coreference Resolution across Documents , 2012, EMNLP.

[14] Luke S. Zettlemoyer,et al. Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves , 2013, EMNLP.

[15] Dan Roth,et al. Learning-based Multi-Sieve Co-reference Resolution with Knowledge , 2012, EMNLP-CoNLL.

[16] Xiaoqiang Luo,et al. On Coreference Resolution Performance Metrics , 2005, HLT.

[17] Dan Klein,et al. Coreference Resolution in a Modular, Entity-Centered Model , 2010, NAACL.

[18] Nianwen Xue,et al. CoNLL-2011 Shared Task: Modeling Unrestricted Coreference in OntoNotes , 2011, CoNLL Shared Task.

[19] Dan Roth,et al. Illinois-Coref: The UI System in the CoNLL-2012 Shared Task , 2012, EMNLP-CoNLL Shared Task.

[20] Alan Fern,et al. Learning Linear Ranking Functions for Beam Search with Application to Planning , 2009, J. Mach. Learn. Res..

[21] Veselin Stoyanov,et al. Easy-first Coreference Resolution , 2012, COLING.

[22] Lynette Hirschman,et al. A Model-Theoretic Coreference Scoring Scheme , 1995, MUC.

[23] Breck Baldwin,et al. Algorithms for Scoring Coreference Chains , 1998 .

[24] Daniel Marcu,et al. Learning as search optimization: approximate large margin methods for structured prediction , 2005, ICML.

[25] Dan Roth,et al. Understanding the Value of Features for Coreference Resolution , 2008, EMNLP.

[26] Giorgio Satta,et al. Guided Learning for Bidirectional Sequence Classification , 2007, ACL.

[27] Yoav Goldberg,et al. An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing , 2010, NAACL.