论文信息 - A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing

A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing

Dual decomposition, and more generally Lagrangian relaxation, is a classical method for combinatorial optimization; it has recently been applied to several inference problems in natural language processing (NLP). This tutorial gives an overview of the technique. We describe example algorithms, describe formal guarantees for the method, and describe practical issues in implementing the algorithms. While our examples are predominantly drawn from the NLP literature, the material should be of general relevance to inference problems in machine learning. A central theme of this tutorial is that Lagrangian relaxation is naturally applied in conjunction with a broad class of combinatorial algorithms, allowing inference in models that go significantly beyond previous work on Lagrangian relaxation for inference in graphical models.

Alexander M. Rush | Michael Collins | Michael Collins

[1] Torres Martins,et al. The Geometry of Constrained Structured Prediction: Applications to Inference and Learning of Natural Language Syntax , 2012 .

[2] Claude Lemaréchal,et al. Lagrangian Relaxation , 2000, Computational Combinatorial Optimization.

[3] Ben Taskar,et al. Word Alignment via Quadratic Assignment , 2006, NAACL.

[4] John DeNero,et al. Model-Based Aligner Combination Using Dual Decomposition , 2011, ACL.

[5] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[6] Nikos Komodakis,et al. MRF Optimization via Dual Decomposition: Message-Passing Revisited , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[7] Yurii Nesterov,et al. Smooth minimization of non-smooth functions , 2005, Math. Program..

[8] Stephen P. Boyd,et al. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[9] Alexander M. Rush,et al. On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing , 2010, EMNLP.

[10] A. Hasman,et al. Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[11] D. Sontag. 1 Introduction to Dual Decomposition for Inference , 2010 .

[12] David A. Smith,et al. Dependency Parsing by Belief Propagation , 2008, EMNLP.

[13] Daniel P. Huttenlocher,et al. Efficient Belief Propagation for Early Vision , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14] Thomas Hofmann,et al. Using Combinatorial Optimization within Max-Product Belief Propagation , 2007 .

[15] Martin J. Wainwright,et al. MAP estimation via agreement on trees: message-passing and linear programming , 2005, IEEE Transactions on Information Theory.

[16] Ofer Meshi,et al. An Alternating Direction Method for Dual MAP LP Relaxation , 2011, ECML/PKDD.

[17] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[18] Eric P. Xing,et al. An Augmented Lagrangian Approach to Constrained MAP Inference , 2011, ICML.

[19] Nikos Komodakis,et al. MRF Energy Minimization and Beyond via Dual Decomposition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Stephen Gould,et al. Accelerated dual decomposition for MAP inference , 2010, ICML.

[21] Xavier Carreras,et al. Simple Semi-supervised Dependency Parsing , 2008, ACL.

[22] George B. Dantzig,et al. Decomposition Principle for Linear Programs , 1960 .

[23] Tommi S. Jaakkola,et al. Fixing Max-Product: Convergent Message Passing Algorithms for MAP LP-Relaxations , 2007, NIPS.

[24] Vladimir Kolmogorov,et al. Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Alexander M. Rush,et al. Exact Decoding of Syntactic Translation Models through Lagrangian Relaxation , 2011, ACL.