Paper Information - Stochastic Optimization for CRF Autoencoders

Stochastic Optimization for CRF Autoencoders

The goal of this project is to implement existing stochastic optimization methods in the context of conditional random field (CRF) autoencoders [1]. CRF autoencoders are a class of probabilistic models designed to address unsupervised and semi-supervised problems in natural language processing. For concreteness, we focus on a particular instantiation of CRF autoencoders for the classic problem of bitext word alignment [4]. In this milestone, we experimented with several variations of stochastic gradient descent (see §3.1 for details). By the end of the semester, we also plan to experiment with stochastic expectation maximization (see §3.1.1 for details).
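To make the term "stochastic gradient descent" concrete, the following is a minimal sketch of minibatch SGD with a decaying step size. It is not the authors' implementation: the CRF autoencoder objective is not given in this summary, so a toy least-squares problem stands in for it, and the names `sgd`, `squared_error_grad`, `step0`, and `decay`, as well as the step-size schedule, are assumptions chosen for illustration.

```python
import random


def sgd(grad_fn, data, w0, step0=0.1, decay=0.01, batch_size=2, epochs=50, seed=0):
    """Minibatch SGD with step size step0 / (1 + decay * t) (an assumed schedule)."""
    rng = random.Random(seed)
    w = list(w0)
    data = list(data)
    t = 0  # number of parameter updates performed so far
    for _ in range(epochs):
        rng.shuffle(data)  # visit the examples in a fresh random order each epoch
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]
            # Average the per-example gradients over the minibatch.
            g = [0.0] * len(w)
            for example in batch:
                for i, gi in enumerate(grad_fn(w, example)):
                    g[i] += gi / len(batch)
            step = step0 / (1.0 + decay * t)
            w = [wi - step * gi for wi, gi in zip(w, g)]
            t += 1
    return w


def squared_error_grad(w, example):
    """Gradient of (w[0] * x + w[1] - y) ** 2 with respect to w (toy stand-in objective)."""
    x, y = example
    err = w[0] * x + w[1] - y
    return [2.0 * err * x, 2.0 * err]


# Hypothetical noise-free toy data generated from y = 3x + 1.
toy_data = [(x, 3.0 * x + 1.0) for x in [-2.0, -1.0, 0.0, 1.0, 2.0]]
w_hat = sgd(squared_error_grad, toy_data, w0=[0.0, 0.0])
print(w_hat)
```

In this sketch, variations of SGD would correspond to different choices of `batch_size` and of the step-size schedule; swapping in another objective only requires a different `grad_fn`.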

Waleed Ammar | Fan Yang

[1] Jorge Nocedal, et al. On the limited memory BFGS method for large scale optimization, 1989, Math. Program.

[2] Robert L. Mercer, et al. The Mathematics of Statistical Machine Translation: Parameter Estimation, 1993, CL.

[3] Salim Roukos, et al. Bleu: a Method for Automatic Evaluation of Machine Translation, 2002, ACL.

[4] Philipp Koehn, et al. Moses: Open Source Toolkit for Statistical Machine Translation, 2007, ACL.

[5] Dan Klein, et al. Online EM for Unsupervised Models, 2009, NAACL.

[6] O. Cappé, et al. On-line expectation–maximization algorithm for latent data models, 2009.

[7] Ben Taskar, et al. Posterior Regularization for Structured Latent Variable Models, 2010, J. Mach. Learn. Res.

[8] Léon Bottou, et al. Stochastic Gradient Descent Tricks, 2012, Neural Networks: Tricks of the Trade.

[9] Noah A. Smith, et al. Conditional Random Field Autoencoders for Unsupervised Structured Prediction, 2014, NIPS.

[10] Dimitri P. Bertsekas, et al. Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey, 2015, ArXiv.