Learning Beam Search Policies via Imitation Learning
暂无分享,去创建一个
[1] Slav Petrov,et al. Globally Normalized Transition-Based Neural Networks , 2016, ACL.
[2] Slav Petrov,et al. Structured Training for Neural Network Transition-Based Parsing , 2015, ACL.
[3] Claudio Gentile,et al. On the generalization ability of on-line learning algorithms , 2001, IEEE Transactions on Information Theory.
[4] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Noah A. Smith,et al. Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions , 2010, NAACL.
[6] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[7] John Langford,et al. Machine Learning Techniques—Reductions Between Prediction Quality Metrics , 2008 .
[8] Alexander M. Rush,et al. Sequence-to-Sequence Learning as Beam-Search Optimization , 2016, EMNLP.
[9] Daniel Marcu,et al. Learning as search optimization: approximate large margin methods for structured prediction , 2005, ICML.
[10] Alan Fern,et al. On learning linear ranking functions for beam search , 2007, ICML '07.
[11] Yang Guo,et al. Structured Perceptron with Inexact Search , 2012, HLT-NAACL.
[12] Elad Hazan,et al. Introduction to Online Convex Optimization , 2016, Found. Trends Optim..
[13] John Langford,et al. Search-based structured prediction , 2009, Machine Learning.
[14] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[15] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[16] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[17] Santosh S. Vempala,et al. Efficient algorithms for online decision problems , 2005, J. Comput. Syst. Sci..
[18] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[19] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[20] John Langford,et al. Learning to Search Better than Your Teacher , 2015, ICML.
[21] Ben Taskar,et al. Max-Margin Markov Networks , 2003, NIPS.
[22] Graham Neubig,et al. A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models , 2017, AAAI.
[23] J. Andrew Bagnell,et al. Reinforcement and Imitation Learning via Interactive No-Regret Learning , 2014, ArXiv.
[24] Brian Roark,et al. Incremental Parsing with the Perceptron Algorithm , 2004, ACL.
[25] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.