Cutting-off Redundant Repeating Generations for Neural Abstractive Summarization

This paper tackles the reduction of redundant repeating generation that is often observed in RNN-based encoder-decoder models. Our basic idea is to jointly estimate the upper-bound frequency of each word in the target vocabulary in the encoder and to control the output words in the decoder based on that estimate. Our method shows significant improvement over a strong RNN-based encoder-decoder baseline and achieves its best results on an abstractive summarization benchmark.
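The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes the upper-bound frequencies are already given as a plain dictionary, and it applies them as a hard mask during greedy decoding. The function name `greedy_decode_with_budget`, the `</s>` end marker, and the toy scores are all hypothetical.

```python
from typing import Dict, List


def greedy_decode_with_budget(
    step_scores: List[Dict[str, float]],
    upper_bound: Dict[str, int],
    eos: str = "</s>",
) -> List[str]:
    """Greedy decoding that never emits a word more often than its budget.

    step_scores: one dict per decoding step mapping candidate word -> score
                 (a stand-in for decoder output; hypothetical toy input).
    upper_bound: estimated maximum count per word (assumed to be given).
                 Words without an estimate are treated as having budget 0.
    """
    remaining = dict(upper_bound)  # copy so the caller's dict is not mutated
    output: List[str] = []
    for scores in step_scores:
        # Keep only the end marker and words that still have budget left.
        allowed = {
            w: s for w, s in scores.items()
            if w == eos or remaining.get(w, 0) > 0
        }
        if not allowed:
            break
        best = max(allowed, key=allowed.get)
        if best == eos:
            break
        output.append(best)
        remaining[best] -= 1  # consume one unit of this word's budget
    return output


# Toy scores in which an unconstrained decoder would repeat "police".
toy_scores = [
    {"police": 2.0, "arrest": 1.0, "</s>": 0.1},
    {"police": 1.8, "arrest": 1.5, "</s>": 0.2},
    {"police": 1.7, "suspect": 1.2, "</s>": 0.3},
    {"police": 1.0, "</s>": 0.9},
]

tight = greedy_decode_with_budget(
    toy_scores, {"police": 1, "arrest": 1, "suspect": 1})
loose = greedy_decode_with_budget(
    toy_scores, {"police": 10, "arrest": 10, "suspect": 10})
print(tight)  # ['police', 'arrest', 'suspect']
print(loose)  # ['police', 'police', 'police', 'police']
```

With a budget of one per word the repeated word is cut off after its first use, while a loose budget reproduces the repetition. The paper estimates the bounds jointly in the encoder, which this sketch does not attempt.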

Masaaki Nagata | Jun Suzuki

[1] Alexander M. Rush, et al. Abstractive Sentence Summarization with Attentive Recurrent Neural Networks, 2016, NAACL.

[2] Bowen Zhou, et al. Sequence-to-Sequence RNNs for Text Summarization, 2016, ArXiv.

[3] Yoshua Bengio, et al. Maxout Networks, 2013, ICML.

[4] Bernhard Schölkopf, et al. A tutorial on support vector regression, 2004, Stat. Comput.

[5] Maria Leonor Pacheco, et al. of the Association for Computational Linguistics:, 2001.

[6] Timothy M. Hospedales, et al. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016.

[7] Alexander M. Rush, et al. Sequence-to-Sequence Learning as Beam-Search Optimization, 2016, EMNLP.

[8] Zhiyuan Liu, et al. Neural Headline Generation with Minimum Risk Training, 2016, ArXiv.

[9] Paul Over, et al. DUC in context, 2007, Inf. Process. Manag.

[10] Graham Neubig, et al. Controlling Output Length in Neural Encoder-Decoders, 2016, EMNLP.

[11] Bowen Zhou, et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond, 2016, CoNLL.

[12] Naoaki Okazaki, et al. Neural Headline Generation on Abstract Meaning Representation, 2016, EMNLP.

[13] Christopher D. Manning, et al. Effective Approaches to Attention-based Neural Machine Translation, 2015, EMNLP.

[14] Yoshua Bengio, et al. Deep Sparse Rectifier Neural Networks, 2011, AISTATS.

[15] Zhiguo Wang, et al. Coverage Embedding Models for Neural Machine Translation, 2016, EMNLP.

[16] Yang Liu, et al. Modeling Coverage for Neural Machine Translation, 2016, ACL.

[17] Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013, 2013, ICML.

[18] Quoc V. Le, et al. Sequence to Sequence Learning with Neural Networks, 2014, NIPS.

[19] Bowen Zhou, et al. Pointing the Unknown Words, 2016, ACL.

[20] Yoshua Bengio, et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling, 2014, ArXiv.

[21] Jason Weston, et al. A Neural Attention Model for Abstractive Sentence Summarization, 2015, EMNLP.

[22] Yang Liu, et al. Minimum Risk Training for Neural Machine Translation, 2015, ACL.

[23] Yoshua Bengio, et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation, 2014, EMNLP.

[24] Yoshua Bengio, et al. Neural Machine Translation by Jointly Learning to Align and Translate, 2014, ICLR.

[25] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.

[26] George Kurian, et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, 2016, ArXiv.