Deep Reinforcement Learning for Sequence-to-Sequence Models
暂无分享,去创建一个
Naren Ramakrishnan | Chandan K. Reddy | Tian Shi | Yaser Keneshloo | Naren Ramakrishnan | C. Reddy | Tian Shi | Yaser Keneshloo
[1] Ming Zhou,et al. Selective Encoding for Abstractive Sentence Summarization , 2017, ACL.
[2] Mirella Lapata,et al. Ranking Sentences for Extractive Summarization with Reinforcement Learning , 2018, NAACL.
[3] Xiang Zhang,et al. Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems , 2015, ICLR.
[4] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[5] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[6] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[7] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[9] Trevor Darrell,et al. Sequence to Sequence -- Video to Text , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[10] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[12] Zhi Chen,et al. Adversarial Feature Matching for Text Generation , 2017, ICML.
[13] William B. Dolan,et al. Collecting Highly Parallel Data for Paraphrase Evaluation , 2011, ACL.
[14] Xuanjing Huang,et al. Attention-Based Convolutional Neural Network for Semantic Relation Extraction , 2016, COLING.
[15] Alex Graves,et al. Recurrent Models of Visual Attention , 2014, NIPS.
[16] Ming Zhu,et al. A Hierarchical Attention Retrieval Model for Healthcare Question Answering , 2019, WWW.
[17] Tao Mei,et al. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Adam Coates,et al. Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.
[19] Kevin Lin,et al. Adversarial Ranking for Language Generation , 2017, NIPS.
[20] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[21] Stefan Riezler,et al. Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning , 2018, ACL.
[22] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[23] Yejin Choi,et al. Deep Communicating Agents for Abstractive Summarization , 2018, NAACL.
[24] Jianfeng Gao,et al. A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.
[25] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[26] Ilya Sutskever,et al. Learning to Generate Reviews and Discovering Sentiment , 2017, ArXiv.
[27] Dawn Xiaodong Song,et al. SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning , 2017, ArXiv.
[28] Lantao Yu,et al. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.
[29] Rémi Munos,et al. Implicit Quantile Networks for Distributional Reinforcement Learning , 2018, ICML.
[30] Li Wang,et al. A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization , 2018, IJCAI.
[31] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Kuzman Ganchev,et al. Semantic Role Labeling with Neural Network Factors , 2015, EMNLP.
[33] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.
[34] Jianfeng Gao,et al. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.
[35] Hang Li,et al. Paraphrase Generation with Deep Reinforcement Learning , 2017, EMNLP.
[36] Frank K. Soong,et al. TTS synthesis with bidirectional LSTM based recurrent neural networks , 2014, INTERSPEECH.
[37] Chen Chen,et al. Chinese Zero Pronoun Resolution with Deep Neural Networks , 2016, ACL.
[38] Daniel Jurafsky,et al. Learning to Decode for Future Success , 2017, ArXiv.
[39] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[40] Hado van Hasselt,et al. Double Q-learning , 2010, NIPS.
[41] Wojciech Zaremba,et al. Reinforcement Learning Neural Turing Machines - Revised , 2015 .
[42] Yoshua Bengio,et al. Maximum-Likelihood Augmented Discrete Generative Adversarial Networks , 2017, ArXiv.
[43] Naren Ramakrishnan,et al. Neural Abstractive Text Summarization with Sequence-to-Sequence Models , 2018, Trans. Data Sci..
[44] Vicente Ordonez,et al. Im2Text: Describing Images Using 1 Million Captioned Photographs , 2011, NIPS.
[45] Sergey Levine,et al. The Mirage of Action-Dependent Baselines in Reinforcement Learning , 2018, ICML.
[46] Bowen Zhou,et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.
[47] Radu Soricut,et al. Cold-Start Reinforcement Learning with Softmax Policy Gradient , 2017, NIPS.
[48] Marc G. Bellemare,et al. Safe and Efficient Off-Policy Reinforcement Learning , 2016, NIPS.
[49] Margaret Mitchell,et al. Generating Natural Questions About an Image , 2016, ACL.
[50] Samuel R. Bowman,et al. A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.
[51] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[52] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[53] Dumitru Erhan,et al. Scalable Object Detection Using Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[54] Subhashini Venugopalan,et al. Translating Videos to Natural Language Using Deep Recurrent Neural Networks , 2014, NAACL.
[55] Martial Hebert,et al. Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.
[56] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[57] Joelle Pineau,et al. An Actor-Critic Algorithm for Sequence Prediction , 2016, ICLR.
[58] Jure Leskovec,et al. Inferring Networks of Substitutable and Complementary Products , 2015, KDD.
[59] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[60] Philip Bachman,et al. Machine Comprehension by Text-to-Text Neural Question Generation , 2017, Rep4NLP@ACL.
[61] Alexander J. Smola,et al. Stacked Attention Networks for Image Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[62] Ruslan Salakhutdinov,et al. Semi-Supervised QA with Generative Domain-Adaptive Nets , 2017, ACL.
[63] Yuxi Li,et al. Deep Reinforcement Learning: An Overview , 2017, ArXiv.
[64] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[65] Xinlei Chen,et al. Mind's eye: A recurrent visual representation for image caption generation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[66] Nenghai Yu,et al. Deliberation Networks: Sequence Generation Beyond One-Pass Decoding , 2017, NIPS.
[67] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[68] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[69] Xu Sun,et al. Global Encoding for Abstractive Summarization , 2018, ACL.
[70] Siqi Liu,et al. Improved Image Captioning via Policy Gradient optimization of SPIDEr , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[71] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[72] Christopher Potts,et al. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.
[73] Eunsol Choi,et al. TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension , 2017, ACL.
[74] Yueting Zhuang,et al. Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference , 2018, ACL.
[75] Heiga Zen,et al. Statistical parametric speech synthesis using deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[76] Feng Liu,et al. Actor-Critic Sequence Training for Image Captioning , 2017, ArXiv.
[77] Christopher Burgess,et al. DARLA: Improving Zero-Shot Transfer in Reinforcement Learning , 2017, ICML.
[78] Di He,et al. Decoding with Value Networks for Neural Machine Translation , 2017, NIPS.
[79] Alexander M. Rush,et al. Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.
[80] Eric P. Xing,et al. Connecting the Dots Between MLE and RL for Sequence Generation , 2018, DeepRLStructPred@ICLR.
[81] Christopher D. Manning,et al. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.
[82] Marc G. Bellemare,et al. The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning , 2017, ICLR.
[83] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[84] Jin-Hyuk Hong,et al. Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information , 2018, AAAI.
[85] Xiaojun Wan,et al. Abstractive Document Summarization with a Graph-Based Attentional Neural Model , 2017, ACL.
[86] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[87] Samy Bengio,et al. Tensor2Tensor for Neural Machine Translation , 2018, AMTA.
[88] Cristian Danescu-Niculescu-Mizil,et al. Chameleons in Imagined Conversations: A New Approach to Understanding Coordination of Linguistic Style in Dialogs , 2011, CMCL@ACL.
[89] Bowen Zhou,et al. SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.
[90] Li Fei-Fei,et al. End-to-End Learning of Action Detection from Frame Glimpses in Videos , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[91] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[92] Richard Socher,et al. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning , 2018, ArXiv.
[93] Yen-Chun Chen,et al. Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting , 2018, ACL.
[94] Luke S. Zettlemoyer,et al. Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language , 2015, EMNLP.
[95] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[96] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[97] Basura Fernando,et al. SPICE: Semantic Propositional Image Caption Evaluation , 2016, ECCV.
[98] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[99] Hang Li,et al. “ Tony ” DNN Embedding for “ Tony ” Selective Read for “ Tony ” ( a ) Attention-based Encoder-Decoder ( RNNSearch ) ( c ) State Update s 4 SourceVocabulary Softmax Prob , 2016 .
[100] William Yang Wang,et al. Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning , 2018, ACL.
[101] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[102] Jian Zhang,et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.
[103] Weinan Zhang,et al. Chinese Zero Pronoun Resolution with Deep Memory Network , 2017, EMNLP.
[104] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.
[105] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[106] Richard Socher,et al. Improving End-to-End Speech Recognition with Policy Learning , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[107] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[108] Quoc V. Le,et al. A Simple Method for Commonsense Reasoning , 2018, ArXiv.
[109] Ankur Bapna,et al. The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation , 2018, ACL.
[110] Geoffrey Zweig,et al. From captions to visual concepts and back , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[111] Vaibhava Goel,et al. Self-Critical Sequence Training for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[112] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.
[113] Alexander M. Rush,et al. Coarse-to-Fine Attention Models for Document Summarization , 2017, NFiS@EMNLP.
[114] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.
[115] Xuanjing Huang,et al. Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method , 2018, AAAI.
[116] Daniel Gildea,et al. The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.
[117] Percy Liang,et al. Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.
[118] Yuxing Peng,et al. Reinforced Mnemonic Reader for Machine Comprehension , 2017 .
[119] John N. Tsitsiklis,et al. Asynchronous stochastic approximation and Q-learning , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.
[120] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[121] Chen Liang,et al. Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.
[122] Anil A. Bharath,et al. Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.
[123] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[124] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[125] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[126] Mor Naaman,et al. Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies , 2018, NAACL.
[127] Dale Schuurmans,et al. Reward Augmented Maximum Likelihood for Neural Structured Prediction , 2016, NIPS.
[128] Yang Liu,et al. Minimum Risk Training for Neural Machine Translation , 2015, ACL.
[129] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.
[130] Diego Marcheggiani,et al. A Simple and Accurate Syntax-Agnostic Neural Model for Dependency-based Semantic Role Labeling , 2017, CoNLL.
[131] Richard Socher,et al. A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.
[132] Cyrus Rashtchian,et al. Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.
[133] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[134] Diego Marcheggiani,et al. Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling , 2017, EMNLP.
[135] Cícero Nogueira dos Santos,et al. Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.
[136] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[137] Jason Weston,et al. Large-scale Simple Question Answering with Memory Networks , 2015, ArXiv.
[138] Jason Weston,et al. Learning End-to-End Goal-Oriented Dialog , 2016, ICLR.
[139] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.
[140] Jianfeng Gao,et al. Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.
[141] Ramakanth Pasunuru,et al. Reinforced Video Captioning with Entailment Rewards , 2017, EMNLP.
[142] Yu Zhang,et al. Deep Reinforcement Learning for Chinese Zero Pronoun Resolution , 2018, ACL.
[143] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[144] Quoc V. Le,et al. A Neural Conversational Model , 2015, ArXiv.
[145] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[146] Koray Kavukcuoglu,et al. Multiple Object Recognition with Visual Attention , 2014, ICLR.
[147] Honglak Lee,et al. Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..
[148] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[149] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.
[150] Marc G. Bellemare,et al. The Reactor: A Sample-Efficient Actor-Critic Architecture , 2017, ArXiv.
[151] Yuxiang Wu,et al. Learning to Extract Coherent Summary via Deep Reinforcement Learning , 2018, AAAI.
[152] John Langford,et al. Search-based structured prediction , 2009, Machine Learning.
[153] Alexander M. Rush,et al. OpenNMT: Open-Source Toolkit for Neural Machine Translation , 2017, ACL.
[154] Richard Socher,et al. Dynamic Coattention Networks For Question Answering , 2016, ICLR.
[155] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[156] Naren Ramakrishnan,et al. Deep Transfer Reinforcement Learning for Text Summarization , 2018, SDM.
[157] Joelle Pineau,et al. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models , 2015, AAAI.
[158] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[159] Yi Yang,et al. WikiQA: A Challenge Dataset for Open-Domain Question Answering , 2015, EMNLP.
[160] Yajie Miao,et al. EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[161] Yoshua Bengio,et al. Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus , 2016, ACL.
[162] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[163] Navdeep Jaitly,et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.
[164] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[165] Percy Liang,et al. Compositional Semantic Parsing on Semi-Structured Tables , 2015, ACL.
[166] Chris Dyer,et al. Differentiable Scheduled Sampling for Credit Assignment , 2017, ACL.
[167] Ruslan Salakhutdinov,et al. Multimodal Neural Language Models , 2014, ICML.
[168] Wei Xu,et al. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) , 2014, ICLR.
[169] Kate Saenko,et al. Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text , 2016, EMNLP.
[170] Piji Li,et al. Actor-Critic based Training Framework for Abstractive Summarization , 2018, ArXiv.
[171] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.
[172] Min Sun,et al. A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss , 2018, ACL.
[173] Ruslan Salakhutdinov,et al. Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models , 2014, ArXiv.
[174] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.
[175] Xuanjing Huang,et al. Towards Diverse Text Generation with Inverse Reinforcement Learning , 2018, ArXiv.
[176] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[177] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.
[178] Pieter Abbeel,et al. On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient , 2010, NIPS.
[179] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[180] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.
[181] Xuanjing Huang,et al. Toward Diverse Text Generation with Inverse Reinforcement Learning , 2018, IJCAI.
[182] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[183] Holger Schwenk,et al. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data , 2017, EMNLP.
[184] Ferenc Huszar,et al. How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary? , 2015, ArXiv.
[185] Yong Yu,et al. Long Text Generation via Adversarial Training with Leaked Information , 2017, AAAI.
[186] Jonathan Berant,et al. Building a Semantic Parser Overnight , 2015, ACL.
[187] Sebastian Thrun,et al. Issues in Using Function Approximation for Reinforcement Learning , 1999 .