Image captioning based on deep reinforcement learning
暂无分享,去创建一个
Bo Wang | Peng Li | Zhenyu Wang | Haichao Shi | Bo Wang | Haichao Shi | Peng Li | Zhenyu Wang
[1] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[2] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[3] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.
[4] Ian R. Fasel,et al. Deep Belief Nets as Function Approximators for Reinforcement Learning , 2011, Lifelong Learning.
[5] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[7] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.
[8] Fasel Ian,et al. Deep Belief Nets as Function Approximators for Reinforcement Learning , 2011 .
[9] Xiaoyu Zhang,et al. Update vs. upgrade: Modeling with indeterminate multi-class active learning , 2015, Neurocomputing.
[10] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[11] Jiebo Luo,et al. Image Captioning with Semantic Attention , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[13] Alex M. Andrew,et al. ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).
[14] Richard Socher,et al. Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[16] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[17] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[18] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[20] Xin Jin,et al. Context-aware local abnormality detection in crowded scene , 2015, Science China Information Sciences.
[21] Changsheng Li,et al. Sparse representation for robust abnormality detection in crowded scenes , 2014, Pattern Recognit..
[22] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[23] Tao Mei,et al. Boosting Image Captioning with Attributes , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[24] Xiaoyu Zhang,et al. ListNet-based object proposals ranking , 2017, Neurocomputing.
[25] Xiaoyu Zhang,et al. Bidirectional Active Learning: A Two-Way Exploration Into Unlabeled and Labeled Data Set , 2015, IEEE Transactions on Neural Networks and Learning Systems.
[26] ZhangXiao-Yu. Simultaneous optimization for robust correlation estimation in partially observed social network , 2016 .
[27] Xinlei Chen,et al. Mind's eye: A recurrent visual representation for image caption generation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Chenliang Xu,et al. Image Caption Generation with Text-Conditional Semantic Attention , 2016, ArXiv.
[29] Martin A. Riedmiller,et al. Deep auto-encoder neural networks in reinforcement learning , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).
[30] Rama Chellappa,et al. From sample similarity to ensemble similarity: probabilistic distance measures in reproducing kernel Hilbert space , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[31] Geoffrey Zweig,et al. From captions to visual concepts and back , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] David Elkind,et al. Learning: An Introduction , 1968 .
[33] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.
[34] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[36] Xu Jia,et al. Guiding Long-Short Term Memory for Image Caption Generation , 2015, ArXiv.
[37] Xiaoyu Zhang,et al. Simultaneous optimization for robust correlation estimation in partially observed social network , 2016, Neurocomputing.
[38] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[39] Wei Xu,et al. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) , 2014, ICLR.
[40] Xiaoyu Zhang,et al. Interactive patent classification based on multi-classifier fusion and active learning , 2014, Neurocomputing.