暂无分享,去创建一个
Alex Graves | Marc G. Bellemare | Rémi Munos | Koray Kavukcuoglu | Jacob Menick | K. Kavukcuoglu | R. Munos | A. Graves | Jacob Menick | Alex Graves
[1] J. Rissanen. Stochastic Complexity and Modeling , 1986 .
[2] Stewart W. Wilson,et al. A Possibility for Implementing Curiosity and Boredom in Model-Building Neural Controllers , 1991 .
[3] Geoffrey E. Hinton,et al. Keeping the neural networks simple by minimizing the description length of the weights , 1993, COLT '93.
[4] J. Elman. Learning and development in neural networks: the importance of starting small , 1993, Cognition.
[5] S. Hochreiter,et al. REINFORCEMENT DRIVEN INFORMATION ACQUISITION IN NONDETERMINISTIC ENVIRONMENTS , 1995 .
[6] Mark Herbster,et al. Tracking the Best Expert , 1995, Machine-mediated learning.
[7] Hermann Ney,et al. Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[8] Timothy C. Bell,et al. A corpus for the evaluation of lossless compression algorithms , 1997, Proceedings DCC '97. Data Compression Conference.
[9] Andrew McCallum,et al. Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.
[10] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[11] P. Grünwald. The Minimum Description Length Principle (Adaptive Computation and Machine Learning) , 2007 .
[12] Pierre-Yves Oudeyer,et al. Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.
[13] Burr Settles,et al. Active Learning Literature Survey , 2009 .
[14] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[15] Pierre Baldi,et al. Bayesian surprise attracts human attention , 2005, Vision Research.
[16] Alex Graves,et al. Practical Variational Inference for Neural Networks , 2011, NIPS.
[17] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..
[18] Pierre-Yves Oudeyer,et al. The strategic student approach for life-long exploration and learning , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).
[19] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[20] Andrew G. Barto,et al. Intrinsic Motivation and Reinforcement Learning , 2013, Intrinsically Motivated Learning in Natural and Artificial Systems.
[21] Wojciech Zaremba,et al. Learning to Execute , 2014, ArXiv.
[22] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[23] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[24] Pierre-Yves Oudeyer,et al. Multi-Armed Bandits for Intelligent Tutoring Systems , 2013, EDM.
[25] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.
[26] Julien Cornebise,et al. Weight Uncertainty in Neural Networks , 2015, ArXiv.
[27] Ariel D. Procaccia,et al. Variational Dropout and the Local Reparameterization Trick , 2015, NIPS.
[28] Yulia Tsvetkov,et al. Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning , 2016, ACL.
[29] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[30] Tom Schaul,et al. Unifying Count-Based Exploration and Intrinsic Motivation , 2016, NIPS.
[31] Sergio Gomez Colmenarejo,et al. Hybrid computing using a neural network with dynamic external memory , 2016, Nature.
[32] Jason Weston,et al. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks , 2015, ICLR.
[33] Nando de Freitas,et al. Neural Programmer-Interpreters , 2015, ICLR.
[34] Feryal Behbahani. Automated Curriculum Learning , 2018 .