[1] Suguru Arimoto, et al. An algorithm for computing the capacity of arbitrary discrete memoryless channels, 1972, IEEE Trans. Inf. Theory.
[2] Richard E. Blahut, et al. Computation of channel capacity and rate-distortion functions, 1972, IEEE Trans. Inf. Theory.
[3] Jürgen Schmidhuber, et al. Curious model-building control systems, 1991, Proceedings of the 1991 IEEE International Joint Conference on Neural Networks.
[4] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.
[5] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.
[6] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.
[7] Andrew G. Barto, et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density, 2001, ICML.
[8] Doina Precup, et al. Learning Options in Reinforcement Learning, 2002, SARA.
[9] Ronald J. Williams, et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, 1992, Machine Learning.
[10] Pierre-Yves Oudeyer, et al. How can we define intrinsic motivation, 2008.
[12] Jürgen Schmidhuber, et al. Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010), 2010, IEEE Transactions on Autonomous Mental Development.
[13] David Silver, et al. Compositional Planning Using Optimal Option Models, 2012, ICML.
[14] Christoph Salge, et al. Empowerment - an Introduction, 2013, ArXiv.
[15] Shie Mannor, et al. Time-regularized interrupting options, 2014, ICML.
[16] Tom Schaul, et al. Universal Value Function Approximators, 2015, ICML.
[17] Shakir Mohamed, et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning, 2015, NIPS.
[18] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[19] Filip De Turck, et al. VIME: Variational Information Maximizing Exploration, 2016, NIPS.
[20] Alex Graves, et al. Strategic Attentive Writer for Learning Macro-Actions, 2016, NIPS.
[21] Tom Schaul, et al. Unifying Count-Based Exploration and Intrinsic Motivation, 2016, NIPS.
[22] J. Schulman, et al. Variational Information Maximizing Exploration, 2016.
[23] Joshua B. Tenenbaum, et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, 2016, NIPS.
[24] Doina Precup, et al. The Option-Critic Architecture, 2016, AAAI.