暂无分享,去创建一个
[1] Chris R Sims,et al. Efficient coding explains the universal law of generalization in human perception , 2018, Science.
[2] Dawn Xiaodong Song,et al. Assessing Generalization in Deep Reinforcement Learning , 2018, ArXiv.
[3] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.
[4] Toby Berger,et al. Rate distortion theory : a mathematical basis for data compression , 1971 .
[5] Jordi Grau-Moya,et al. Soft Q-Learning with Mutual-Information Regularization , 2018, ICLR.
[6] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[7] Vicenç Gómez,et al. A unified view of entropy-regularized Markov decision processes , 2017, ArXiv.
[8] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[9] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[10] Yee Whye Teh,et al. Exploiting Hierarchy for Learning and Transfer in KL-regularized RL , 2019, ArXiv.
[11] Malcolm J. A. Strens,et al. A Bayesian Framework for Reinforcement Learning , 2000, ICML.
[12] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[13] Henry Zhu,et al. Soft Actor-Critic Algorithms and Applications , 2018, ArXiv.
[14] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.