Learning to Plan via a Multi-Step Policy Regression Method
Stefan Wagner | Stefan Harmeling | Tobias Uelwer | Michael Janschek
[1] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[2] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003 .
[3] Shie Mannor,et al. Beyond the One Step Greedy Approach in Reinforcement Learning , 2018, ICML.
[4] Razvan Pascanu,et al. Distilling Policy Distillation , 2019, AISTATS.
[5] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[6] Xipeng Shen,et al. Deep reuse: streamline CNN inference on the fly via coarse-grained computation reuse , 2019, ICS.
[7] Tom Eccles,et al. An investigation of model-free planning , 2019, ICML.
[8] Aleksandr I. Panov,et al. Grid Path Planning with Deep Reinforcement Learning: Preliminary Results , 2017, BICA.
[9] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[10] Kee-Eung Kim,et al. Reinforcement Learning for Control with Multiple Frequencies , 2020, NeurIPS.
[11] Nils J. Nilsson,et al. A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..
[12] Balaraman Ravindran,et al. Dynamic Action Repetition for Deep Reinforcement Learning , 2017, AAAI.
[13] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[14] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[15] Richard S. Sutton,et al. Multi-step Reinforcement Learning: A Unifying Algorithm , 2017, AAAI.
[16] Razvan Pascanu,et al. Policy Distillation , 2015, ICLR.
[17] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[18] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.