Hierarchical Reinforcement Learning for Concurrent Discovery of Compound and Composable Policies
暂无分享,去创建一个
[1] Sergey Levine,et al. Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review , 2018, ArXiv.
[2] Stefan Schaal,et al. Robot Learning , 2017, Encyclopedia of Machine Learning and Data Mining.
[3] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[4] Dana H. Ballard,et al. Multiple-Goal Reinforcement Learning with Modular Sarsa(0) , 2003, IJCAI.
[5] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[6] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[7] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[8] Jan Peters,et al. Learning table tennis with a Mixture of Motor Primitives , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.
[9] Hussein A. Abbass,et al. Hierarchical Deep Reinforcement Learning for Continuous Action Control , 2018, IEEE Transactions on Neural Networks and Learning Systems.
[10] Misha Denil,et al. The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously , 2017, CoRL.
[11] Jan Peters,et al. Hierarchical Relative Entropy Policy Search , 2014, AISTATS.
[12] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[13] Vicenç Gómez,et al. A unified view of entropy-regularized Markov decision processes , 2017, ArXiv.
[14] Dewen Hu,et al. Multiobjective Reinforcement Learning: A Comprehensive Overview , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.
[15] Mitsuo Kawato,et al. Multiple Model-Based Reinforcement Learning , 2002, Neural Computation.
[16] Henry Zhu,et al. Soft Actor-Critic Algorithms and Applications , 2018, ArXiv.
[17] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[18] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[19] Jan Peters,et al. Hierarchical reinforcement learning of multiple grasping strategies with human instructions , 2018, Adv. Robotics.
[20] Sridhar Mahadevan,et al. Robot Learning , 1993 .
[21] Kenji Doya,et al. Combining learned controllers to achieve new goals based on linearly solvable MDPs , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[22] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[23] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[24] Christopher L. Simpkins,et al. Composable Modular Reinforcement Learning , 2019, AAAI.
[25] Jörn Malzahn,et al. Development of a human size and strength compliant bi-manual platform for realistic heavy manipulation tasks , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[26] Olivier Sigaud,et al. Policy Search in Continuous Action Domains: an Overview , 2018, Neural Networks.
[27] Jan Peters,et al. Learning elementary movements jointly with a higher level task , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[28] Sergey Levine,et al. Composable Deep Reinforcement Learning for Robotic Manipulation , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[29] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[30] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.