panda-gym: Open-source goal-conditioned environments for robotic learning
暂无分享,去创建一个
Marcin Andrychowicz | Bob McGrew | Bowen Baker | Matthias Plappert | Jonas Schneider | Glenn Powell | E. Dellandréa | Liming Chen | Nicolas Cazin | Quentin Gallouedec
[1] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[2] Marcin Andrychowicz,et al. Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research , 2018, ArXiv.
[3] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[4] Konkoly Thege. Multi-criteria Reinforcement Learning , 1998 .
[5] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[6] Boris Polyak,et al. Acceleration of stochastic approximation by averaging , 1992 .
[7] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[8] Silvio Savarese,et al. Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[9] Sriraam Natarajan,et al. Dynamic preferences in multi-criteria reinforcement learning , 2005, ICML.
[10] David Howard,et al. A Review of Physics Simulators for Robotic Applications , 2021, IEEE Access.