论文信息 - A Representation for General Pouring Behavior

A Representation for General Pouring Behavior

We introduce our work on pouring as an example of complicated tasks for robots. The architecture is a skill library with planning and learning methods. We briefly describe our representation for pouring, then discuss problems and future directions.

C. Atkeson | Akihiko Yamaguchi

[1] Tsukasa Ogasawara,et al. Pouring Skills with Planning and Learning Modeled from Human Demonstrations , 2015, Int. J. Humanoid Robotics.

[2] Nolan Wagener,et al. Learning contact-rich manipulation skills with guided policy search , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[3] J. Takamatsu,et al. DCOB: Action space for reinforcement learning of high DoF robots , 2013, Auton. Robots.

[4] Jan Peters,et al. Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .

[5] Stefan Schaal,et al. Reinforcement learning of motor skills in high dimensions: A path integral approach , 2010, 2010 IEEE International Conference on Robotics and Automation.

[6] Pieter Abbeel,et al. Cloth grasp point detection based on multiple-view geometric cues with application to robotic towel folding , 2010, 2010 IEEE International Conference on Robotics and Automation.

[7] Jan Peters,et al. Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .

[8] Pierre Geurts,et al. Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..

[9] S. Schaal,et al. Robot juggling: implementation of memory-based learning , 1994, IEEE Control Systems.

[10] Stefanie Tellex,et al. Interpreting and Executing Recipes with a Cooking Robot , 2012, ISER.

[11] C. Atkeson,et al. Minimax differential dynamic programming: application to a biped walking robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[12] D. Mayne. A Second-order Gradient Method for Determining Optimal Trajectories of Non-linear Discrete-time Systems , 1966 .