论文信息 - Learning to select and generalize striking movements in robot table tennis

Learning to select and generalize striking movements in robot table tennis

Learning new motor tasks from physical interactions is an important goal for both robotics and machine learning. However, when moving beyond basic skills, most monolithic machine learning approaches fail to scale. For more complex skills, methods that are tailored for the domain of skill learning are needed. In this paper, we take the task of learning table tennis as an example and present a new framework that allows a robot to learn cooperative table tennis from physical interaction with a human. The robot first learns a set of elementary table tennis hitting movements from a human table tennis teacher by kinesthetic teach-in, which is compiled into a set of motor primitives represented by dynamical systems. The robot subsequently generalizes these movements to a wider range of situations using our mixture of motor primitives approach. The resulting policy enables the robot to select appropriate motor primitives as well as to generalize between them. Finally, the robot plays with a human table tennis partner and learns online to improve its behavior. We show that the resulting setup is capable of playing table tennis using an anthropomorphic robot arm.

[1] J. T. O'Hanlan. The Fosbury flop. , 1968, Virginia medical monthly.

[2] T. Flash,et al. The coordination of arm movements: an experimentally confirmed mathematical model , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[3] John Knight,et al. Pingpong-playing robot controlled by a microcomputer , 1986, Microprocess. Microsystems.

[4] Hideaki Hashimoto,et al. Development Of A Pingpong Robot System Using 7 Degrees Of Freedom Direct Drive Arm , 1987, Other Conferences.

[5] Russell L. Anderson,et al. A Robot Ping-Pong Player: Experiments in Real-Time Intelligent Control , 1988 .

[6] John T. Wen,et al. A robot ping pong player: optimized mechanics, high performance 3D vision, and intelligent sensor control , 1990, Robotersysteme.

[7] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[8] S. Schaal,et al. A Kendama Learning Robot Based on Bi-directional Theory , 1996, Neural Networks.

[9] Stefan Schaal,et al. Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[10] Ilya A. Rybak,et al. A Neurobiological Perspective on Humanoid Robot Design , 2000, IEEE Intell. Syst..

[11] Ilya A. Rybak,et al. Neurobiological and neurorobotic approaches to control architectures for a humanoid motor system , 2001, Robotics Auton. Syst..

[12] Jun Nakanishi,et al. Movement imitation with nonlinear dynamical systems in humanoid robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[13] Christopher M. Bishop,et al. Bayesian Regression and Classification , 2003 .

[14] Stefan Schaal,et al. Computational approaches to motor learning by imitation. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[15] Jun Nakanishi,et al. Learning Movement Primitives , 2005, ISRR.

[16] Juan A. Méndez,et al. Ping-pong player prototype , 2003, IEEE Robotics Autom. Mag..

[17] Stefan Schaal,et al. Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[18] Jun Morimoto,et al. Learning from demonstration and adaptation of biped locomotion , 2004, Robotics Auton. Syst..

[19] Stefan Schaal,et al. Rapid synchronization and accurate phase-locking of rhythmic motor primitives , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.

[21] Sadao Kawamura,et al. Advances in Robot Control , 2006 .

[22] Fumio Miyazaki,et al. Learning to Dynamically Manipulate: A Table Tennis Robot Controls a Ball and Rallies with a Human Being , 2006 .

[23] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[24] Stefan Schaal,et al. Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25] Aude Billard,et al. Reinforcement learning for imitating constrained reaching movements , 2007, Adv. Robotics.

[26] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.

[27] Marc Toussaint,et al. Modelling motion primitives and their timing in biologically executed movements , 2007, NIPS.

[28] Aude Billard,et al. On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[29] Stefan Schaal,et al. Dynamics systems vs. optimal control--a unifying view. , 2007, Progress in brain research.

[30] Jan Peters,et al. Using Bayesian Dynamical Systems for Motion Template Libraries , 2008, NIPS.

[31] Stefan Schaal,et al. Movement reproduction and obstacle avoidance with dynamic movement primitives and potential fields , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[32] Stefan Schaal,et al. Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[33] Betty J. Mohler,et al. Learning perceptual coupling for motor primitives , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[34] Stefan Schaal,et al. 2008 Special Issue: Reinforcement learning of motor skills with policy gradients , 2008 .

[35] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[36] Henk Nijmeijer,et al. Robot Programming by Demonstration , 2010, SIMPAR.

[37] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[38] Sethu Vijayakumar,et al. Using dimensionality reduction to exploit constraints in reinforcement learning , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39] Christoph H. Lampert,et al. Movement templates for learning of hitting and batting , 2010, 2010 IEEE International Conference on Robotics and Automation.

[40] Jan Peters,et al. Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .

[41] Jan Peters,et al. A biomimetic approach to robot table tennis , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42] Oliver Kroemer,et al. Grasping with Vision Descriptors and Motor Primitives , 2010, ICINCO.

[43] Jun Morimoto,et al. Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives , 2010, IEEE Transactions on Robotics.

[44] De Xu,et al. Visual Measurement and Prediction of Ball Trajectory for Table Tennis Robot , 2010, IEEE Transactions on Instrumentation and Measurement.

[45] Christoph H. Lampert,et al. Real-time detection of colored objects in multiple camera streams with off-the-shelf hardware components , 2012, Journal of Real-Time Image Processing.

[46] Jörg Stückler,et al. Learning Motion Skills from Expert Demonstrations and Own Experience using Gaussian Process Regression , 2010, ISR/ROBOTIK.

[47] Jan Peters,et al. Learning table tennis with a Mixture of Motor Primitives , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[48] T. I. James Tsay,et al. Self-Learning for a Humanoid Robotic Ping-Pong Player , 2011, Adv. Robotics.

[49] Minija Tamosiunaite,et al. Accurate position and velocity control for trajectories based on dynamic movement primitives , 2011, 2011 IEEE International Conference on Robotics and Automation.

[50] De Xu,et al. Trajectory prediction of spinning ball for ping-pong player robot , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[51] Jan Peters,et al. Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .

[52] Bernhard Schölkopf,et al. Probabilistic Modeling of Human Movements for Intention Inference , 2012, Robotics: Science and Systems.