Robot Composite Learning and the Nunchaku Flipping Challenge

Advanced motor skills are essential for robots to physically coexist with humans. Research on robot dynamics and control has achieved hyper robot motor capabilities, but mostly through heavily case-specific engineering. Meanwhile, as a more general way for robots to acquire skills, robot learning from human demonstration (LfD) has made great progress, but it still has limitations in handling dynamic skills and compound actions. We present a composite learning scheme that goes beyond LfD and integrates robot learning from human definition, demonstration, and evaluation. The method tackles advanced motor skills that require dynamic, time-critical maneuvers, complex contact control, and the handling of partly soft, partly rigid objects. We also introduce the “nunchaku flipping challenge”, an extreme test that places hard requirements on all three of these aspects. Continuing from our previous presentations, this paper introduces the latest update of the composite learning scheme and the physical success of the nunchaku flipping challenge.
