Robotic playing for hierarchical complex skill learning

In complex manipulation scenarios (e.g. tasks requiring the coordinated interaction of two hands, or in-hand manipulation), generalization is a hard problem. Current methods still require a substantial amount of (supervised) training data, strong assumptions about both the environment and the task, or both. Under these conditions, controllers that solve such tasks tend to be complex. We propose an alternative paradigm: maintain simpler controllers that solve the task in only a small number of specific situations. To generalize to a novel situation, the robot transforms the environment into a situation in which the solution of the task is already known. It does so by playing with objects and using previously trained skills (basis skills). These skills are organized in skill hierarchies and can be used either to estimate or to change the current state of the environment. The approach is evaluated in pick-and-place scenarios that involve complex manipulation. We further show that these skills can be learned by autonomous playing.
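The idea of reducing a novel situation to a known one can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the environment state has already been estimated and is a discrete label, and that each preparatory skill has a known, deterministic outcome. All state names, skill names, and the function `plan_with_basis_skills` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical: states in which a simple controller already solves the task.
KNOWN_SOLUTIONS: Dict[str, str] = {"upright": "pick_up"}

# Hypothetical: basis skills that change the state, keyed by the state they
# apply to, with the state they are assumed to produce.
PREPARATORY_SKILLS: Dict[str, Tuple[str, str]] = {
    "rotated_180": ("rotate_90", "rotated_90"),
    "rotated_90": ("rotate_90", "upright"),
}


def plan_with_basis_skills(
    state: str,
    known: Dict[str, str] = KNOWN_SOLUTIONS,
    preparatory: Dict[str, Tuple[str, str]] = PREPARATORY_SKILLS,
    max_preparations: int = 5,
) -> Optional[List[str]]:
    """Chain preparatory skills until a state with a known solution is reached.

    Returns the list of skill names to execute, or None if no known
    situation can be reached within max_preparations steps.
    """
    plan: List[str] = []
    while True:
        if state in known:
            # The simple controller applies here, so the plan is complete.
            return plan + [known[state]]
        if len(plan) >= max_preparations or state not in preparatory:
            # Out of budget, or no basis skill is known for this state.
            return None
        skill_name, next_state = preparatory[state]
        plan.append(skill_name)
        state = next_state


if __name__ == "__main__":
    print(plan_with_basis_skills("rotated_180"))
```

In this toy setting the simple controller only handles the `upright` case, and every other case is handled by first changing the environment. A real system would additionally need a sensing skill to estimate the state and would have to cope with uncertain skill outcomes.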
