Meta-level Priors for Learning Manipulation Skills with Sparse Features

Manipulation skills need to adapt to the geometric features of the objects that they are manipulating, e.g. the position or length of an action-relevant part of an object. However, only a sparse set of the objects’ features will be relevant for generalizing the manipulation skill between different scenarios and objects. Rather than relying on a human to select the relevant features, our work focuses on incorporating feature selection into the skill learning process. An informative prior over the features’ relevance can guide the robot’s feature selection process. This prior is computed using a meta-level prior, which is learned from previous skills. The meta-level prior is transferred to new skills using meta features. Our robot experiments show that using a meta-level prior results in better generalization performance and more efficient skill learning.

[1]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[2]  Alexander Fabisch,et al.  Towards Learning of Generic Skills for Robotic Manipulation , 2014, KI - Künstliche Intelligenz.

[3]  Yasemin Altun,et al.  Relative Entropy Policy Search , 2010 .

[4]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[5]  Naftali Tishby,et al.  Learning to Select Features using their Properties , 2008 .

[6]  Jun Nakanishi,et al.  Learning Movement Primitives , 2005, ISRR.

[7]  Daphne Koller,et al.  Learning a meta-level prior for feature relevance from multiple related tasks , 2007, ICML '07.

[8]  Oliver Kroemer,et al.  Learning to select and generalize striking movements in robot table tennis , 2012, AAAI Fall Symposium: Robots Learning Interactively from Human Teachers.

[9]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[10]  Oliver Kroemer,et al.  Learning visual representations for perception-action systems , 2011, Int. J. Robotics Res..

[11]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Gaurav S. Sukhatme,et al.  Learning Relevant Features for Manipulation Skills using Meta-Level Priors , 2016, ArXiv.

[13]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[14]  Bruno Castro da Silva,et al.  Learning parameterized motor skills on a humanoid robot , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[16]  Peter Englert,et al.  Multi-task policy search for robotics , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).