Learning predictive models of a depth camera & manipulator from raw execution traces

In this paper, we attack the problem of learning a predictive model of a depth camera and manipulator directly from raw execution traces. While the problem of learning manipulator models from visual and proprioceptive data has been addressed before, existing techniques often rely on assumptions about the structure of the robot or tracked features in observation space. We make no such assumptions. Instead, we formulate the problem as that of learning a high-dimensional controlled stochastic process. We leverage recent work on nonparametric predictive state representations to learn a generative model of the depth camera and robotic arm from sequences of uninterpreted actions and observations. We perform several experiments in which we demonstrate that our learned model can accurately predict future depth camera observations in response to sequences of motor commands.

[1]  Benjamin Kuipers The Map-Learning Critter , 1985 .

[2]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[3]  Yoshua Bengio,et al.  An Input Output HMM Architecture , 1994, NIPS.

[4]  梶田 尚志,et al.  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'97) , 1998 .

[5]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[6]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[7]  Herbert Jaeger,et al.  Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.

[8]  Richard S. Sutton,et al.  Predictive Representations of State , 2001, NIPS.

[9]  Michael R. James,et al.  Predictive State Representations: A New Theory for Modeling Dynamical Systems , 2004, UAI.

[10]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[11]  Tohru Katayama,et al.  Subspace Methods for System Identification , 2005 .

[12]  Yiannis Demiris,et al.  Learning Forward Models for Robots , 2005, IJCAI.

[13]  David J. Fleet,et al.  Gaussian Process Dynamical Models , 2005, NIPS.

[14]  Satinder P. Singh,et al.  Predictive Linear-Gaussian Models of Stochastic Dynamical Systems , 2005, UAI.

[15]  Michael H. Bowling,et al.  Learning predictive state representations using non-blind policies , 2006, ICML '06.

[16]  Le Song,et al.  A Hilbert Space Embedding for Distributions , 2007, IFIP Working Conference on Database Semantics.

[17]  Le Song,et al.  A Hilbert Space Embedding for Distributions , 2007, Discovery Science.

[18]  Satinder P. Singh,et al.  On discovery and learning of models with predictive representations of state for agents with continuous actions and observations , 2007, AAMAS '07.

[19]  Wolfram Burgard,et al.  Unsupervised body scheme learning through self-perception , 2008, 2008 IEEE International Conference on Robotics and Automation.

[20]  Bernhard Schölkopf,et al.  Injective Hilbert Space Embeddings of Probability Measures , 2008, COLT.

[21]  Aude Billard,et al.  Online Learning of the Body Schema , 2008, Int. J. Humanoid Robotics.

[22]  Wolfram Burgard,et al.  Body schema learning for robotic manipulators from visual self-perception , 2009, Journal of Physiology - Paris.

[23]  Byron Boots,et al.  Closing the learning-planning loop with predictive state representations , 2009, Int. J. Robotics Res..

[24]  Le Song,et al.  Hilbert Space Embeddings of Hidden Markov Models , 2010, ICML.

[25]  Byron Boots,et al.  Closing the learning-planning loop with predictive state representations , 2009, Int. J. Robotics Res..

[26]  Joseph Modayil,et al.  Discovering sensor space: Constructing spatial embeddings that explain sensor correlations , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[27]  Richard M. Murray,et al.  Bootstrapping sensorimotor cascades: A group-theoretic perspective , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Carl E. Rasmussen,et al.  Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning , 2011, Robotics: Science and Systems.

[29]  Richard M. Murray,et al.  Bootstrapping bilinear models of robotic sensorimotor cascades , 2011, 2011 IEEE International Conference on Robotics and Automation.

[30]  Bart De Moor,et al.  Subspace Identification for Linear Systems: Theory ― Implementation ― Applications , 2011 .

[31]  Le Song,et al.  Kernel Bayes' Rule , 2010, NIPS.

[32]  Dieter Fox,et al.  Manipulator and object tracking for in-hand 3D object modeling , 2011, Int. J. Robotics Res..

[33]  Byron Boots,et al.  An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems , 2011, AAAI.

[34]  Stefan Ulbrich,et al.  Kinematic Bézier Maps , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[35]  Richard M. Murray,et al.  Learning diffeomorphism models of robotic sensorimotor cascades , 2012, 2012 IEEE International Conference on Robotics and Automation.

[36]  Guy Lever,et al.  Modelling transition dynamics in MDPs with RKHS embeddings , 2012, ICML.

[37]  Alex Simpkins,et al.  System Identification: Theory for the User, 2nd Edition (Ljung, L.; 1999) [On the Shelf] , 2012, IEEE Robotics & Automation Magazine.

[38]  Joelle Pineau,et al.  Mixed Observability Predictive State Representations , 2013, AAAI.

[39]  Byron Boots,et al.  Hilbert Space Embeddings of Predictive State Representations , 2013, UAI.

[40]  Joelle Pineau,et al.  Modelling Sparse Dynamical Systems with Compressed Predictive State Representations , 2013, ICML.

[41]  Anima Anandkumar,et al.  Tensor decompositions for learning latent variable models , 2012, J. Mach. Learn. Res..