Paper information - Imitating by Generating: Deep Generative Models for Imitation of Interactive Tasks

Coordinating actions with an interaction partner requires a constant exchange of sensorimotor signals. Humans acquire these skills in infancy and early childhood, mostly through imitation learning and active engagement with a skilled partner. These skills require the ability to predict and adapt to one's partner during an interaction. In this work, we explore these ideas in a human-robot interaction setting in which a robot is required to learn interactive tasks from a combination of observational and kinesthetic learning. To this end, we propose a deep learning framework consisting of components for (1) human and robot motion embedding, (2) motion prediction of the human partner, and (3) generation of robot joint trajectories matching the human motion. Because long-term motion prediction methods often suffer from regression to the mean, our technical contribution is a novel probabilistic latent variable model that predicts in latent space rather than in joint space. To test the proposed method, we collect human-human interaction data and human-robot interaction data for four interactive tasks: “hand-shake,” “hand-wave,” “parachute fist-bump,” and “rocket fist-bump.” We demonstrate experimentally the importance of predictive and adaptive components, as well as low-level abstractions, for successfully learning to imitate human behavior in interactive social tasks.
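The regression-to-the-mean problem mentioned in the abstract can be illustrated with a toy example. The sketch below is not the paper's model or data: the two trajectories, the variable names, and the helper functions are all hypothetical. It only shows why a single deterministic prediction trained on squared error in joint space tends toward an averaged, motionless trajectory when the future is ambiguous.

```python
# Toy illustration only; the trajectories and names are hypothetical,
# not taken from the paper.

# Two equally likely futures for one joint angle (radians) over four steps:
# the partner's hand moves either up or down.
future_up = [0.1, 0.3, 0.6, 1.0]
future_down = [-0.1, -0.3, -0.6, -1.0]
futures = [future_up, future_down]


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


def mse(pred, target):
    """Mean squared error between two trajectories of equal length."""
    return mean([(p - t) ** 2 for p, t in zip(pred, target)])


def expected_mse(pred):
    """Squared error averaged over the equally likely futures."""
    return mean([mse(pred, f) for f in futures])


# The single prediction that minimizes expected squared error is the
# per-step average of the possible futures.
mean_prediction = [mean(list(step)) for step in zip(*futures)]

print("mean prediction:", mean_prediction)
print("expected MSE of mean prediction:", expected_mse(mean_prediction))
print("expected MSE of committing to 'up':", expected_mse(future_up))
```

The averaged trajectory scores a lower expected error than committing to either real motion, yet it matches neither of them. This is the failure mode that motivates modeling the distribution over futures instead of a single point estimate.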
