A Deep Convolutional Neural Network Model for Sense of Agency and Object Permanence in Robots

This work investigates the role of predictive models in the implementation of basic cognitive skills in robots, such as the capability to distinguish between self-generated actions and those generated by other individuals and the capability to maintain an enhanced internal visual representation of the world, where objects covered by the robot's own body in the original image may be visible in the enhanced one. A developmental approach is adopted for this purpose. In particular, a humanoid robot is learning, through a self-exploration behaviour, the sensory consequences (in the visual domain) of self-generated movements. The generated sensorimotor experience is used as training data for a deep convolutional neural network that maps proprioceptive and motor data (e.g. initial arm joint positions and applied motor commands) onto the visual consequences of these actions. This forward model is then used in two experiments. First, for generating visual predictions of self-generated movements, which are compared to actual visual perceptions and then used to compute a prediction error. This error is shown to be higher when there is an external subject performing actions, compared to situations where the robot is observing only itself. This supports the idea that prediction errors may serve as a cue for distinguishing between self and other, a fundamental prerequisite for the sense of agency. Secondly, we show how predictions can be used to attenuate self-generated movements, and thus create enhanced visual perceptions, where the sight of objects - originally occluded by the robot body - is still maintained. This may represent an important tool both for cognitive development in robots and for the understanding of the sense of object permanence in humans.

[1]  A. Craig,et al.  Ecstatic epileptic seizures: A potential window on the neural basis for human self-awareness , 2009, Epilepsy & Behavior.

[2]  Pieter Abbeel,et al.  InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[3]  Bruno Lara,et al.  Body Representations for Robot Ego-Noise Modelling and Prediction. Towards the Development of a Sense of Agency in Artificial Agents , 2016, ALIFE.

[4]  D. Wolpert,et al.  Evidence for sensory prediction deficits in schizophrenia. , 2005, The American journal of psychiatry.

[5]  S. Gallagher Philosophical conceptions of the self: implications for cognitive science , 2000, Trends in Cognitive Sciences.

[6]  Sven Vanneste,et al.  The predictive brain and the “free will” illusion , 2013, Front. Psychol..

[7]  A. Cangelosi,et al.  Developmental Robotics: From Babies to Robots , 2015 .

[8]  Tooba Ahsen,et al.  Object Permanence in Human Development and Robotics-A Survey , 2017 .

[9]  C. Stern,et al.  Where vision meets memory: prefrontal-posterior networks for visual object constancy during categorization and recognition. , 2008, Cerebral cortex.

[10]  M. Jeannerod,et al.  Sense of body and sense of action both contribute to self-recognition , 2002, Cognition.

[11]  Guillaume Thierry,et al.  Computational mechanisms of object constancy for visual recognition revealed by event-related potentials , 2007, Vision Research.

[12]  Zoubin Ghahramani,et al.  Perspectives and problems in motor learning , 2001, Trends in Cognitive Sciences.

[13]  Sarah Bechtle,et al.  On the sense of agency and of object permanence in robots , 2016, 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[14]  V. Hafner,et al.  Re-Enacting Sensorimotor Experience for Cognition , 2017 .

[15]  Karl J. Friston,et al.  Prediction, perception and agency , 2012, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[16]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[17]  Thomas Brox,et al.  Learning to Generate Chairs, Tables and Cars with Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Bruno Lara,et al.  Exploration Behaviors, Body Representations, and Simulation Processes for the Development of Cognition in Artificial Agents , 2016, Front. Robot. AI.

[19]  Ah Chung Tsoi,et al.  Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[20]  C. Frith,et al.  Self-awareness and action , 2003, Current Opinion in Neurobiology.

[21]  ChrisD . Frith,et al.  Perceiving is believing: a Bayesian approach to explaining the positive symptoms of schizophrenia , 2009, Nature Reviews Neuroscience.

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.