A predictive network architecture for a robust and smooth robot docking behavior

Robots and living beings exhibit latencies in their sensorimotor processing due to mechanical and electronic or neural processing delays. A reaction typically occurs to input stimuli of the past. This is critical not only when the environment changes (e.g. moving objects) but also when the agent itself moves. An agent that does not predict while moving may need to remain static between sensory input acquisition and output response to guarantee that the response is appropriate to the percept. We propose a biologically-inspired learning model of predictive sensorimotor integration to compensate for this latency. In this model, an Elman network is developed for sensory prediction and sensory filtering; a Continuous Actor-Critic Learning Automaton (CACLA) is trained for continuous action generation. For a robot docking experiment, this architecture improves the smoothness of the robot’s sensory input and therefore results in a faster and more accurate continuous approach behavior.

[1]  Ralf Möller,et al.  A model of ant navigation based on visual prediction. , 2012, Journal of theoretical biology.

[2]  Reza Shadmehr,et al.  Learning from Sensory and Reward Prediction Errors during Motor Adaptation , 2011, PLoS Comput. Biol..

[3]  J. Moran,et al.  Sensation and perception , 1980 .

[4]  C. Gilbert,et al.  Perceptual learning and top-down influences in primary visual cortex , 2004, Nature Neuroscience.

[5]  Ninad Pradhan,et al.  Robot crowd navigation using predictive position fields in the potential function framework , 2011, Proceedings of the 2011 American Control Conference.

[6]  G. L. Masson,et al.  Feedback inhibition controls spike transfer in hybrid thalamic circuits , 2002, Nature.

[7]  M.A. Wiering,et al.  Reinforcement Learning in Continuous Action Spaces , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[8]  Jose Rodriguez,et al.  IEEE Transactions on Industrial Electronics: Guest Editorial , 2002 .

[9]  Stefan Wermter,et al.  Real-world reinforcement learning for autonomous humanoid robot docking , 2012, Robotics Auton. Syst..

[10]  Stefan Wermter,et al.  Learning Features and Predictive Transformation Encoding Based on a Horizontal Product Model , 2012, ICANN.

[11]  D. Mackay Perceptual Stability of a Stroboscopically Lit Visual Field containing Self-Luminous Objects , 1958, Nature.

[12]  F. Craik,et al.  The Oxford handbook of memory , 2006 .

[13]  J. Krakauer,et al.  Error correction, sensory prediction, and adaptation in motor control. , 2010, Annual review of neuroscience.

[14]  Stefan Wermter,et al.  A neural approach for robot navigation based on cognitive map learning , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[15]  Tommy Chang,et al.  Road detection and tracking for autonomous mobile robots , 2002, SPIE Defense + Commercial Sensing.

[16]  Philip E. Bourne,et al.  PLoS Computational Biology: A New Community Journal , 2005, PLoS Comput. Biol..

[17]  U. Eysel,et al.  Orientation-specific relationship between populations of excitatory and inhibitory lateral connections in the visual cortex of the cat. , 1997, Cerebral cortex.

[18]  Sebastian Thrun,et al.  Bayesian Landmark Learning for Mobile Robot Localization , 1998, Machine Learning.

[19]  R. Nijhawan,et al.  Visual decomposition of colour through motion extrapolation , 1997, Nature.

[20]  Tao Xiong,et al.  A combined SVM and LDA approach for classification , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[21]  Tim Kiemel,et al.  Navigating sensory conflict in dynamic environments using adaptive state estimation , 2011, Biological Cybernetics.

[22]  Chuang Liu,et al.  IEEE International Conference on Robotics and Biomimetics , 2014 .

[23]  Tetsuya Ogata,et al.  Predicting Object Dynamics From Visual Images Through Active Sensing Experiences , 2008, Adv. Robotics.

[24]  Steffen Udluft,et al.  Learning long-term dependencies with recurrent neural networks , 2008, Neurocomputing.

[25]  K. Cullen Sensory signals during active versus passive movement , 2004, Current Opinion in Neurobiology.

[26]  M. Bar,et al.  Top-down predictions in the cognitive brain , 2007, Brain and Cognition.

[27]  M. Bar A Cortical Mechanism for Triggering Top-Down Facilitation in Visual Object Recognition , 2003, Journal of Cognitive Neuroscience.

[28]  Philippe Gaussier,et al.  Biologically inspired neural networks for spatio-temporal planning in robotic navigation tasks , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[29]  Oussama Khatib,et al.  Real-Time Obstacle Avoidance for Manipulators and Mobile Robots , 1985, Autonomous Robot Vehicles.

[30]  Gordon Cheng,et al.  Learning to Act from Observation and Practice , 2004, Int. J. Humanoid Robotics.

[31]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[32]  C. Gilbert,et al.  Synaptic physiology of horizontal connections in the cat's visual cortex , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[33]  S. Y. Chen,et al.  Kalman Filter for Robot Vision: A Survey , 2012, IEEE Transactions on Industrial Electronics.

[34]  Nikolaos G. Tsagarakis,et al.  A passivity based admittance control for stabilizing the compliant humanoid COMAN , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).

[35]  Pascal Frossard,et al.  IEEE Transactions on Circuits and Systems for Video Technology , 2008 .

[36]  Giulio Sandini,et al.  Sensory prediction for autonomous robots , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[37]  R. Olberg,et al.  Visual control of prey-capture flight in dragonflies , 2012, Current Opinion in Neurobiology.

[38]  Michael W. Spratling Predictive coding as a model of biased competition in visual attention , 2008, Vision Research.

[39]  Pitoyo Hartono,et al.  Fast reinforcement learning for simple physical robots , 2009, Memetic Comput..

[40]  G. Metta,et al.  Learning precise 3D reaching in a humanoid robot , 2007, 2007 IEEE 6th International Conference on Development and Learning.

[41]  Caspar M. Schwiedrzik,et al.  Stimulus Predictability Reduces Responses in Primary Visual Cortex , 2010, The Journal of Neuroscience.

[42]  Gian Luca Foresti,et al.  Object recognition and tracking for remote video surveillance , 1999, IEEE Trans. Circuits Syst. Video Technol..

[43]  George A. Constantinides,et al.  A Floating-point Extended Kalman Filter Implementation for Autonomous Mobile Robots , 2009, J. Signal Process. Syst..

[44]  Loulin Huang,et al.  Velocity planning for a mobile robot to track a moving target - a potential field approach , 2009, Robotics Auton. Syst..

[45]  V. Lamme,et al.  The distinct modes of vision offered by feedforward and recurrent processing , 2000, Trends in Neurosciences.

[46]  G. Pourtois,et al.  Top-down effects on early visual processing in humans: A predictive coding framework , 2011, Neuroscience & Biobehavioral Reviews.

[47]  Romi Nijhawan,et al.  Motion extrapolation in catching , 1994, Nature.

[48]  Alan F. Murray,et al.  International Joint Conference on Neural Networks , 1993 .

[49]  L. Trainor Predictive information processing is a fundamental learning mechanism present in early development: evidence from infants. , 2012, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[50]  Jun Miura,et al.  On-line road boundary modeling with multiple sensory features, flexible road model, and particle filter , 2011, Robotics Auton. Syst..

[51]  M. Corbetta,et al.  Control of goal-directed and stimulus-driven attention in the brain , 2002, Nature Reviews Neuroscience.