Enhancing Binocular Depth Estimation Based on Proactive Perception and Action Cyclic Learning for an Autonomous Developmental Robot

In humans, perception and action (PA) possess cyclically causal relations. In this paper, we propose a new PA-based cyclic learning framework to autonomously enhance the depth-estimation accuracy of a humanoid robot and perform given behavioral tasks. The proposed method integrates the concepts of sensory invariance-driven action and object-size invariance to autonomously enhance the depth-estimation accuracy. If the depth estimation is reliable, the reinforcement learning framework is used to generate goal-directed actions of a humanoid robot based on a perceived environment. Iterative PA cycles of a robot autonomously refine its depth-estimation. The proposed method is evaluated using a humanoid robot (NAO) with stereo cameras, and the experimental results demonstrate that the proposed framework is effective for autonomously enhancing both the depth-estimation accuracy and the action-generation performance.

[1]  Jürgen Schmidhuber,et al.  Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010) , 2010, IEEE Transactions on Autonomous Mental Development.

[2]  Nassim Mafi,et al.  Intrinsically motivated information foraging , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[3]  Olivier Stasse,et al.  Visually-Guided Grasping while Walking on a Humanoid Robot , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[4]  Karl Kral,et al.  Side-to-side head movements to obtain motion depth cues: A short review of research on the praying mantis , 1998, Behavioural Processes.

[5]  Mark Wexler,et al.  Depth perception by the active observer , 2005, Trends in Cognitive Sciences.

[6]  S. Chieffi,et al.  Visual illusion and action , 1996, Neuropsychologia.

[7]  Richard L. Lewis,et al.  Intrinsically Motivated Reinforcement Learning: An Evolutionary Perspective , 2010, IEEE Transactions on Autonomous Mental Development.

[8]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[9]  Karl Kral,et al.  Behavioural–analytical studies of the role of head movements in depth perception in insects, birds and mammals , 2003, Behavioural Processes.

[10]  Minho Lee,et al.  Autonomous and Interactive Improvement of Binocular Visual Depth Estimation through Sensorimotor Interaction , 2013, IEEE Transactions on Autonomous Mental Development.

[11]  Giulio Sandini,et al.  A Vision-Based Learning Method for Pushing Manipulation , 1993 .

[12]  Changyin Sun,et al.  Model Identification and Control Design for a Humanoid Robot , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[13]  Young-Jun Son,et al.  Vision-Based Target Detection and Localization via a Team of Cooperative UAV and UGVs , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[14]  Minho Lee,et al.  Autonomous Depth Perception of Humanoid Robot Using Binocular Vision System Through Sensorimotor Interaction with Environment , 2015, ICONIP.

[15]  G. Rizzolatti,et al.  Action recognition in the premotor cortex. , 1996, Brain : a journal of neurology.

[16]  James J. Gibson,et al.  The Ecological Approach to Visual Perception: Classic Edition , 2014 .

[17]  Mikko Nuutinen,et al.  Method for measuring stereo camera depth accuracy based on stereoscopic vision , 2011, Electronic Imaging.

[18]  Pierre Blazevic,et al.  The NAO humanoid: a combination of performance and affordability , 2008, ArXiv.

[19]  Renquan Lu,et al.  Trajectory-Tracking Control of Mobile Robot Systems Incorporating Neural-Dynamic Optimized Model Predictive Approach , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[20]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[21]  Walter J. Freeman,et al.  Consciousness, Intentionality, and Causality , 2001 .

[22]  Dewen Hu,et al.  Multiobjective Reinforcement Learning: A Comprehensive Overview , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[23]  Giorgio Metta,et al.  Active object recognition on a humanoid robot , 2012, 2012 IEEE International Conference on Robotics and Automation.

[24]  R. Passingham,et al.  Action observation and acquired motor skills: an FMRI study with expert dancers. , 2005, Cerebral cortex.

[25]  Pierre Blazevic,et al.  Mechatronic design of NAO humanoid , 2009, 2009 IEEE International Conference on Robotics and Automation.

[26]  Michail G. Lagoudakis,et al.  Complete analytical inverse kinematics for NAO , 2013, 2013 13th International Conference on Autonomous Robot Systems.

[27]  Oliver Kroemer,et al.  Learning visual representations for perception-action systems , 2011, Int. J. Robotics Res..

[28]  Michail G. Lagoudakis,et al.  Complete Analytical Forward and Inverse Kinematics for the NAO Humanoid Robot , 2015, J. Intell. Robotic Syst..

[29]  R. Held,et al.  MOVEMENT-PRODUCED STIMULATION IN THE DEVELOPMENT OF VISUALLY GUIDED BEHAVIOR. , 1963, Journal of comparative and physiological psychology.

[30]  Daniel Chern-Yeow Eng,et al.  Autonomous Learning of the Semantics of Internal Sensory States Based on Motor Exploration , 2007, Int. J. Humanoid Robotics.

[31]  G. Monahan State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .

[32]  Yiannis Aloimonos,et al.  Active segmentation for robotics , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Minho Lee,et al.  Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment , 2008, Neural Networks.