Autonomous, self-calibrating binocular vision based on learned attention and active efficient coding

We present a self-calibrating binocular vision system that autonomously learns how to encode the visual input and how to move its eyes. The model combines the learning of disparity representations and vergence eye movements through Active Efficient Coding (AEC) and the learning of saccades to interesting targets through two novel attention models. The first model is an extension of the Attention based on Information Maximization (AIM) model by Bruce and Tsotsos to binocular images. The second model aims to directly maximize the learning progress of the AEC model. We demonstrate that both attention models improve learning speed compared to a random gaze control strategy. Notably, the vergence eye movement controller and the two attention mechanisms controlling saccades all use the same learned sparse image encoding. The system represents a step towards building self-calibrating, infant-like robots that autonomously learn how to make sense of their environment and how to interact with it.
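The idea that one sparse encoding can serve both vergence learning and attention can be illustrated with a small sketch. It is not the authors' implementation. It assumes that a binocular patch is the concatenation of a left and a right patch, that the encoder is a greedy matching pursuit over a fixed dictionary, that the AEC reward is the negative reconstruction error, and that AIM-style saliency is the summed self-information of the coefficients under per-atom histogram estimates. The names `encode`, `aec_reward`, and `self_information` are hypothetical.

```python
import numpy as np


def encode(patches, dictionary, k=3):
    """Greedy matching pursuit (assumed encoder): k atoms per patch.

    patches:    (n, dim) array, one binocular patch per row
    dictionary: (atoms, dim) array with unit-norm rows
    Returns the sparse coefficients (n, atoms) and the residual (n, dim).
    """
    residual = np.array(patches, dtype=float)  # np.array makes a copy
    coeffs = np.zeros((residual.shape[0], dictionary.shape[0]))
    rows = np.arange(residual.shape[0])
    for _ in range(k):
        proj = residual @ dictionary.T             # correlation with every atom
        best = np.argmax(np.abs(proj), axis=1)     # best-matching atom per patch
        gain = proj[rows, best]
        coeffs[rows, best] += gain
        residual -= gain[:, None] * dictionary[best]
    return coeffs, residual


def aec_reward(residual):
    """Assumed AEC reward: negative squared reconstruction error per patch."""
    return -np.sum(residual ** 2, axis=1)


def self_information(coeffs, bins=16):
    """AIM-style saliency: sum over atoms of -log p(coefficient).

    Each atom's coefficient density is estimated with a histogram over all
    patches in the current image; atoms are treated as independent.
    """
    n, atoms = coeffs.shape
    info = np.zeros(n)
    for j in range(atoms):
        counts, edges = np.histogram(coeffs[:, j], bins=bins)
        idx = np.digitize(coeffs[:, j], edges[1:-1])   # bin index in 0..bins-1
        p = np.maximum(counts[idx], 1) / n             # floor avoids log(0)
        info -= np.log(p)
    return info


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dictionary = rng.normal(size=(32, 128))
    dictionary /= np.linalg.norm(dictionary, axis=1, keepdims=True)
    left = rng.normal(size=(200, 64))
    right = rng.normal(size=(200, 64))
    binocular = np.hstack([left, right])               # assumed patch layout
    coeffs, residual = encode(binocular, dictionary)
    saccade_target = int(np.argmax(self_information(coeffs)))
    print("mean AEC reward:", aec_reward(residual).mean())
    print("most salient patch index:", saccade_target)
```

In this sketch the same `coeffs` and `residual` drive both signals: a vergence controller would be trained to increase `aec_reward`, while a saccade controller would pick the patch with the highest `self_information`. The second attention model in the abstract, which maximizes learning progress, is not covered here.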

Jochen Triesch | Bertram E. Shi | Qingpeng Zhu

[1] Zhaoping Li. A saliency map in primary visual cortex, 2002, Trends in Cognitive Sciences.

[2] Yu Zhao, et al. Intrinsically motivated learning of visual motion perception and smooth pursuit, 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[3] Jochen Triesch, et al. On the utility of sparse neural representations in adaptive behaving agents, 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[4] Shalabh Bhatnagar, et al. Natural actor-critic algorithms, 2009, Autom.

[5] John K. Tsotsos, et al. Saliency, attention, and visual search: an information theoretic approach, 2009, Journal of Vision.

[6] Pierre-Yves Oudeyer, et al. Intrinsic Motivation Systems for Autonomous Mental Development, 2007, IEEE Transactions on Evolutionary Computation.

[7] W. James, et al. The Principles of Psychology, 1983.

[8] Yu Zhao, et al. Robust active binocular vision through intrinsically motivated learning, 2013, Front. Neurorobot.

[9] J. Colombo. The development of visual attention in infancy, 2001, Annual Review of Psychology.

[10] Jürgen Schmidhuber, et al. Optimal Artificial Curiosity, Creativity, Music, and the Fine Arts, 2005.

[11] Bertram E. Shi, et al. The generative Adaptive Subspace Self-Organizing Map, 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[12] Jochen Triesch, et al. Autonomous learning of smooth pursuit and vergence through active efficient coding, 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[13] Giorgio Metta, et al. Design of the robot-cub (iCub) head, 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006.

[14] Yu Zhao, et al. A unified model of the joint development of disparity selectivity and vergence control, 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[15] Kazuhiro Fukui, et al. Realistic CG Stereo Image Dataset with Ground Truth Disparity Maps, 2012.

[16] Yu Zhao, et al. Autonomous learning of active multi-scale binocular vision, 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[17] Alex Graves, et al. Recurrent Models of Visual Attention, 2014, NIPS.