Design of a Saccading and Accommodating Robot Vision System

Biologically inspired vision systems, such as convolutional networks, have begun to rival humans in some vision tasks. A key difference between such systems and the human visual system is that the latter dedicates most of its resources to a small fraction of the visual field (the fovea), which it moves frequently and rapidly to acquire a series of rich representations of small parts of a scene. We are developing an artificial system to approximate this aspect of human vision on a robot. The main components are a novel foveating lens, and a novel gimbal that can perform rapid and precise stereo camera orientation. This paper focuses on the gimbal design, which we are releasing under an open hardware license. We expect that it will serve as both an advanced robot vision system, and a source of further insight into human vision.

[1]  R. Kliegl,et al.  An examination of binocular reading fixations based on sentence corpus data. , 2009, Journal of vision.

[2]  Young-bong Bang,et al.  A Three-Degree-of-Freedom Anthropomorphic Oculomotor Simulator , 2006 .

[3]  Clément Gosselin,et al.  The agile eye: a high-performance three-degree-of-freedom camera-orienting device , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[4]  E. M. Klier,et al.  61 Neural Mechanisms of Eye Movements: Three-Dimensional Control and Perceptual Consequences , 2013 .

[5]  Nikola T. Markov,et al.  A Weighted and Directed Interareal Connectivity Matrix for Macaque Cerebral Cortex , 2012, Cerebral cortex.

[6]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[7]  K. Cullen,et al.  Vestibuloocular reflex dynamics during high-frequency and high-acceleration rotations of the head on body in rhesus monkey. , 2002, Journal of neurophysiology.

[8]  John K. Tsotsos,et al.  Design and Performance of Trish, a Binocular Robot Head with Torsional Eye Movements , 1993, Int. J. Pattern Recognit. Artif. Intell..

[9]  Jun Luo,et al.  Design and Control of 3-DoF Spherical Parallel Mechanism Robot Eyes Inspired by the Binocular Vestibule-ocular Reflex , 2015, J. Intell. Robotic Syst..

[10]  Gregory C DeAngelis,et al.  Coding of horizontal disparity and velocity by MT neurons in the alert macaque. , 2003, Journal of neurophysiology.

[11]  T. Hashizume,et al.  Development of micro wide angle fovea lens , 2012, IECON 2012 - 38th Annual Conference on IEEE Industrial Electronics Society.

[12]  Ales Ude,et al.  The Karlsruhe Humanoid Head , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[13]  Alexandre Bernardino,et al.  A review of log-polar imaging for visual perception in robotics , 2010, Robotics and Autonomous Systems.

[14]  Neil A. Dodgson,et al.  Variation and extrema of human interpupillary distance , 2004, IS&T/SPIE Electronic Imaging.

[15]  Thomas Villgrattner,et al.  Design and control of compact high dynamic camera orientation systems , 2010 .

[16]  W. Merigan,et al.  Spatial resolution across the macaque retina , 1990, Vision Research.

[17]  J. V. Van Gisbergen,et al.  Skewness of saccadic velocity profiles: a unifying parameter for normal and slow saccades. , 1987, Vision research.

[18]  B. Boycott,et al.  Retinal ganglion cell density and cortical magnification factor in the primate , 1990, Vision Research.

[19]  H Ulbrich,et al.  Design and Control of a Compact High-Dynamic Camera-Orientation System , 2011, IEEE/ASME Transactions on Mechatronics.

[20]  G. Poggio,et al.  Stereoscopic mechanisms in monkey visual cortex: binocular correlation and disparity selectivity , 1988, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[21]  George Curatu,et al.  ANALYSIS AND DESIGN OF WIDE-ANGLE FOVEATED OPTICAL SYSTEMS , 2009 .

[22]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[23]  Heinz Ulbrich,et al.  Optimization and dynamic simulation of a parallel three degree-of-freedom camera orientation system , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Xoana G. Troncoso,et al.  Microsaccadic Efficacy and Contribution to Foveal and Peripheral Vision , 2012, The Journal of Neuroscience.

[25]  R S Harwerth,et al.  Motor and sensory fusion in monkeys: Psychophysical measurements , 1996, Eye.

[26]  F. A. Miles Binocular Vision and Stereopsis by Ian P. Howard and Brian J. Rogers, Oxford University Press, 1995. £90.00 (736 pages) ISBN 0 19 508476 4. , 1996, Trends in Neurosciences.

[27]  Christof Koch,et al.  Temporal Precision of Spike Trains in Extrastriate Cortex of the Behaving Macaque Monkey , 1999, Neural Computation.