Smooth Foveal vision with Gaussian receptive fields

Despite the huge amount of information, the human brain is able to perceive and interpret visual signals in real time. One of reasons is that visual information is selectively sampled in the retina providing higher acuity in the center (where usually the most important information is) than in periphery. Humanoids vision can benefit from such space-variant representations of the visual field with utility not only in image data reduction but also in others applications as vergence, active tracking, as demonstrated in the last decades' research. However, classical methods model foveation processes with non-smooth receptive fields with are a weak match to the human physiology. Instead we propose an alternative representation using Gaussian kernels. While increasing redundancy, Gaussian receptive fields provide a smoother representation of Foveal images and model certain properties of data acquisition in human vision. In addition, we propose an algebraic approach for the analysis, synthesis and processing of Foveal images, using simple matrix computations and operator theory. We show how to derive the equivalent Foveal operators to common Cartesian domain linear processing routines such as image geometrical transformations and filtering operations. We present experiments illustrating the performance of the proposed methodology in comparison to classical approaches for space-variant image processing both in image reconstruction and in motion estimation/tracking tasks.

[1]  G. Metta An attentional system for a humanoid robot exploiting space variant vision , 2001 .

[2]  Giulio Sandini,et al.  Foveated active tracking with redundant 2D motion parameters , 2002, Robotics Auton. Syst..

[3]  Giulio Sandini,et al.  An anthropomorphic retina-like structure for scene analysis , 1980 .

[4]  Alexandre Bernardino,et al.  Binocular tracking: integrating perception and control , 1999, IEEE Trans. Robotics Autom..

[5]  I. Gohberg,et al.  Basic Operator Theory , 1981 .

[6]  Filiberto Pla,et al.  Similarity motion estimation and active tracking through spatial-domain projections on log-polar images , 2005, Comput. Vis. Image Underst..

[7]  Giulio Sandini,et al.  Dynamic Vergence Using Log-Polar Images , 1997, International Journal of Computer Vision.

[8]  Giulio Sandini,et al.  Anthropomorphic Visual Sensors , 2005 .

[9]  T. Lindeberg,et al.  Foveal scale-space and the linear increase of receptive field size as a function of eccentricity , 1994 .

[10]  E. L. Schwartz,et al.  Spatial mapping in the primate sensory projection: Analytic structure and relevance to perception , 1977, Biological Cybernetics.

[11]  Alexandre Bernardino,et al.  A Binocular Stereo Algorithm for Log-Polar Foveated Systems , 2002, Biologically Motivated Computer Vision.

[12]  S. Edelman Receptive Fields for Vision: from Hyperacuity to Object Recognition , 1995 .

[13]  Mohammed Yeasin,et al.  Foveated Vision Sensor and Image Processing - A Review , 2005, Machine Learning and Robot Perception.

[14]  A. Bernardino,et al.  Binocular Visual Tracking : Integration of Perception and Control , 1999 .

[15]  José Santos-Victor,et al.  Egomotion estimation using log-polar images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[16]  Dennis Gabor,et al.  Theory of communication , 1946 .

[17]  Kostas Daniilidis,et al.  Attentive Visual Motion Processing: Computations in the Log-Polar Plane , 1994, Theoretical Foundations of Computer Vision.

[18]  H. Kolb How the Retina Works , 2003, American Scientist.

[19]  Dianne P. O'Leary,et al.  The Use of the L-Curve in the Regularization of Discrete Ill-Posed Problems , 1993, SIAM J. Sci. Comput..

[20]  Jochen Triesch,et al.  Implementations and Implications of Foveated Vision , 2009 .

[21]  Alexandre Bernardino,et al.  Visual behaviours for binocular tracking , 1998, Robotics Auton. Syst..

[22]  Benjamin B. Bederson,et al.  Space variant image processing , 1994, International Journal of Computer Vision.

[23]  Heiko Neumann,et al.  Space-variant dynamic neural fields for visual attention , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[24]  Stewart W. Wilson On the Retino-Cortical Mapping , 1983, Int. J. Man Mach. Stud..

[25]  T. L. Boullion,et al.  The pseudoinverse of a centrosymmetric matrix , 1973 .