A Review of Biologically Motivated Space-Variant Data Reduction Models for Robotic Vision

The primate retina performs nonlinear “image” data reduction while providing a compromise between high resolution where needed, a wide field-of-view, and small output image size. For autonomous robotics, this compromise is useful for developing vision systems with adequate response times. This paper reviews the two classes of models of retino?cortical data reduction used in hardware implementations. The first class reproduces the retina to cortex mapping based on conformal mapping functions. The pixel intensities are averaged in groups called receptive fields (RF's) which are nonoverlapping and the averaging performed is uniform. As is the case in the retina, the size of the RF's increases with distance from the center of the sensor. Implementations using this class of models are reported to run at video rates (30 frames per second). The second class of models reproduce, in addition to the variable-resolution retino?cortical mapping, the overlap feature of receptive fields of retinal ganglion cells. Achieving data reduction with this class of models is more computationally expensive due to the RF overlap. However, an implementation using such a model running at a minimum of 10 frames per second has been recently proposed. In addition to biological consistency, models with overlapping fields permit the simple selection of a variety of RF computational masks.

[1]  T. Poggio Vision by man and machine. , 1984, Scientific American.

[2]  Carl F. R. Weiman,et al.  Video compression via log polar mapping , 1990, Defense, Security, and Sensing.

[3]  Bart Dierickx,et al.  Log polar image sensor in CMOS technology , 1996, Other Conferences.

[4]  Eric L. Schwartz,et al.  Computational Studies of the Spatial Architecture of Primate Visual Cortex , 1994 .

[5]  Harry Wechsler,et al.  2-D Invariant Object Recognition Using Distributed Associative Memory , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Giulio Sandini,et al.  A Foveated Retina-Like Sensor Using CCD Technology , 1989, Analog VLSI Implementation of Neural Systems.

[7]  A. Cowey,et al.  Retinal ganglion cells that project to the dorsal lateral geniculate nucleus in the macaque monkey , 1984, Neuroscience.

[8]  Giulio Sandini,et al.  On the Advantages of Polar and Log-Polar Mapping for Direct Estimation of Time-To-Impact from Optical Flow , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Kostas Daniilidis,et al.  Optical Flow Computation in the Log-Polar-Plane , 1995, CAIP.

[10]  L SchwartzEric,et al.  Space-variant active vision , 1995 .

[11]  Yehoshua Y. Zeevi,et al.  Reorganization and diversification of signals in vision , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[12]  Carl F. R. Weiman,et al.  Tracking Algorithms Using Log-Polar Mapped Image Coordinates , 1990, Other Conferences.

[13]  Hironao Yamada,et al.  A wide angle vision sensor with fovea-design of distortion lens and the simulated images , 1993, Proceedings of IECON '93 - 19th Annual Conference of IEEE Industrial Electronics.

[14]  Martin D. Levine,et al.  A Real-Time Foveated Sensor with Overlapping Receptive Fields , 1997, Real Time Imaging.

[15]  E. Switkes,et al.  Deoxyglucose analysis of retinotopic organization in primate striate cortex. , 1982, Science.

[16]  Carl F. R. Weiman,et al.  3-D Sensing With Polar Exponential Sensor Arrays , 1988, Defense, Security, and Sensing.

[17]  B. Boycott,et al.  Cortical magnification factor and the ganglion cell density of the primate retina , 1989, Nature.

[18]  Eric L. Schwartz,et al.  Computational anatomy and functional architecture of striate cortex: A spatial mapping approach to perceptual coding , 1980, Vision Research.

[19]  Richard D. Juday,et al.  A Programmable Video Image Remapper , 1988, Defense, Security, and Sensing.

[20]  W. H. Miller,et al.  Photoreceptor diameter and spacing for highest resolving power. , 1977, Journal of the Optical Society of America.

[21]  M. H. Pirenne,et al.  VISION AND THE EYE , 1949 .

[22]  B. Dierickx,et al.  Pixel structure with logarithmic response for intelligent and flexible imager architectures , 1992, ESSDERC '92: 22nd European Solid State Device Research conference.

[23]  Bärbel Mertsching,et al.  The SENROB vision-system and its philosophy , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[24]  Ramesh Jain,et al.  Motion Stereo And Ego-Motion Complex Logarithmic Mapping (ECLM) , 1988, Defense, Security, and Sensing.

[25]  Giulio Sandini,et al.  "Form-invariant" topological mapping strategy for 2D shape recognition , 1985, Comput. Vis. Graph. Image Process..

[26]  D. Hubel,et al.  Receptive fields of optic nerve fibres in the spider monkey , 1960, The Journal of physiology.

[27]  David C. Van Essen,et al.  Information processing strategies and pathways in the primate retina and visual cortex , 1990 .

[28]  D. Whitteridge,et al.  The representation of the visual field on the cerebral cortex in monkeys , 1961, The Journal of physiology.

[29]  Stewart W. Wilson On the Retino-Cortical Mapping , 1983, Int. J. Man Mach. Stud..

[30]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[31]  J. Hirsch,et al.  Quality of the primate photoreceptor lattice and limits of spatial vision , 1984, Vision Research.

[32]  Martin D. Levine,et al.  Visual information processing in primate cone pathways. I. A model , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[33]  Richard A. Messner,et al.  Docking Target Design And Spacecraft Tracking System Stability , 1990, Other Conferences.

[34]  Giulio Sandini,et al.  An anthropomorphic retina-like structure for scene analysis , 1980 .

[35]  Carl F. R. Weiman,et al.  Logarithmic spiral grids for image-processing and display , 1979 .

[36]  Rafael M. Inigo,et al.  Rotation and scale invariant pattern recognition using a multistaged neural network , 1991, Other Conferences.

[37]  Eric L. Schwartz,et al.  Design considerations for a space-variant visual sensor with complex-logarithmic geometry , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[38]  Benjamin B. Bederson,et al.  A miniaturized active vision system , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol. IV. Conference D: Architectures for Vision and Pattern Recognition,.

[39]  Hugh R. Wilson,et al.  10 – THE PERCEPTION OF FORM: Retina to Striate Cortex , 1989 .

[40]  J. W. Brown,et al.  Complex Variables and Applications , 1985 .

[41]  HIROYUKI YAMAMOTO,et al.  An Active Foveated Vision System: Attentional Mechanisms and Scan Path Covergence Measures , 1996, Comput. Vis. Image Underst..

[42]  Yehezkel Yeshurun,et al.  Shape Description with a Space-Variant Sensor: Algorithms for Scan-Path, Fusion, and Convergence Over Multiple Scans , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Brian V. Funt,et al.  Conformal transplantation of lightness to varying resolution sensors , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Giulio Sandini,et al.  Vision and Space-Variant Sensing , 1992 .

[45]  Yasuo Kuniyoshi,et al.  A foveated wide angle lens for active vision , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[46]  Carl F. R. Weiman,et al.  A Modification of the Fusion Model for Log Polar Coordinates , 1990, Other Conferences.

[47]  S M Anstis,et al.  Letter: A chart demonstrating variations in acuity with retinal position. , 1974, Vision research.

[48]  Anup Basu,et al.  Modeling fish-eye lenses , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[49]  Giorgio Bonmassar,et al.  Space-variant active vision: Definition, overview and examples , 1995, Neural Networks.