Using Human Visual System modeling for bio-inspired low level image processing

An efficient modeling of the processing occurring at retina level and in the V1 visual cortex has been proposed in [1,2]. The aim of the paper is to show the advantages of using such a modeling in order to develop efficient and fast bio-inspired modules for low level image processing. At the retina level, a spatio-temporal filtering ensures accurate structuring of video data (noise and illumination variation removal, static and dynamic contour enhancement). In the V1 cortex, a frequency and orientation based analysis is performed. The combined use of retina and V1 cortex modeling allows the development of low level image processing modules for contour enhancement, for moving contour extraction, for motion analysis and for motion event detection. Each module is described and its performances are evaluated. The retina model has been integrated into a real-time C/C++ optimized program which is also presented in this paper with the derived computer vision tools.

[1]  W. Beaudot,et al.  Sensory coding in the vertebrate retina: towards an adaptive control of visual sensitivity. , 1996, Network.

[2]  Laurent Bonnaud,et al.  Multimodal Focus Attention Detection in an Augmented Driver Simulator , 2006 .

[3]  Takeo Watanabe,et al.  High-Level Motion Processing , 1998 .

[4]  Patrick Le Callet,et al.  Spatio-temporal Tone Mapping Operator Based on a Retina Model , 2009, CCIW.

[5]  Gregory Bock,et al.  Higher-order processing in the visual system , 1994 .

[6]  Glenn D. Hines,et al.  Image enhancement, image quality, and noise , 2005, SPIE Optics + Photonics.

[7]  Dimitrios Tzovaras,et al.  Multimodal signal processing and interaction for a driving simulator: Component-based architecture , 2008, Journal on Multimodal User Interfaces.

[8]  Jeanny Hérault,et al.  NATURAL SCENE PERCEPTION: VISUAL ATTRACTORS AND IMAGES PROCESSING , 2002 .

[9]  Lale Akarun,et al.  Sign Language Tutoring tool , 2005, 2005 13th European Signal Processing Conference.

[10]  Patrick Le Callet,et al.  A coherent computational approach to model bottom-up visual attention , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.

[12]  Jamie Goode,et al.  Ciba Foundation Symposium 184 - Higher-Order Processing in the Visual System , 1994 .

[13]  Hans-Jochen Heinze,et al.  Selectivity for speed gradients in human area MT/V5 , 2005, Neuroreport.

[14]  H Barlow,et al.  Redundancy reduction revisited , 2001, Network.

[15]  Zia-ur Rahman,et al.  A multiscale retinex for bridging the gap between color images and the human observation of scenes , 1997, IEEE Trans. Image Process..

[16]  Dirk Walther,et al.  Interactions of visual attention and object recognition : computational modeling, algorithms, and psychophysics. , 2006 .

[17]  Misha Mahowald,et al.  A silicon model of early visual processing , 1993, Neural Networks.

[18]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[19]  R. L. Valois,et al.  The orientation and direction selectivity of cells in macaque visual cortex , 1982, Vision Research.

[20]  R C Reid,et al.  Efficient Coding of Natural Scenes in the Lateral Geniculate Nucleus: Experimental Test of a Computational Theory , 1996, The Journal of Neuroscience.

[21]  Joseph J. Atick,et al.  Towards a Theory of Early Visual Processing , 1990, Neural Computation.

[22]  D. H. Kelly Motion and vision. II. Stabilized spatio-temporal threshold surface. , 1979, Journal of the Optical Society of America.

[23]  Michael J. Berry,et al.  Adaptation of retinal processing to image contrast and spatial scale , 1997, Nature.

[24]  William Gaetz,et al.  Activation of area MT/V5 and the right inferior parietal cortex during the discrimination of transient direction changes in translational motion. , 2007, Cerebral cortex.

[25]  Jeanny Hérault,et al.  Realistic Simulation Tool for Early Visual Processing Including Space, Time and Colour Data , 1993, IWANN.

[26]  Alice Caplier,et al.  Head nods analysis: interpretation of non verbal communication gestures , 2005, IEEE International Conference on Image Processing 2005.

[27]  S. Thorpe,et al.  Surfing a spike wave down the ventral stream , 2002, Vision Research.

[28]  Stephen A. Baccus,et al.  A Retinal Circuit That Computes Object Motion , 2008, The Journal of Neuroscience.

[29]  J. Bullier Integrated model of visual processing , 2001, Brain Research Reviews.

[30]  D. Hubel,et al.  Sequence regularity and geometry of orientation columns in the monkey striate cortex , 1974, The Journal of comparative neurology.

[31]  Joseph J. Atick,et al.  What Does the Retina Know about Natural Scenes? , 1992, Neural Computation.

[32]  George Sperling,et al.  A Systems Analysis of Visual Motion Perception , 1999 .

[33]  B. Hassenstein,et al.  Systemtheoretische Analyse der Zeit-, Reihenfolgen- und Vorzeichenauswertung bei der Bewegungsperzeption des Rüsselkäfers Chlorophanus , 1956 .

[34]  C. Palm,et al.  Classification of color textures by Gabor filtering , 2002 .

[35]  R. A. Linsenmeier,et al.  Non-linear spatial summation in cat retinal ganglion cells at different background levels , 1979, Experimental Brain Research.

[36]  Pierre Kornprobst,et al.  From Light to Spikes: a Large-Scale Retina Simulator , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[37]  Jeanny Hérault,et al.  Practical implementation of LMMSE demosaicing using luminance and chrominance spaces , 2007, Comput. Vis. Image Underst..

[38]  Alice Caplier,et al.  Biological approach for head motion detection and analysis , 2005, 2005 13th European Signal Processing Conference.

[39]  Carver A. Mead,et al.  Orientation-Selective VLSI Retina , 1988, Other Conferences.

[40]  Nathalie Guyader,et al.  Emotion Classification or face identification depend on which part of the face is analyzed , 2007 .

[41]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[42]  Francis J. Devos,et al.  A programmable VLSI retina for rough vision , 2005, Machine Vision and Applications.

[43]  Abdelhakim Saadane,et al.  Design and Evaluation of an Entirely Psychovisual-Based Coding Scheme , 2001, J. Vis. Commun. Image Represent..

[44]  Patrick Lambert,et al.  A Fuzzy Color-Based Approach for Understanding Animated Movies Content in the Indexing Task , 2008, EURASIP J. Image Video Process..

[45]  L O Harvey,et al.  Visual masking at different polar angles in the two-dimensional Fourier plane. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[46]  S Marcelja,et al.  Mathematical description of the responses of simple cortical cells. , 1980, Journal of the Optical Society of America.

[47]  Z L Lu,et al.  Three-systems theory of human visual motion perception: review and update. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[48]  Alice Caplier,et al.  Fusing bio-inspired vision data for simplified high level scene interpretation: Application to face motion analysis , 2010, Comput. Vis. Image Underst..

[49]  F. Werblin,et al.  Neural interactions mediating the detection of motion in the retina of the tiger salamander , 1988, Visual Neuroscience.

[50]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[51]  John K. Tsotsos,et al.  Attending to visual motion , 2005, Comput. Vis. Image Underst..

[52]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[53]  Daniela Gorski Trevisan,et al.  Multimodal focus attention and stress detection and feedback in an augmented driver simulator , 2007, Personal and Ubiquitous Computing.

[54]  Albert Gjedde,et al.  Pattern–motion selectivity in the human pulvinar , 2005, NeuroImage.

[55]  Jeanny Hérault,et al.  Modeling Visual Perception for Image Processing , 2007, IWANN.

[56]  Alice Caplier,et al.  Hypovigilence analysis: open or closed eye or mouth? Blinking or yawning frequency? , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[57]  Nathalie Guyader,et al.  Spatio-temporal saliency model to predict eye movements in video free viewing , 2008, 2008 16th European Signal Processing Conference.

[58]  Nicole C. Rust,et al.  Do We Know What the Early Visual System Does? , 2005, The Journal of Neuroscience.

[59]  Scott J. Daly A visual model for optimizing the design of image processing algorithms , 1994, Proceedings of 1st International Conference on Image Processing.