A cortical architecture on parallel hardware for motion processing in real time.

Walking through a crowd or driving on a busy street requires monitoring your own movement and that of others. The segmentation of these other, independently moving, objects is one of the most challenging tasks in vision as it requires fast and accurate computations for the disentangling of independent motion from egomotion, often in cluttered scenes. This is accomplished in our brain by the dorsal visual stream relying on heavy parallel-hierarchical processing across many areas. This study is the first to utilize the potential of such design in an artificial vision system. We emulate large parts of the dorsal stream in an abstract way and implement an architecture with six interdependent feature extraction stages (e.g., edges, stereo, optical flow, etc.). The computationally highly demanding combination of these features is used to reliably extract moving objects in real time. This way-utilizing the advantages of parallel-hierarchical design-we arrive at a novel and powerful artificial vision system that approaches richness, speed, and accuracy of visual processing in biological systems.

[1]  J. Gibson The perception of the visual world , 1951 .

[2]  G. Granlund In search of a general picture processing operator , 1978 .

[3]  H. C. Longuet-Higgins,et al.  The interpretation of a moving retinal image , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[4]  D. Pollen,et al.  Phase relationships between adjacent simple cells in the visual cortex. , 1981, Science.

[5]  W C Gogel,et al.  Analysis of the perception of motion concomitant with a lateral motion of the head , 1982, Perception & psychophysics.

[6]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[7]  Leslie G. Ungerleider,et al.  Object vision and spatial vision: two cortical pathways , 1983, Trends in Neurosciences.

[8]  John H. R. Maunsell,et al.  The connections of the middle temporal visual area (MT) and their relationship to a cortical hierarchy in the macaque monkey , 1983, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[9]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[10]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[11]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  H. Komatsu,et al.  Relation of cortical areas MT and MST to pursuit eye movements. I. Localization and visual properties of neurons. , 1988, Journal of neurophysiology.

[14]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[15]  James J. Little,et al.  Seeing in Parallel: the Vision Machine , 1988 .

[16]  T Poggio,et al.  Parallel integration of vision modules. , 1988, Science.

[17]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[18]  W C Gogel,et al.  A theory of phenomenal geometry and its applications , 1990, Perception & psychophysics.

[19]  I. Ohzawa,et al.  Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. , 1990, Science.

[20]  E. Brenner Judging object motion during smooth pursuit eye movements: The role of optic flow , 1991, Vision Research.

[21]  David J. Fleet,et al.  Phase-based disparity measurement , 1991, CVGIP Image Underst..

[22]  J A Perrone,et al.  Model for the computation of self-motion in biological systems. , 1992, Journal of the Optical Society of America. A, Optics and image science.

[23]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[24]  Markus Lappe,et al.  A Neural Network for the Processing of Optic Flow from Ego-Motion in Man and Higher Mammals , 1993, Neural Computation.

[25]  K. Tanaka,et al.  Analysis of object motion in the ventral part of the medial superior temporal area of the macaque visual cortex. , 1993, Journal of neurophysiology.

[26]  David J. Fleet,et al.  Stability of Phase Information , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[28]  R. Wurtz,et al.  Response of monkey MST neurons to optic flow stimuli with shifted centers of motion , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[29]  W H Warren,et al.  Perceiving Heading in the Presence of Moving Objects , 1995, Perception.

[30]  Rajeev Sharma,et al.  Early detection of independent motion from active control of normal image flow patterns , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[31]  Wendong Wang,et al.  Recovering the Three-Dimensional Motion and Structure of Multiple Moving Objects from Binocular Image Flows , 1996, Comput. Vis. Image Underst..

[32]  P. Anandan,et al.  A unified approach to moving object detection in 2D and 3D scenes , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[33]  Constance S. Royden,et al.  Human heading judgments in the presence of moving objects , 1996, Perception & psychophysics.

[34]  Eli Brenner,et al.  The Special Role of Distant Structures in Perceived Object Velocity , 1996, Vision Research.

[35]  K. Hoffmann,et al.  Optic Flow Processing in Monkey STS: A Theoretical and Experimental Approach , 1996, The Journal of Neuroscience.

[36]  I. Ohzawa,et al.  Encoding of binocular disparity by complex cells in the cat's visual cortex. , 1996, Journal of neurophysiology.

[37]  E. Rolls,et al.  INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM , 1997, Progress in Neurobiology.

[38]  Constance S. Royden,et al.  Mathematical analysis of motion-opponent mechanisms used in the determination of heading and depth. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[39]  Julie M. Harris,et al.  Guidance of locomotion on foot uses perceived target location rather than optic flow , 1998, Current Biology.

[40]  T J Sejnowski,et al.  A Model for Encoding Multiple Object Motions and Self-Motion in Area MST of Primate Visual Cortex , 1998, The Journal of Neuroscience.

[41]  Antonis A. Argyros,et al.  Robust Regression for the Detection of Independent 3D Motion by a Binocular Observer , 1998, Real Time Imaging.

[42]  Eero P. Simoncelli,et al.  A model of neuronal responses in visual area MT , 1998, Vision Research.

[43]  P. Torr Geometric motion segmentation and model selection , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[44]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[45]  R. Wurtz,et al.  Response to motion in extrastriate area MSTl: disparity sensitivity. , 1999, Journal of neurophysiology.

[46]  Harpreet S. Sawhney,et al.  Independent motion detection in 3D scenes , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[47]  Radu Horaud,et al.  Motion-Egomotion Discrimination and Motion Segmentation from Image-Pair Streams , 2000, Comput. Vis. Image Underst..

[48]  Ning Qian,et al.  Relationship Between Phase and Energy Methods for Disparity Computation , 2000, Neural Computation.

[49]  Jacques Droulez,et al.  Self-motion and the perception of stationary objects , 2001, Nature.

[50]  J. Droulez,et al.  The stationarity hypothesis: an allocentric criterion in visual perception , 2001, Vision Research.

[51]  F. Mechler,et al.  Detection and Discrimination of Relative Spatial Phase by V1 Neurons , 2002, The Journal of Neuroscience.

[52]  D. Ringach Spatial structure and symmetry of simple-cell receptive fields in macaque primary visual cortex. , 2002, Journal of neurophysiology.

[53]  Marc M. Van Hulle,et al.  A phase-based approach to the estimation of the optical flow field using spatial filtering , 2002, IEEE Trans. Neural Networks.

[54]  Constance S. Royden,et al.  Computing heading in the presence of moving objects: a model that uses motion-opponent operators , 2002, Vision Research.

[55]  S. Heinrich,et al.  Fast obstacle detection for urban traffic situations , 2002, IEEE Trans. Intell. Transp. Syst..

[56]  S. Grossberg How does the cerebral cortex work? Development, learning, attention, and 3-D vision by laminar circuits of visual cortex. , 2003, Behavioral and cognitive neuroscience reviews.

[57]  J. Koenderink,et al.  Facts on optic flow , 1987, Biological Cybernetics.

[58]  David J. Fleet,et al.  Computation of component image velocity from local phase information , 1990, International Journal of Computer Vision.

[59]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[60]  Carlo Tomasi,et al.  On the Consistency of Instantaneous Rigid Motion Estimation , 2004, International Journal of Computer Vision.

[61]  John Oliensis The Least-Squares Error for Structure from Infinitesimal Motion , 2004, International Journal of Computer Vision.

[62]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[63]  Stefano Soatto,et al.  Optimal Structure from Motion: Local Ambiguities and Global Estimates , 2004, International Journal of Computer Vision.

[64]  Randal C. Nelson Qualitative detection of motion by a moving observer , 2004, International Journal of Computer Vision.

[65]  T. Sanger,et al.  Stereo disparity computation using Gabor filters , 1988, Biological Cybernetics.

[66]  William B. Thompson,et al.  Detecting moving objects , 1989, International Journal of Computer Vision.

[67]  P. Thier,et al.  Posterior Parietal Cortex Neurons Encode Target Motion in World-Centered Coordinates , 2004, Neuron.

[68]  Ning Qian,et al.  A Coarse-to-Fine Disparity Energy Model with Both Phase-Shift and Position-Shift Receptive Field Mechanisms , 2004, Neural Computation.

[69]  S. Rushton,et al.  Moving observers, relative retinal motion and the detection of object movement , 2005, Current Biology.

[70]  Yiannis Aloimonos,et al.  Motion segmentation using occlusions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[71]  Hugh F. Durrant-Whyte,et al.  Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[72]  David J Logan,et al.  Cerebral Cortex doi:10.1093/cercor/bhj082 Cerebral Cortex Advance Access published December 7, 2005 Cortical Area MSTd Combines Visual Cues , 2022 .

[73]  Eero P. Simoncelli,et al.  How MT cells analyze the motion of visual patterns , 2006, Nature Neuroscience.

[74]  James R. Bergen,et al.  Visual odometry for ground vehicle applications , 2006, J. Field Robotics.

[75]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[76]  Simon K Rushton,et al.  Perception of object trajectory: parsing retinal motion into self and object movement components. , 2007, Journal of vision.

[77]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[78]  S. Rushton,et al.  The pop out of scene-relative object movement against retinal motion due to self-movement , 2007, Cognition.

[79]  Gérard G. Medioni,et al.  Detecting Motion Regions in the Presence of a Strong Parallax from a Moving Camera by Multiview Geometric Constraints , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[80]  David Suter,et al.  A Model-Selection Framework for Multibody Structure-and-Motion of Image Sequences , 2007, International Journal of Computer Vision.

[81]  Andrew Chi-Sing Leung,et al.  Discrete Wavelet Transform on Consumer-Level Graphics Hardware , 2007, IEEE Transactions on Multimedia.

[82]  Hugh F. Durrant-Whyte,et al.  Simultaneous Localization, Mapping and Moving Object Tracking , 2007, Int. J. Robotics Res..

[83]  Erik Lindholm,et al.  NVIDIA Tesla: A Unified Graphics and Computing Architecture , 2008, IEEE Micro.

[84]  M. Morrone,et al.  BOLD response to spatial phase congruency in human brain. , 2008, Journal of vision.

[85]  Laurence R Harris,et al.  The influence of retinal and extra-retinal motion cues on perceived object motion during self-motion. , 2008, Journal of vision.

[86]  Richard J Krauzlis,et al.  Vector subtraction using visual and extraretinal motion signals: a new look at efference copy and corollary discharge theories. , 2008, Journal of vision.

[87]  T. Vaudrey,et al.  Differences between stereo and motion behaviour on synthetic and real-world stereo sequences , 2008, 2008 23rd International Conference Image and Vision Computing New Zealand.

[88]  Guy A Orban,et al.  Higher order visual processing in macaque extrastriate cortex. , 2008, Physiological reviews.

[89]  Richard J Krauzlis,et al.  Spatial integration by MT pattern neurons: a closer look at pattern-to-component effects and the role of speed tuning. , 2008, Journal of vision.

[90]  Luc Van Gool,et al.  Coupled Object Detection and Tracking from Static Cameras and Moving Vehicles , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[91]  S. Rushton,et al.  Evidence for flow-parsing in radial flow displays , 2008, Vision Research.

[92]  Luc Van Gool,et al.  Robust Multiperson Tracking from a Mobile Platform , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[93]  Somkiat Wangsiripitak,et al.  Avoiding moving outliers in visual SLAM by tracking moving objects , 2009, 2009 IEEE International Conference on Robotics and Automation.

[94]  Hiroshi Ando,et al.  World-centered perception of 3D object motion during visually guided self-motion. , 2009, Journal of vision.

[95]  Stephen Grossberg,et al.  Cortical dynamics of navigation and steering in natural scenes: Motion-based object segmentation, heading, and obstacle avoidance , 2009, Neural Networks.

[96]  S. Rushton,et al.  Optic Flow Processing for the Assessment of Object Movement during Ego Movement , 2009, Current Biology.

[97]  Aapo Hyvärinen,et al.  Representation of Cross-Frequency Spatial Phase Relationships in Human Visual Cortex , 2009, The Journal of Neuroscience.

[98]  Marc M. Van Hulle,et al.  Optic flow from unstable sequences through local velocity constancy maximization , 2009, Image Vis. Comput..

[99]  Markus Lappe,et al.  Visual selectivity for heading in monkey area MST , 2009, Experimental Brain Research.

[100]  S. Rushton,et al.  Perception of scene-relative object movement: Optic flow parsing and the contribution of monocular depth cues , 2009, Vision Research.

[101]  Daniel Cremers,et al.  Detection and Segmentation of Independently Moving Objects from Dense Scene Flow , 2009, EMMCVPR.

[102]  L. Cormack,et al.  Disparity- and velocity-based signals for three-dimensional motion perception in human MT+ , 2009, Nature Neuroscience.

[103]  Fabio Solari,et al.  A compact harmonic code for early vision based on anisotropic frequency channels , 2010, Comput. Vis. Image Underst..

[104]  Ian D. Reid,et al.  On combining visual SLAM and visual odometry , 2010, 2010 IEEE International Conference on Robotics and Automation.

[105]  Norbert Krüger,et al.  A Three-level Architecture for Model-free Detection and Tracking of Independently Moving Objects , 2010, VISAPP.