Evaluating Motion Estimation Models from Behavioural and Psychophysical Data

Offering proper evaluation methodology is essential to continue progress in modelling the neural mechanisms involved in vision information processing. Currently the evaluation of biologically inspired motion estimation models lacks a proper methodology for comparing their performance against behavioural and psychophysical data. Here we set the basis for such a new benchmark methodology based on human visual performance and designed a database of image sequences taken from neuroscience and psychophysics literature. In this article we focused on two fundamental aspects of motion estimation, which are the respective influence between 1d versus 2d cues and the dynamics of motion integration. Since motion models deal with many kinds of motion representations and scales, we defined two general readouts based on a global motion estimation. Such readouts, namely eye movements and perceived motion, will serve as a reference to compare simulated and experimental data. Baseline results are provided for biologically inspired artificial vision models but also for computer vision models. As a whole we provide here the basis for a valuable evaluation methodology to unravel the fundamental mechanisms of motion perception in the visual cortex. Our database is freely available on the web together with scoring instructions and results at: http://www-sop.inria.fr/neuromathcomp/psymotionbench

[1]  M. Shiffrar,et al.  Different motion sensitive units are involved in recovering the direction of moving lines , 1993, Vision Research.

[2]  Luc Van Gool,et al.  Motion - Stereo Integration for Depth Estimation , 2002, ECCV.

[3]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[4]  U. Ilg,et al.  Initiation of smooth-pursuit eye movements by real and illusory contours , 2008, Vision Research.

[5]  F. Kooi Local direction of edge motion causes and abolishes the barberpole illusion , 1993, Vision Research.

[6]  S. Grossberg,et al.  A neural model of motion processing and visual navigation by cortical area MST. , 1999, Cerebral cortex.

[7]  Christopher C. Pack,et al.  Contrast dependence of suppressive influences in cortical area MT of alert macaque. , 2005, Journal of neurophysiology.

[8]  R. M. Díaz,et al.  On first-passage problems for asymmetric one-dimensional diffusions , 2007 .

[9]  Pascal Mamassian,et al.  Bayesian modeling of dynamic motion integration , 2007, Journal of Physiology-Paris.

[10]  Christopher C. Pack,et al.  Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain , 2001, Nature.

[11]  P. Kornprobst,et al.  Modelling the dynamics of motion integration with a new luminance-gated diffusion mechanism , 2010, Vision Research.

[12]  Carlos R. Ponce,et al.  Temporal evolution of 2-dimensional direction signals used to guide eye movements. , 2005, Journal of neurophysiology.

[13]  E. Castet,et al.  Temporal dynamics of motion integration for the initiation of tracking eye movements at ultra-short latencies , 2000, Visual Neuroscience.

[14]  Jitendra Malik,et al.  Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Heiko Neumann,et al.  Interactions of motion and form in visual cortex – A neural model , 2008, Journal of Physiology-Paris.

[16]  Olivier D. Faugeras,et al.  Local/Global Analysis of the Stationary Solutions of Some Neural Field Equations , 2009, SIAM J. Appl. Dyn. Syst..

[17]  R. Shapley,et al.  Contrast's effect on spatial summation by macaque V1 neurons , 1999, Nature Neuroscience.

[18]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[19]  Heiko Neumann,et al.  Disambiguating Visual Motion by Form-Motion Interaction—a Computational Model , 2007, International Journal of Computer Vision.

[20]  Eric Castet,et al.  The extrinsic/intrinsic classification of two-dimensional motion signals with barber-pole stimuli , 1999, Vision Research.

[21]  L. Stone,et al.  From following edges to pursuing objects. , 2002, Journal of neurophysiology.

[22]  I. Vanzetta,et al.  Behavioral receptive field for ocular following in humans: dynamics of spatial summation and center-surround interactions. , 2006, Journal of neurophysiology.

[23]  Heiko Neumann,et al.  Disambiguating Visual Motion Through Contextual Feedback Modulation , 2004, Neural Computation.

[24]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[25]  Christopher C. Pack,et al.  Integration of Contour and Terminator Signals in Visual Area MT of Alert Macaque , 2004, The Journal of Neuroscience.

[26]  Guillaume S Masson,et al.  Object motion computation for the initiation of smooth pursuit eye movements in humans. , 2005, Journal of neurophysiology.

[27]  Mads Nielsen,et al.  Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.

[28]  Michael J. Black,et al.  Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[30]  Agustín Salgado de la Nuez,et al.  Temporal Constraints in Large Optical Flow Estimation , 2007, EUROCAST.

[31]  Jean Lorenceau,et al.  A network view of the structure of center/surround modulations of V1 receptive field properties in visual and cortical spaces , 2001, Neurocomputing.

[32]  DeLiang Wang,et al.  An oscillatory correlation model of human motion perception , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[33]  Daniel Cremers,et al.  Anisotropic Huber-L1 Optical Flow , 2009, BMVC.

[34]  Geoffrey E. Hinton,et al.  Learning Generative Texture Models with extended Fields-of-Experts , 2009, BMVC.

[35]  M. Giese Dynamic neural field theory for motion perception , 1998 .

[36]  Hans Wallach Über visuell wahrgenommene Bewegungsrichtung , 1935 .

[37]  Nava Rubin,et al.  The dynamics of bi-stable alternation in ambiguous motion displays: a fresh look at plaids , 2003, Vision Research.

[38]  Xin Huang,et al.  Stimulus Dependency and Mechanisms of Surround Modulation in Cortical Area MT , 2008, The Journal of Neuroscience.

[39]  Richard Szeliski,et al.  A Database and Evaluation Methodology for Optical Flow , 2007, 2007 IEEE 11th International Conference on Computer Vision.