An Evaluation of Motion in Arti.cial Selective Attention

The Difference of Gaussian (DoG) saliency maps originally proposed by Koch and Ullman had information channels for intensity, opponent colors, and edge orientations, but from the beginning it was suggested that additional channels could and should be added. This paper addresses selective attention in video sequences, and adds motion channels to the saliency maps. The resulting attention windows display better-than-random correlation to the eye fixations of human subjects. This paper is not the first to add motion data to saliency maps. It is, however, the first paper we know of to explicitly compare the performance of saliency maps with and without a motion channel. The surprising negative result is that adding motion channels does not improve the performance of saliency-based selective attention, as measured by correspondence with human eye fixations. We draw two conclusions from this experiment: (1) although motion is clearly a critical attentional cue, the saliency map model may not extend as easily to motion data as some people (ourselves included) thought; and (2) the standard evaluation metric for selective attention algorithms - better than random correlation to human eye fixations - is outdated. Our community needs to focus on comparative studies between algorithms instead.

[1]  Timothy F. Cootes,et al.  Locating salient facial features using image invariants , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[2]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[3]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[4]  Bernt Schiele,et al.  Object Recognition Using Multidimensional Receptive Field Histograms , 1996, ECCV.

[5]  Heinz Hügli,et al.  Empirical Validation of the Saliency-based Model of Visual Attention , 2003 .

[6]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[7]  Todd C. Handy,et al.  Capacity Theory as a Model of Cortical Behavior , 2000, Journal of Cognitive Neuroscience.

[8]  Jacqueline Gottlieb,et al.  The lateral intraparietal area as a salience map: the representation of abrupt onset, stimulus motion, and task relevance , 2000, Vision Research.

[9]  Derrick J. Parkhurst,et al.  Modeling the role of salience in the allocation of overt visual attention , 2002, Vision Research.

[10]  J. Duncan,et al.  Visual search and stimulus similarity. , 1989, Psychological review.

[11]  A. Oliva,et al.  Coarse Blobs or Fine Edges? Evidence That Information Diagnosticity Changes the Perception of Complex Visual Stimuli , 1997, Cognitive Psychology.

[12]  Christof Koch,et al.  Attentional Selection for Object Recognition - A Gentle Way , 2002, Biologically Motivated Computer Vision.

[13]  Claudio M. Privitera,et al.  Algorithms for Defining Visual Regions-of-Interest: Comparison with Eye Fixations , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Christof Koch,et al.  Comparison of feature combination strategies for saliency-based visual attention systems , 1999, Electronic Imaging.

[15]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[16]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[17]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18]  Jang-Kyoo Shin,et al.  Biologically Inspired Saliency Map Model for Bottom-up Visual Attention , 2002, Biologically Motivated Computer Vision.

[19]  Laurent Itti,et al.  Automatic foveation for video compression using a neurobiological model of visual attention , 2004, IEEE Transactions on Image Processing.

[20]  N. Lavie Perceptual load as a necessary condition for selective attention. , 1995, Journal of experimental psychology. Human perception and performance.

[21]  M. Goldberg,et al.  Space and attention in parietal cortex. , 1999, Annual review of neuroscience.

[22]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..

[23]  Bruce A. Draper,et al.  Evaluation of selective attention under similarity transformations , 2005, Comput. Vis. Image Underst..