An architectural model for combining spatial-based and object-based information for attentive video analysis

We present an architectural model for the interaction between top-down, object-based information and bottom-up, spatial-based information in determining visual attention shifts. We focus in particular on how the attentive process can take into account the processing of faces and multiple moving objects. To validate the model, we present and discuss experiments with eye-tracked human subjects.
