Natural contrast statistics and the selection of visual fixations

In this paper we address the problem of visual surveillance, which we define as the problem of optimally extracting information from the visual scene with a fixating, foveated imaging system. We are explicitly concerned with eye/camera movement strategies that result in maximizing information extraction from the visual field. Here we demonstrate how a novel characterization of the contrast statistics of natural images can be used for selecting fixation points that minimize the total contrast uncertainty (entropy) of natural images. We demonstrate the performance of the algorithm and compare its performance to ground truth methods. The results show that our algorithm performs favorably in terms of both efficiency and its ability to find salient features in the image.

[1]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[2]  Antonio Turiel,et al.  The Multifractal Structure of Contrast Changes in Natural Images: From Sharp Edges to Textures , 2000, Neural Computation.

[3]  Kim Steenstrup Pedersen,et al.  The Nonlinear Statistics of High-Contrast Patches in Natural Images , 2003, International Journal of Computer Vision.

[4]  D. Ruderman The statistics of natural images , 1994 .

[5]  Alan C. Bovik,et al.  Image features that draw fixations , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[6]  A. L. I︠A︡rbus Eye Movements and Vision , 1967 .

[7]  Marios S. Pattichis,et al.  Rate control for foveated MPEG/H.263 video , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[8]  Raghu G. Raj,et al.  Approximating filtered scale-variant signals , 2005, IEEE Transactions on Image Processing.

[9]  Rajesh P. N. Rao,et al.  An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[10]  P Reinagel,et al.  Natural scene statistics at the centre of gaze. , 1999, Network.

[11]  W. Geisler,et al.  Separation of low-level and high-level factors in complex tasks: visual search. , 1995, Psychological review.

[12]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[13]  A. L. Yarbus,et al.  Eye Movements and Vision , 1967, Springer US.