Pre-Attentive Face Detection for Foveated Wide-Field Surveillance

Conventional surveillance sensors suffer from an unavoidable tradeoff between image resolution and field of view. This problem may be overcome by combining a fixed, preattentive, low-resolution wide-field camera with a shiftable, attentive, high-resolution narrow-field camera. Here we present techniques for orienting the attentive camera to faces detected in the pre-attentive wide-field image stream. Unfortunately, the low image resolution of the widefield sensor precludes the use of most conventional face detection algorithms. Instead, we argue that reliable performance can best be achieved by accurate probabilistic combination of multiple cues: skin detection, motion detection and foreground extraction. Fast sampling of scale space over all three modalities is achieved using integral images and parametric models of response distributions are derived using supervised learning techniques. Log likelihood ratios for each modality are combined with spatial priors incorporating tracking and novelty objectives to yield a posterior map indicating the probability of a face appearing at each image location. The result is a real-time attentive visual sensor which reliably fixates faces over a 130 deg field of view, allowing high-resolution capture of facial images over a large dynamic scene

[1]  Stuart J. Russell,et al.  Image Segmentation in Video Sequences: A Probabilistic Approach , 1997, UAI.

[2]  Heinrich Niemann,et al.  Statistical modeling and performance characterization of a real-time dual camera surveillance system , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[3]  B. Schiele,et al.  Fast and Robust Face Finding via Local Context , 2003 .

[4]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[5]  Brian Scassellati,et al.  Eye Finding via Face Detection for a Foveated Active Vision System , 1998, AAAI/IAAI.

[6]  Henry Schneiderman,et al.  Feature-centric evaluation for efficient cascaded object detection , 2004, CVPR 2004.

[7]  Biswajit Bose,et al.  Improving object classification in far-field video , 2004, CVPR 2004.

[8]  Vinod Nair,et al.  An unsupervised, online learning framework for moving object detection , 2004, CVPR 2004.

[9]  Ramakant Nevatia,et al.  Tracking multiple humans in complex situations , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Carlo S. Regazzoni,et al.  Dual camera system for face detection in unconstrained environments , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[11]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Fadi Dornaika,et al.  Attentive Wide-Field Sensing for Visual Telepresence and Surveillance , 2004 .

[14]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Takeo Kanade,et al.  Object Detection Using the Statistics of Parts , 2004, International Journal of Computer Vision.