Cooperative multisensor system for real-time face detection and tracking in uncontrolled conditions

This work presents an architecture for distributed multi-sensor video surveillance. The system tracks moving objects in outdoor environments through a cooperative strategy that uses two video cameras. It can also focus on the faces of detected pedestrians, segmenting and tracking them over time at different resolutions and collecting snapshot frames of the face images. The two cameras operate in a cooperative client/server structure. The first (client) camera monitors the entire area of interest and detects moving objects using change detection techniques. The detected objects are tracked over time, and their positions are indicated on a map of the monitored area. The objects’ coordinates are sent to the server sensor so that it can point its zoom optics at the moving object. This second camera tracks the objects at high resolution. Like the client camera, it is calibrated, so the position of an object detected in the image plane is translated into coordinates on the same area map. In this common map reference system, data fusion techniques are applied to obtain a more precise and robust estimate of the objects’ tracks and to perform face detection and tracking. The novelty and strength of the work lie in the cooperative multi-sensor approach, in high-resolution long-distance tracking, and in the automatic collection of biometric data, such as a clip of a person’s face, for recognition purposes.
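
The step in which both calibrated cameras report positions on a common map, and those positions are then fused, can be illustrated with a minimal sketch. It is not the authors' method. It assumes a flat ground plane, so that each camera's calibration reduces to a 3x3 image-to-map homography, and it uses inverse-variance weighting as a stand-in for the unspecified data fusion technique. The names `MapEstimate`, `image_to_map`, and `fuse` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class MapEstimate:
    """Object position on the shared area map (hypothetical structure)."""
    x: float
    y: float
    var: float  # assumed isotropic position variance, in map units squared


def image_to_map(h, u, v):
    """Project pixel (u, v) onto the map with a 3x3 homography h.

    h is a row-major nested list. Using a planar homography is an assumption
    of this sketch; the abstract only states that the cameras are calibrated.
    """
    w = h[2][0] * u + h[2][1] * v + h[2][2]
    if abs(w) < 1e-12:
        raise ValueError("pixel projects to infinity on the map plane")
    x = (h[0][0] * u + h[0][1] * v + h[0][2]) / w
    y = (h[1][0] * u + h[1][1] * v + h[1][2]) / w
    return x, y


def fuse(a, b):
    """Combine two map estimates by inverse-variance weighting.

    The estimate with the smaller variance (for example, the high-resolution
    camera) pulls the fused position closer to itself.
    """
    wa, wb = 1.0 / a.var, 1.0 / b.var
    total = wa + wb
    return MapEstimate(
        x=(wa * a.x + wb * b.x) / total,
        y=(wa * a.y + wb * b.y) / total,
        var=1.0 / total,
    )


if __name__ == "__main__":
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    wide = MapEstimate(*image_to_map(identity, 0.0, 0.0), var=4.0)
    zoom = MapEstimate(*image_to_map(identity, 10.0, 10.0), var=1.0)
    print(fuse(wide, zoom))
```

Under these assumptions the fused variance is always smaller than either input variance, which is one simple way to read the claim of a more precise and robust track estimate.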
