Face Tracking in a Multi-Camera Environment

We present a system for the robust real-time tracking of human faces. The system utilizes multiple cameras and is built with low-cost standard equipment. A 3D tracking module that uses the information from the multiple cameras is the core of the presented approach. Endowed with a virtual zooming utility, the system provides a close-up view of a face regardless of the person’s position and orientation. This best matching front view is found by comparison of color histograms using the Bhattacharyya coefficient. The tracking initialization and learning of the target histograms are done automatically from training data. Results on image sequences of multiple persons demonstrate the versatility of the approach. Telepresence, teleteaching or face recognition systems are examples of possible applications. The system is scalable in terms of the number of computers and cameras, but one computer/laptop with three low-cost FireWire cameras is already sufficient.

[1]  Neil A. Thacker,et al.  The Bhattacharyya metric as an absolute similarity measure for frequency coded data , 1998, Kybernetika.

[2]  Jake K. Aggarwal,et al.  Tracking Human Motion in Structured Environments Using a Distributed-Camera System , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Ross Cutler,et al.  Practical calibrations for a real-time digital omnidirectional camera , 2001 .

[4]  Dorin Comaniciu,et al.  Adaptive Resolution System for Distributed Surveillance , 2002, Real Time Imaging.

[5]  Larry S. Davis,et al.  M2Tracker: A Multi-view Approach to Segmenting and Tracking People in a Cluttered Scene Using Region-Based Stereo , 2002, ECCV.

[6]  Luc Van Gool,et al.  Monkeys -- A Software Architecture for ViRoom -- Low-Cost Multicamera System , 2003, ICVS.

[7]  Luc Van Gool,et al.  An adaptive color-based particle filter , 2003, Image Vis. Comput..

[8]  Mubarak Shah,et al.  Human tracking in multiple cameras , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  Luc Van Gool,et al.  ViRoom - Low Cost Synchronized Multicamera System and Its Self-calibration , 2002, DAGM-Symposium.

[10]  Tomáš Svoboda Quick Guide to Multi-Camera Self-Calibration , 2003 .

[11]  Mohan M. Trivedi,et al.  Active Camera Networks and Semantic Event Databases for Intelligent Environments , 2002 .

[12]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[13]  Michael Harville,et al.  Foreground segmentation using adaptive mixture models in color and depth , 2001, Proceedings IEEE Workshop on Detection and Recognition of Events in Video.