Visual processes to detect and track faces for video compression and transmission. The system is based on an architecture in which a supervisor selects and activates visual processes in cyclic manner. Control of visual processes is made possible by a confidence factor which accompanies each observation. Fusion of results into a unified estimation for tracking is made possible by estimating a covariance matrix with each observation. Visual processes for face tracking are described using blink detection, normalised color histogram matching, and cross correlation (SSD and NCC). Ensembles of visual processes are organised into processing states so as to provide robust tracking. Transition between states is determined by events detected by processes. The result of face detection is fed into recursive estimator (Kalman filter). The output from the estimator drives a PD controller for a pan/tilt/zoom camera. The resulting system provides robust and precise tracking which operates continuously at approximately 20 images per second on a 150 megahertz computer workstation.
[1]
IMAG-LIFIA,et al.
Experimental Comparison of Correlation Techniques
,
1995
.
[2]
James L. Crowley,et al.
Finger Tracking as an Input Device for Augmented Reality
,
1995
.
[3]
James L. Crowley,et al.
Integration and Control of Reactive Visual Processes
,
1994,
ECCV.
[4]
James L. Crowley,et al.
Vision as Process
,
1995
.
[5]
Yves Demazeau,et al.
Principles and techniques for sensor data fusion
,
1993,
Signal Process..
[6]
Alex Waibel,et al.
Gaze Tracking Based on Face‐Color
,
1995
.
[7]
Masayuki Inaba,et al.
Robot vision system with a correlation chip for real-time tracking, optical flow and depth map generation
,
1992,
Proceedings 1992 IEEE International Conference on Robotics and Automation.
[8]
James L. Crowley,et al.
Integration and control of reactive visual processes
,
1994,
Robotics Auton. Syst..