Real-time face tracking and pose estimation with partitioned sampling and relevance vector machine

Tracking the pose of human face has long been an important research topic which has many important applications, and it is particularly challenging with a monocular camera because the depth information is lost due to the perspective projection. This work adopts particle filter with partitioned sampling to decompose the state space of face pose tracking into two subspaces for increasing the sampling efficiency, thus achieving satisfactory performance with fewer particles. The parameters in the first subspace describe the target on image plane, and the parameter in the second subspace is used for the estimate of the face pose in yaw angle direction. For the evaluation of each hypothesis in the second subspace, a statistical learning algorithm called relevance vector machine (RVM) is used to map a face containing image to the pose of the face. The training of RVM is tailored to each detected frontal face, and it takes less than half second, which is suitable for a real-time application. The learning based regression model also presents the insensitive ability to expression variation and unmodeled degree of freedom. The experimental results verify that the combination of particle filter and RVM can efficiently reduce the processing time and add robustness to the performance of the system, thus making this algorithm applicable to human-machine interface with low-cost webcams.

[1]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[2]  Gregory D. Hager,et al.  Efficient particle filtering using RANSAC with application to 3D face tracking , 2006, Image and Vision Computing.

[3]  Rainer Stiefelhagen,et al.  Head pose estimation using stereo vision for human-robot interaction , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[4]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[5]  Fadi Dornaika,et al.  Simultaneous Facial Action Tracking and Expression Recognition in the Presence of Head Motion , 2008, International Journal of Computer Vision.

[6]  Azriel Rosenfeld,et al.  3D object tracking using shape-encoded particle propagation , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[8]  Marco La Cascia,et al.  Fast, Reliable Head Tracking under Varying Illumination: An Approach Based on Registration of Texture-Mapped 3D Models , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Vincent Lepetit,et al.  Stable real-time 3D tracking using online and offline information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Takeo Kanade,et al.  Pose Robust Face Tracking by Combining Active Appearance Models and Cylinder Head Models , 2007, International Journal of Computer Vision.

[11]  Christian Bauckhage,et al.  Fast learning for customizable head pose recognition in robotic wheelchair control , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[12]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[13]  Daijin Kim,et al.  Robust head tracking using 3D ellipsoidal head model in particle filter , 2008, Pattern Recognit..

[14]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[15]  Shaogang Gong,et al.  Fusion of perceptual cues for robust tracking of head pose and position , 2001, Pattern Recognit..

[16]  Jean-Marc Odobez,et al.  Head Pose Tracking and Focus of Attention Recognition Algorithms in Meeting Rooms , 2006, CLEAR.

[17]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevance Vector Machine , 2001 .

[18]  Marius Malciu,et al.  A robust model-based approach for 3D head tracking in video sequences , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[19]  J. Odobez,et al.  A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking , 2005 .

[20]  Thomas S. Huang,et al.  Accurate Head Pose Tracking in Low Resolution Video , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[21]  Li-Chen Fu,et al.  Real-time multitarget visual tracking with an active camera , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.