Facial feature extraction with a depth AAM algorithm

Facial feature extraction in video sequences takes an important role in face recognition, expression profile analysis, and human computer interaction. Traditional AAM (Active Appearance Model) methods for facial features localization always concentrate on fitting efficiency with few concrete analysis of the characteristic of the initial position and model instance, thus the location accuracy and speed are both not ideal. The main idea of our method is to use the face detection algorithm with a Kinect camera to accurately locate human head and estimate head pose. A depth AAM algorithm is developed to locate the detailed facial features. The head position and pose are used to initialize the AAM global shape transformation which guarantees the model fitting to the correct location. The depth AAM algorithm takes four channels-R, G, B, D into our consideration which combines the colors and the depth of input images. To locate facial feature robustly and accurately, the weights of RGB information and D information in global energy function are adjusted automatically. We also use the image pyramid algorithm and the inverse compositional algorithm to speed up the iteration. Experimental results show that our depth AAM algorithm can effectively and accurately locale facial features from video objects in conditions of complex backgrounds and various poses.

[1]  Domenico Prattichizzo,et al.  Using Kinect for hand tracking and rendering in wearable haptics , 2011, 2011 IEEE World Haptics Conference.

[2]  Roland Göcke,et al.  A Nonlinear Discriminative Approach to AAM Fitting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  Qiang Wang,et al.  Real Time Feature Based 3-D Deformable Face Tracking , 2008, ECCV.

[4]  Vasile Buzuloiu,et al.  On Color Texture Normalization for Active Appearance Models , 2009, IEEE Transactions on Image Processing.

[5]  Sebastian Thrun,et al.  Real time motion capture using a single time-of-flight camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Petros Maragos,et al.  Adaptive and constrained algorithms for inverse compositional Active Appearance Model fitting , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[8]  Sebastian Thrun,et al.  Real-time identification and localization of body parts from depth images , 2010, 2010 IEEE International Conference on Robotics and Automation.

[9]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Sridha Sridharan,et al.  Fourier Active Appearance Models , 2011, 2011 International Conference on Computer Vision.

[11]  Simon Baker,et al.  Active Appearance Models Revisited , 2004, International Journal of Computer Vision.

[12]  Jitendra Malik,et al.  Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Takeo Kanade,et al.  Real-time combined 2D+3D active appearance models , 2004, CVPR 2004.

[14]  Anil K. Jain,et al.  Deformation Modeling for Robust 3D Face Matching , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Timothy F. Cootes,et al.  Locating Facial Features and Pose Estimation Using a 3D Shape Model , 2009, ISVC.

[16]  Yanwen Wu,et al.  Face Detection in Color Images Using AdaBoost Algorithm Based on Skin Color Information , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[17]  Timothy F. Cootes,et al.  Constrained active appearance models , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[18]  Paul A. Viola,et al.  Fast and Robust Classification using Asymmetric AdaBoost and a Detector Cascade , 2001, NIPS.

[19]  Fred Nicolls,et al.  Locating Facial Features with an Extended Active Shape Model , 2008, ECCV.

[20]  Jing Xiao,et al.  Multi-view AAM fitting and camera calibration , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[21]  Vincent Lepetit,et al.  Randomized trees for real-time keypoint recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Kikuo Fujimura,et al.  3D head pose estimation with optical flow and depth constraints , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[23]  Z. Liu,et al.  Real time head pose tracking from multiple cameras with a generic model , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.