Enhanced real-time head pose estimation system for mobile device

This article proposes a real-time Head Pose Estimation HPE technique designed to be used in mobile devices. The method enables the interaction between the user and mobile devices using the device's inbuilt camera. The proposed technique is composed of different computer vision methods, which were optimized to operate in a restricted environment. The method has three Degrees of Freedom DOF, roll, yaw and pitch. The HPE is obtained using these movements. Experiments were conducted using 363 videos of 27 different people taken in varied scenarios with changes in illumination and background. The results demonstrate the robustness and efficiency of the proposed method.

[1]  Daw-Tung Lin,et al.  Integrating a mixed-feature model and multiclass support vector machine for facial expression recognition , 2009, Integr. Comput. Aided Eng..

[2]  Walid Mahdi,et al.  A hybrid approach for automatic lip localization and viseme classification to enhance visual speech recognition , 2008, Integr. Comput. Aided Eng..

[3]  Yoichi Sato,et al.  Head Pose Estimation System Based on Particle Filtering with Adaptive Diffusion Control , 2005, MVA.

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Takeo Kanade,et al.  Rotation Invariant Neural Network-Based Face Detection , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[6]  Jianfeng Ren,et al.  Real-time optimization of Viola -Jones face detection for mobile platforms , 2008, 2008 IEEE Dallas Circuits and Systems Workshop: System-on-Chip - Design, Applications, Integration, and Software.

[7]  Rainer Stiefelhagen,et al.  Head pose estimation using stereo vision for human-robot interaction , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[8]  George D. C. Cavalcanti,et al.  Real-Time Head Pose Estimation for Mobile Devices , 2012, IDEAL.

[9]  Harry Wechsler,et al.  Face pose discrimination using support vector machines (SVM) , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[10]  Kim L. Boyer,et al.  Head pose estimation using view based eigenspaces , 2002, Object recognition supported by user interaction for service robots.

[11]  Hankyu Moon,et al.  Estimating facial pose from a sparse representation [face recognition applications] , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[12]  Shaogang Gong,et al.  Face distributions in similarity space under varying head pose , 2001, Image Vis. Comput..

[13]  Larry S. Davis,et al.  Computing 3-D head orientation from a monocular image sequence , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[14]  Larry S. Davis,et al.  An anthropometric shape model for estimating head orientation , 1997 .

[15]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[16]  Roberto Cipolla,et al.  Determining the gaze of faces in images , 1994, Image Vis. Comput..

[17]  Shaogang Gong,et al.  Head Pose Classification in Crowded Scenes , 2009, BMVC.

[18]  Nicu Sebe,et al.  Multimodal Human Computer Interaction: A Survey , 2005, ICCV-HCI.

[19]  Yoichi Sato,et al.  Head Pose Classification from Low Resolution Images Using Pairwise Non-Local Intensity and Color Differences , 2010, 2010 Fourth Pacific-Rim Symposium on Image and Video Technology.

[20]  Shaogang Gong,et al.  Support vector regression and classification based multi-view face detection and recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[21]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  J.-Y. Bouguet,et al.  Pyramidal implementation of the lucas kanade feature tracker , 1999 .

[23]  Takeo Kanade,et al.  Real-time combined 2D+3D active appearance models , 2004, CVPR 2004.

[24]  José Ranilla,et al.  A low-cost 3D human interface device using GPU-based optical flow algorithms , 2011, Integr. Comput. Aided Eng..

[25]  Sharath Pankanti,et al.  Absolute head pose estimation from overhead wide-angle cameras , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[26]  R. Riego,et al.  A low-cost 3 D human interface device using GPU-based optical fl ow algorithms , 2011 .

[27]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[28]  Su Ruan,et al.  A robust agorithm for eye detection on gray intensity face without spectacles , 2005 .

[29]  Mohan M. Trivedi,et al.  Robust real-time detection, tracking, and pose estimation of faces in video streams , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[30]  Yuxiao Hu,et al.  Head Pose Estimation in Seminar Room Using Multi View Face Detectors , 2006, CLEAR.

[31]  Alejandro León,et al.  Unsupervised Markerless 3-DOF Motion Tracking in Real Time using a Single Low-Budget Camera , 2012, Int. J. Neural Syst..

[32]  Sethuraman Panchanathan,et al.  Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  William T. Freeman,et al.  Example-based head tracking , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[34]  Tolga K. Çapin,et al.  A Face Tracking Algorithm for User Interaction in Mobile Devices , 2009, 2009 International Conference on CyberWorlds.

[35]  Ahmed Bouridane,et al.  2D and 3D palmprint information, PCA and HMM for an improved person recognition performance , 2013, Integr. Comput. Aided Eng..

[36]  Erik D. Goodman,et al.  Integrating a statistical background- foreground extraction algorithm and SVM classifier for pedestrian detection and tracking , 2013, Integr. Comput. Aided Eng..

[37]  Teera Siriteerakul Advance in Head Pose Estimation from Low Resolution Images: A Review , 2012 .

[38]  Nasser Kehtarnavaz,et al.  Real-time implementation of robust face detection on mobile platforms , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[39]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Andrew Calway,et al.  Using affine correspondence to estimate 3-D facial pose , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[41]  Nicu Sebe,et al.  Human-Computer Intelligent Interaction: A Survey , 2007, ICCV-HCI.

[42]  Timothy F. Cootes,et al.  Automatic interpretation of human faces and hand gestures using flexible models. , 1995 .

[43]  Esteban Walter Gonzalez Clua,et al.  Using graph cuts in GPUs for color based human skin segmentation , 2011, Integr. Comput. Aided Eng..