Facial Landmarks Detection under Occlusions via Extended Restricted Boltzmann Machine

Facial landmarks encode critical information about face, which plays an important role in human communications. Accurate detecting and tracking facial landmarks have great potential value in intelligent user interfaces such as human-computer interactions. However, for face images with sever occlusions which may happen in real life such as hand occlusion, gesture occlusion and etc, detecting the facial landmarks is still a challenging problem. In this paper, we present a robust facial landmark detection method for image with occlusions based on Restricted Boltzmann Machine (RBM). We first present a face shape prior model which is constructed based on RBM to model the spatial shape patterns of the face. The detection process is accomplished by combining the prior shape model with the image measurements of facial landmarks. The low accuracy image measurements can be refined by the shape information embedded in the prior model. For the landmarks with severe occlusions, we firstly evaluate and determine the facial landmark occlusions, and replace their image measurements. The new image measurements are then fed into the prior model as evidence to predict the true locations. Evaluation on 3 databases demonstrates that the proposed method can detect facial landmarks accurately under severe occlusion, and achieved significant improvement over the current state of the art methods.

[1]  Fei Yang,et al.  Localization of Multi-pose and Occluded Facial Features via Sparse Shape Representation , 2013, ISVC.

[2]  Hyeonjoon Moon,et al.  The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Qingshan Liu,et al.  A Component-Based Framework for Generalized Face Alignment , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[5]  Yu Hen Hu,et al.  Video Salient Object Detection via Robust Seeds Extraction and Multi-Graphs Manifold Propagation , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Simon Baker,et al.  Active Appearance Models Revisited , 2004, International Journal of Computer Vision.

[7]  Yu Hen Hu,et al.  Video Saliency Detection via Graph Clustering With Motion Energy and Spatiotemporal Objectness , 2019, IEEE Transactions on Multimedia.

[8]  Müjdat Çetin,et al.  A graphical model based solution to the facial feature point tracking problem , 2011, Image Vis. Comput..

[9]  Jeng-Shyang Pan,et al.  Novel Systolization of Subquadratic Space Complexity Multipliers Based on Toeplitz Matrix–Vector Product Approach , 2019, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[10]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[11]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[12]  Yi Zhou,et al.  Bayesian tangent shape model: estimating shape and pose parameters via Bayesian inference , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Avinash C. Kak,et al.  Proceedings of IEEE International Conference on Multimodel Interfaces, 2002 , 2022 .

[14]  Maja Pantic,et al.  Local Evidence Aggregation for Regression-Based Facial Point Detection , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Gwen Littlewort,et al.  Real Time Face Detection and Facial Expression Recognition: Development and Applications to Human Computer Interaction. , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[16]  Maja Pantic,et al.  Facial point detection using boosted regression and graph models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Qiang Ji,et al.  Facial Feature Tracking Under Varying Facial Expressions and Face Poses Based on Restricted Boltzmann Machines , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Takeo Kanade,et al.  A Generative Shape Regularization Model for Robust Face Alignment , 2008, ECCV.