Accurate mouth state estimation via convolutional neural networks

Human mouth is very flexible such that its status (closed or open) is often used as a judgment in the liveness detection of face recognition. However, due to large head pose and illumination variations, accurate mouth status estimation is still challenging in real-world scenarios. In this paper, we propose a deep convolutional neural networks (CNNs) method for mouth status estimation under unconstrained conditions and different types of attacks. Different from previous methods that extract hand-crafted features and then treat the estimation problem as a binary classification task, our method automatically extracts discriminative features via learned convolutional and the pooling layers. To demonstrate the effectiveness of our method and the challenge of mouth status estimation in real-world, we also propose a mouth status estimation dataset that contains 10,714 images in the wild. Experimental results with two types of liveness attacks show that our proposed method outperforms the other traditional methods, especially in the wild condition.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Hongbin Zha,et al.  Unsupervised Random Forest Manifold Alignment for Lipreading , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Jian-Huang Lai,et al.  Mouth state estimation in mobile computing environment , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[5]  Alice Caplier,et al.  Open or Closed Mouth State Detection: Static Supervised Classification Based on Log-Polar Signature , 2008, ACIVS.

[6]  Fernando De la Torre,et al.  Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[8]  Larry S. Davis,et al.  Learning predictable binary codes for face indexing , 2015, Pattern Recognit..

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Subhransu Maji,et al.  Describing people: A poselet-based approach to attribute classification , 2011, 2011 International Conference on Computer Vision.

[13]  Zhenan Sun,et al.  Multi-task ConvNet for blind face inpainting with application to face verification , 2016, 2016 International Conference on Biometrics (ICB).

[14]  Zhenan Sun,et al.  A Lightened CNN for Deep Face Representation , 2015, ArXiv.

[15]  Ran He,et al.  Locally imposing function for Generalized Constraint Neural Networks - A study on equality constraints , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[16]  Xiaogang Wang,et al.  Deep Convolutional Network Cascade for Facial Point Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Arun Ross,et al.  Automatic facial makeup detection with application in face recognition , 2013, 2013 International Conference on Biometrics (ICB).

[18]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[19]  Ankur Datta,et al.  Hierarchical ranking of facial attributes , 2011, Face and Gesture 2011.

[20]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).