Discriminative Probabilistic Latent Semantic Analysis with Application to Single Sample Face Recognition

Face recognition is still a challenging issue due to the presence of intrinsic complexity, external variations and number limitation of training samples. In this paper, a novel face recognition method based on probabilistic latent semantic analysis (pLSA) model is developed, which mainly contains two stages: bag-of-words features extraction and semantic representation learning. In the first stage, to extract more structure information, the region-specific dictionary strategy is employed, i.e., generating a dictionary for each region. The encoded and sum-pooled features of all regions are concatenated together. In the second stage, a discriminative pLSA (DpLSA) model is presented, which initializes the word-topic distribution $$P(w|z_k)$$P(w|zk) by the center point of the training data from category k. As a result, the problem of how to choose appropriate number of topics in classical topic model is alleviated, and the training process of DpLSA is very fast only requiring few iterations. Moreover, the discovered topic-document distribution $$P\left( z|d\right) $$Pz|d is discriminative and semantic with the dominant topic entry corresponds to the category label of image d, which enables performing classification by $$P\left( z|d\right) $$Pz|d directly. Extensive experiments on four representative databases demonstrate that the proposed DpLSA is effective for face recognition under single training sample and possesses a certain degree of robustness to illumination, pose, as well as occlusion.

[1]  The Connection Between Manifold Learning and Distance Metric Learning , 2007 .

[2]  Xiaolan Fu,et al.  Face Recognition and Micro-expression Recognition Based on Discriminant Tensor Subspace Analysis Plus Extreme Learning Machine , 2014, Neural Processing Letters.

[3]  Matti Pietikäinen,et al.  Face Recognition with Local Binary Patterns , 2004, ECCV.

[4]  Chengjun Liu,et al.  Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition , 2002, IEEE Trans. Image Process..

[5]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Yanyan Zhang,et al.  Face Recognition via Spatial-PLSA , 2009, 2009 Chinese Conference on Pattern Recognition.

[7]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[8]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Tal Hassner,et al.  Similarity Scores Based on Background Samples , 2009, ACCV.

[10]  Lei Zhang,et al.  Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person , 2013, 2013 IEEE International Conference on Computer Vision.

[11]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[12]  Yang Yang,et al.  Face image classification by pooling raw features , 2014, Pattern Recognit..

[13]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[15]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[16]  Lei Zhang,et al.  Local Generic Representation for Face Recognition with Single Sample per Person , 2014, ACCV.

[17]  Dacheng Tao,et al.  DCT Regularized Extreme Visual Recovery , 2017, IEEE Transactions on Image Processing.

[18]  Yang Yang,et al.  Face identification with second-order pooling in single-layer networks , 2016, Neurocomputing.

[19]  Zhi-Hua Zhou,et al.  Making FLDA applicable to face recognition with one sample per person , 2004, Pattern Recognit..

[20]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[21]  Dacheng Tao,et al.  Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Fei-Fei Li,et al.  Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[23]  Zhengtao Yu,et al.  Locality Preserving Collaborative Representation for Face Recognition , 2017, Neural Processing Letters.

[24]  Shiguang Shan,et al.  Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[26]  Yihong Gong,et al.  Nonlinear Learning using Local Coordinate Coding , 2009, NIPS.

[27]  Masahide Kaneko,et al.  Robust Face Recognition Using Block-Based Bag of Words , 2010, 2010 20th International Conference on Pattern Recognition.

[28]  Esa Rahtu,et al.  BSIF: Binarized statistical image features , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[29]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[30]  Gang Wang,et al.  Discriminative multi-manifold analysis for face recognition from a single training sample per person , 2011, 2011 International Conference on Computer Vision.

[31]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[32]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[33]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Gang Wang,et al.  Joint Feature Learning for Face Recognition , 2015, IEEE Transactions on Information Forensics and Security.

[35]  Jie Xu,et al.  Query Aware Determinization of Uncertain Objects , 2015, IEEE Transactions on Knowledge and Data Engineering.

[36]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[37]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[38]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[39]  Chun Chen,et al.  EMR: A Scalable Graph-Based Ranking Model for Content-Based Image Retrieval , 2015, IEEE Transactions on Knowledge and Data Engineering.

[40]  Lei Zhang,et al.  Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.

[41]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[42]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[44]  Jonghyun Choi,et al.  Multi-Directional Multi-Level Dual-Cross Patterns for Robust Face Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[46]  Dacheng Tao,et al.  A Comprehensive Survey on Pose-Invariant Face Recognition , 2015, ACM Trans. Intell. Syst. Technol..

[47]  Yang Wang,et al.  Human Action Recognition by Semilatent Topic Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[49]  Li Li,et al.  Face Recognition Using Gabor-Based Feature Extraction and Feature Space Transformation Fusion Method for Single Image per Person Problem , 2017, Neural Processing Letters.

[50]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[51]  Dacheng Tao,et al.  Robust Face Recognition via Multimodal Deep Face Representation , 2015, IEEE Transactions on Multimedia.

[52]  Simon C. K. Shiu,et al.  Multi-scale Patch Based Collaborative Representation for Face Recognition with Margin Distribution Optimization , 2012, ECCV.

[53]  Changyin Sun,et al.  SPA: Spatially Pooled Attributes for image retrieval , 2017, Neurocomputing.

[54]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Alessandro Perina,et al.  Robust Initialization for Learning Latent Dirichlet Allocation , 2015, SIMBAD.

[56]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[57]  Xiaogang Wang,et al.  Deep Learning Face Representation from Predicting 10,000 Classes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Lei Zhang,et al.  Sparsely encoded local descriptor for face verification , 2015, Neurocomputing.

[59]  Alice Caplier,et al.  Enhanced Patterns of Oriented Edge Magnitudes for Face Recognition and Image Matching , 2012, IEEE Transactions on Image Processing.

[60]  Dacheng Tao,et al.  Pose-invariant face recognition with homography-based normalization , 2017, Pattern Recognit..

[61]  Dacheng Tao,et al.  Multi-Task Pose-Invariant Face Recognition , 2015, IEEE Transactions on Image Processing.

[62]  Christoph H. Lampert,et al.  Deep Fisher Kernels -- End to End Learning of the Fisher Kernel GMM Parameters , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[63]  Jian Sun,et al.  Face recognition with learning-based descriptor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[64]  Tieniu Tan,et al.  Feature Coding in Image Classification: A Comprehensive Study , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Jian Yang,et al.  Robust sparse coding for face recognition , 2011, CVPR 2011.

[66]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[67]  Zhiwu Lu,et al.  Image categorization via robust pLSA , 2010, Pattern Recognit. Lett..

[68]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[69]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[70]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[71]  C. L. Philip Chen,et al.  Sparse discriminative multi-manifold embedding for one-sample face identification , 2016, Pattern Recognit..