Hierarchical hyperlingual-words for multi-modality face classification

The usage of specifically designed cameras for robust image classification, biometric and surveillance has emerged recently and it inevitably involves multi-modality classification problems. Cross-modality as well as within- and between-class variations jointly produce a significantly complex problem. In this paper, we propose a hierarchical hyperlingual-words based approach towards the aforementioned problems. First, a novel structure, hyperlingual-words, is created to capture the high-level semantic features across different modalities and within each modality. Second, considering the impact of different resolutions of histograms, we utilize pyramid histogram match for hierarchical hyperlingual-words to weight the ChiSquare metric, and obtain a more discriminative one. Finally, extensive experiments are conducted on two data sets, namely, BUAA-VisNir Face Database and Oulu-CASIA NIR&VIS Database, and results show that our method is superior to the state-of-the-art on cross-modality face recognition with pose&expression variations.

[1]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[2]  王晓刚,et al.  Coupled Information-Theoretic Encoding for Face Photo-Sketch Recognition , 2011 .

[3]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Silvio Savarese,et al.  Cross-view action recognition via view knowledge transfer , 2011, CVPR 2011.

[5]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[6]  Shengcai Liao,et al.  Illumination Invariant Face Recognition Using Near-Infrared Images , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Yang Yang,et al.  Learning semantic visual vocabularies using diffusion distance , 2009, CVPR.

[9]  Anil K. Jain,et al.  Heterogeneous Face Recognition Using Kernel Prototype Similarities , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Siu-Yeung Cho,et al.  A face emotion tree structure representation with probabilistic recursive neural network modeling , 2010, Neural Computing and Applications.

[11]  Quanquan Gu,et al.  Learning the Shared Subspace for Multi-task Clustering and Transductive Transfer Classification , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[12]  Dahua Lin,et al.  Inter-modality Face Recognition , 2006, ECCV.

[13]  Dong Yi,et al.  Face Matching Between Near Infrared and Visible Light Images , 2007, ICB.

[14]  Jian Sun,et al.  Face recognition with learning-based descriptor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Shimon Ullman,et al.  Face Recognition: The Problem of Compensating for Changes in Illumination Direction , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Chris H. Q. Ding,et al.  A min-max cut algorithm for graph partitioning and data clustering , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[17]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[18]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[19]  Qingshan Liu,et al.  Image retrieval via probabilistic hypergraph ranking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[21]  Harold L. Somers,et al.  An introduction to machine translation , 1992 .

[22]  Ming Shao,et al.  A super-resolution based method to synthesize visual images from near infrared , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[23]  Stan Z. Li,et al.  Coupled Spectral Regression for matching heterogeneous faces , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Matti Pietikäinen,et al.  Learning mappings for face synthesis from near infrared to visual light images , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[26]  Ammad Ali,et al.  Face Recognition with Local Binary Patterns , 2012 .

[27]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[29]  Shengcai Liao,et al.  Heterogeneous Face Recognition from Local Structures of Normalized Appearance , 2009, ICB.

[30]  Serge J. Belongie,et al.  Higher order learning with graphs , 2006, ICML.