SRSC: Selective, Robust, and Supervised Constrained Feature Representation for Image Classification

Feature representation learning has made great progress in recent years, and powerful learned features can lead to excellent classification accuracy. In this article, we present a selective and robust feature representation framework with a supervised constraint (SRSC). SRSC seeks a selective, robust, and discriminative subspace by transforming the original feature space into the category space. In particular, we impose a selective constraint on the transformation matrix (or classifier parameter) so that it selects the discriminative dimensions of the input samples. Moreover, a supervised regularization term is tailored to further enhance the discriminability of the subspace. To relax the hard zero-one label matrix in the category space, an additional error term is incorporated into the framework, which can lead to a more robust transformation matrix. SRSC is formulated as a constrained least squares learning (feature transforming) problem and is solved with an inexact augmented Lagrange multiplier (ALM) method. Extensive experiments on several benchmark data sets demonstrate the effectiveness of the proposed method, which achieves better performance than the compared methods.
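The ingredients named above (a least squares map into the category space, a selective constraint on the transformation matrix, and an error term that relaxes the zero-one labels) can be illustrated with a small sketch. Everything specific here is an assumption made for illustration, not the formulation of the article: the selective constraint is taken to be an ℓ2,1 norm on the rows of W, the error term E is taken to be ℓ1-penalized, the supervised regularization term is left out, and the solver is plain alternating minimization with iterative reweighting rather than the inexact ALM. The names `fit_selective_robust_ls`, `predict`, `lam`, and `mu` are hypothetical.

```python
import numpy as np


def soft_threshold(a, tau):
    """Elementwise soft-thresholding, the proximal operator of tau * |.|_1."""
    return np.sign(a) * np.maximum(np.abs(a) - tau, 0.0)


def fit_selective_robust_ls(X, labels, n_classes, lam=1.0, mu=1.0,
                            n_iter=30, eps=1e-8):
    """Illustrative objective (an assumption, not the article's exact model):

        min_{W,E} ||X W - (Y + E)||_F^2 + lam * ||W||_{2,1} + mu * ||E||_1

    X: (n, d) samples, labels: (n,) integer class ids in [0, n_classes).
    Returns W of shape (d, n_classes) and E of shape (n, n_classes).
    """
    n, d = X.shape
    Y = np.eye(n_classes)[labels]      # hard zero-one label matrix
    E = np.zeros_like(Y)               # label relaxation / error term
    D = np.eye(d)                      # reweighting matrix for the l2,1 term
    W = np.zeros((d, n_classes))
    for _ in range(n_iter):
        # W-step: reweighted ridge-like solve for the l2,1-penalized problem.
        W = np.linalg.solve(X.T @ X + lam * D, X.T @ (Y + E))
        row_norms = np.linalg.norm(W, axis=1)
        D = np.diag(1.0 / (2.0 * row_norms + eps))
        # E-step: closed-form minimizer of ||R - E||_F^2 + mu * ||E||_1.
        E = soft_threshold(X @ W - Y, mu / 2.0)
    return W, E


def predict(X, W):
    """Assign each sample to the category with the largest response."""
    return np.argmax(X @ W, axis=1)
```

Under these assumptions, rows of W with small ℓ2 norm correspond to input dimensions that contribute little to any category, which is one common way a selective constraint can be read; the article's own constraint and its supervised regularization may differ.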
