Simultaneously Learning Neighborship and Projection Matrix for Supervised Dimensionality Reduction

Explicitly or implicitly, most dimensionality reduction methods need to determine which samples are neighbors and the similarities between the neighbors in the original high-dimensional space. The projection matrix is then learnt on the assumption that the neighborhood information, e.g., the similarities, are known and fixed prior to learning. However, it is difficult to precisely measure the intrinsic similarities of samples in high-dimensional space because of the curse of dimensionality. Consequently, the neighbors selected according to such similarities and the projection matrix obtained according to such similarities and the corresponding neighbors might not be optimal in the sense of classification and generalization. To overcome this drawback, in this paper, we propose to let the similarities and neighbors be variables and model these in a low-dimensional space. Both the optimal similarity and projection matrix are obtained by minimizing a unified objective function. Nonnegative and sum-to-one constraints on the similarity are adopted. Instead of empirically setting the regularization parameter, we treat it as a variable to be optimized. It is interesting that the optimal regularization parameter is adaptive to the neighbors in a low-dimensional space and has an intuitive meaning. Experimental results on the YALE B, COIL-100, and MNIST data sets demonstrate the effectiveness of the proposed method.

[1]  Kun Zhou,et al.  Locality Sensitive Discriminant Analysis , 2007, IJCAI.

[2]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Jian Yang,et al.  A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Zhetao Li,et al.  Visual Tracking With Weighted Adaptive Local Sparse Appearance Model via Spatio-Temporal Context Learning , 2018, IEEE Transactions on Image Processing.

[5]  Sameer A. Nene,et al.  Columbia Object Image Library (COIL100) , 1996 .

[6]  Jianmin Jiang,et al.  Automated Induction of Heterogeneous Proximity Measures for Supervised Spectral Embedding , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Yanwei Pang,et al.  GlanceNets — efficient convolutional neural networks with adaptive hard example mining , 2018, Science China Information Sciences.

[8]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Lunke Fei,et al.  Robust Sparse Linear Discriminant Analysis , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Xuelong Li,et al.  Discriminative Transfer Subspace Learning via Low-Rank and Sparse Representation , 2016, IEEE Transactions on Image Processing.

[12]  Hailin Zhang,et al.  Joint Global and Local Structure Discriminant Analysis , 2013, IEEE Trans. Inf. Forensics Secur..

[13]  David J. Kriegman,et al.  Acquiring linear subspaces for face recognition under variable lighting , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Mehrtash Tafazzoli Harandi,et al.  From Manifold to Manifold: Geometry-Aware Dimensionality Reduction for SPD Matrices , 2014, ECCV.

[15]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[16]  Xuelong Li,et al.  Learning Sampling Distributions for Efficient Object Detection , 2015, IEEE Transactions on Cybernetics.

[17]  Xuelong Li,et al.  Cascade Learning by Optimally Partitioning , 2015, IEEE Transactions on Cybernetics.

[18]  Xuelong Li,et al.  Robust Tensor Analysis With L1-Norm , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Ioannis A. Kakadiaris,et al.  An Overview and Empirical Comparison of Distance Metric Learning Methods , 2017, IEEE Transactions on Cybernetics.

[21]  Dacheng Tao,et al.  On the Performance of Manhattan Nonnegative Matrix Factorization , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Zoubin Ghahramani,et al.  Unifying linear dimensionality reduction , 2014, 1406.0873.

[23]  Vassilios Morellas,et al.  Tensor Sparse Coding for Positive Definite Matrices , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Jun Guo,et al.  Linear Ranking Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[26]  Yanwei Pang,et al.  Learning Regularized LDA by Clustering , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Xuelong Li,et al.  Convolution in Convolution for Network in Network , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Junbin Gao,et al.  Vectorial Dimension Reduction for Tensors Based on Bayesian Inference , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Xuelong Li,et al.  General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Anastasios Tefas,et al.  Dimensionality Reduction Using Similarity-Induced Embeddings , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[31]  Wray L. Buntine Variational Extensions to EM and Multinomial PCA , 2002, ECML.

[32]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[33]  Yong Yu,et al.  Robust Recovery of Subspace Structures by Low-Rank Representation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Xiao Ma,et al.  Robust sparse representation based face recognition in an adaptive weighted spatial pyramid structure , 2016, Science China Information Sciences.

[35]  Ling Shao,et al.  Sequential Compact Code Learning for Unsupervised Image Hashing , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[36]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Quanxue Gao,et al.  Joint Global and Local Structure Discriminant Analysis , 2013, IEEE Transactions on Information Forensics and Security.

[38]  Xuelong Li,et al.  Cascaded Subpatch Networks for Effective CNNs , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Jiwen Lu,et al.  Regularized Locality Preserving Projections and Its Extensions for Face Recognition , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[40]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[41]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[42]  Tingting Mu,et al.  Adaptive Data Embedding Framework for Multiclass Classification , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[43]  Neil D. Lawrence,et al.  A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction: Insights and New Models , 2010, J. Mach. Learn. Res..

[44]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[45]  Xinbo Gao,et al.  Stable Orthogonal Local Discriminant Embedding for Linear Dimensionality Reduction , 2013, IEEE Transactions on Image Processing.

[46]  Lei Wang,et al.  A Graph-Embedding Approach to Hierarchical Visual Word Mergence , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[47]  Xuelong Li,et al.  Gabor-Based Region Covariance Matrices for Face Recognition , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[48]  Stephen Lin,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[50]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[51]  Gerhard Widmer,et al.  Deep Linear Discriminant Analysis , 2015, ICLR.

[52]  Delin Chu,et al.  Sparse Uncorrelated Linear Discriminant Analysis for Undersampled Problems , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[53]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[54]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..