Local structured feature learning with dynamic maximum entropy graph

Abstract In recent years, Linear Discriminant Analysis (LDA) has seen huge adoption in data mining applications. Due to its globality, it is incompetent to handle multimodal data. Besides, most of LDA’s variants learn the projection matrix based on the pre-defined similarity matrix, which is easily affected by noisy and irrelevant features. To address above two issues, a novel local structured feature learning with Dynamic Maximum Entropy Graph (DMEG) method is developed which firstly develops a more discriminative LDA with whitening constraint that can minimize the within-class scatter while keeping the total samples scatter unchanged simultaneously. Second, for exploring the local structure of data, the l0-norm constraint is imposed on similarity matrix to ensure the k connectivity on graph. More importantly, proposed model learns the similarity and projection matrix simultaneously to ensure that the neighborships can be found in the optimal subspace where the noise have been removed already. Moreover, a maximum entropy regularization is employed to reinforce the discriminability of graph and avoid the trivial solution. Last but not least, an efficient iterative optimization algorithm is provided to optimize proposed model with a NP-hard constraint. Extensive experiments conducted on synthetic and several real-world data sets demonstrate the efficiency in classification task and robustness to noise of proposed method.

[1]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[2]  David Zhang,et al.  Local Linear Discriminant Analysis Framework Using Sample Neighbors , 2011, IEEE Transactions on Neural Networks.

[3]  Jingjing Liu,et al.  Enhanced fisher discriminant criterion for image recognition , 2012, Pattern Recognit..

[4]  Zhenhua Guo,et al.  Phase congruency induced local features for finger-knuckle-print recognition , 2012, Pattern Recognit..

[5]  Terence Sim,et al.  The CMU Pose, Illumination, and Expression (PIE) database , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[6]  Lei Zhang,et al.  A multi-manifold discriminant analysis method for image feature extraction , 2011, Pattern Recognit..

[7]  Stephen Lin,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Jian Yang,et al.  Globally Maximizing, Locally Minimizing: Unsupervised Discriminant Projection with Applications to Face and Palm Biometrics , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Rong Wang,et al.  Submanifold-Preserving Discriminant Analysis With an Auto-Optimized Graph , 2020, IEEE Transactions on Cybernetics.

[10]  Xinbo Gao,et al.  Dimensionality Reduction by Integrating Sparse Representation and Fisher Criterion and its Applications , 2015, IEEE Transactions on Image Processing.

[11]  Bin Zhao,et al.  CAM-RNN: Co-Attention Model Based RNN for Video Captioning , 2019, IEEE Transactions on Image Processing.

[12]  A. Martínez,et al.  The AR face databasae , 1998 .

[13]  Qing He,et al.  On Defining Partition Entropy by Inequalities , 2007, IEEE Transactions on Information Theory.

[14]  Xuelong Li,et al.  Locality Adaptive Discriminant Analysis , 2017, IJCAI.

[15]  Kun Zhou,et al.  Locality Sensitive Discriminant Analysis , 2007, IJCAI.

[16]  Jieping Ye,et al.  Feature Reduction via Generalized Uncorrelated Linear Discriminant Analysis , 2006, IEEE Transactions on Knowledge and Data Engineering.

[17]  Feiping Nie,et al.  Neighborhood MinMax Projections , 2007, IJCAI.

[18]  Feiping Nie,et al.  Adaptive Neighborhood MinMax Projections , 2018, Neurocomputing.

[19]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[20]  Feiping Nie,et al.  Orthogonal least squares regression for feature extraction , 2016, Neurocomputing.

[21]  Masashi Sugiyama,et al.  Local Fisher discriminant analysis for supervised dimensionality reduction , 2006, ICML.

[22]  Honggang Zhang,et al.  Comments on "Globally Maximizing, Locally Minimizing: Unsupervised Discriminant Projection with Application to Face and Palm Biometrics" , 2007, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Feiping Nie,et al.  Adaptive Local Linear Discriminant Analysis , 2020, ACM Trans. Knowl. Discov. Data.

[24]  Dong Xu,et al.  Trace Ratio vs. Ratio Trace for Dimensionality Reduction , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Jieping Ye,et al.  Characterization of a Family of Algorithms for Generalized Discriminant Analysis on Undersampled Problems , 2005, J. Mach. Learn. Res..

[26]  Feiping Nie,et al.  Dimension Reduction for Non-Gaussian Data by Adaptive Discriminative Analysis , 2019, IEEE Transactions on Cybernetics.

[27]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[28]  Zhihui Lai,et al.  Structured optimal graph based sparse feature extraction for semi-supervised learning , 2020, Signal Process..

[29]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[30]  Feiping Nie,et al.  Curriculum Audiovisual Learning , 2020, ArXiv.

[31]  Feiping Nie,et al.  Trace Ratio Problem Revisited , 2009, IEEE Transactions on Neural Networks.

[32]  Qi Wang,et al.  PCC Net: Perspective Crowd Counting via Spatial Convolutional Network , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[34]  Ran He,et al.  Robust Principal Component Analysis Based on Maximum Correntropy Criterion , 2011, IEEE Transactions on Image Processing.

[35]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[36]  Jian Yang,et al.  Approximate Orthogonal Sparse Embedding for Dimensionality Reduction , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Feiping Nie,et al.  A New Formulation of Linear Discriminant Analysis for Robust Dimensionality Reduction , 2019, IEEE Transactions on Knowledge and Data Engineering.

[38]  Limei Zhang,et al.  Graph-optimized locality preserving projections , 2010, Pattern Recognit..

[39]  Anastasios Tefas,et al.  Weighted Piecewise LDA for Solving the Small Sample Size Problem in Face Verification , 2007, IEEE Transactions on Neural Networks.

[40]  Hui Xu,et al.  Two-dimensional supervised local similarity and diversity projection , 2010, Pattern Recognit..

[41]  Daoqiang Zhang,et al.  Efficient and robust feature extraction by maximum margin criterion , 2003, IEEE Transactions on Neural Networks.

[42]  Hwann-Tzong Chen,et al.  Local discriminant embedding and its variants , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[43]  Jieping Ye,et al.  Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis , 2006, J. Mach. Learn. Res..

[44]  Jieping Ye,et al.  Null space versus orthogonal linear discriminant analysis , 2006, ICML '06.

[45]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[46]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[47]  Jun Guo,et al.  The small sample size problem of ICA: A comparative study and analysis , 2012, Pattern Recognit..

[48]  Xinbo Gao,et al.  Robust DLPP With Nongreedy $\ell _1$ -Norm Minimization and Maximization , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[49]  Jun Guo,et al.  Robust, accurate and efficient face recognition from a single training image: A uniform pursuit approach , 2010, Pattern Recognit..

[50]  Jiashu Zhang,et al.  Linear Discriminant Analysis Based on L1-Norm Maximization , 2013, IEEE Transactions on Image Processing.

[51]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[52]  Qi Wang,et al.  SCAR: Spatial-/Channel-wise Attention Regression Networks for Crowd Counting , 2019, Neurocomputing.

[53]  Lin Zhang,et al.  Discriminative low-rank preserving projection for dimensionality reduction , 2019, Appl. Soft Comput..

[54]  Rong Wang,et al.  Towards Robust Discriminative Projections Learning via Non-Greedy $\ell _{2,1}$2,1, 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.