Self-centralized jointly sparse maximum margin criterion for robust dimensionality reduction

Abstract Linear discriminant analysis (LDA) is among the most popular supervised dimensionality reduction algorithms, which has been largely followed in the fields of pattern recognition and data mining. However, LDA has three major drawbacks. One is the challenge brought by small-sample-size (SSS) problem; second makes it sensitive to outliers due to the use of squared L 2 -norms in the scatter loss evaluation; the third is the case that the feature loadings in projection matrix are relatively redundant and there is a risk of overfitting. In this paper, we put forward a novel functional expression for LDA, which combines maximum margin criterion (MMC) with a weighted strategy formulated by L 1 , 2 -norms to against outliers. Meanwhile, we simultaneously realize the adaptive calculation of weighted intra-class and global centroid to further reduce the influence of outliers, and employ the L 2 , 1 -norm to constrain row sparsity so that subspace learning and feature selection could be performed cooperatively. Besides, an effective alternating iterative algorithm is derived and its convergence is verified. From the complexity analysis, our proposed algorithm can deal with large-scale data processing. Our proposed model can address the sensitivity problem of outliers and extract the most representative features while preventing overfitting effectively. Experiments performed on several benchmark databases demonstrate that the proposed algorithm is more effective than some other state-of-the-art methods and has better generalization performance.

[1]  Kun Zhou,et al.  Locality Sensitive Discriminant Analysis , 2007, IJCAI.

[2]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[4]  Xuesong Lu,et al.  Fisher Discriminant Analysis With L1-Norm , 2014, IEEE Transactions on Cybernetics.

[5]  Alessandro Armando,et al.  LTL Model Checking for Security Protocols , 2007, 20th IEEE Computer Security Foundations Symposium (CSF'07).

[6]  Andy Harter,et al.  Parameterisation of a stochastic model for human face identification , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[7]  Kimon P. Valavanis,et al.  Unmanned Vehicle Controller Design, Evaluation and Implementation: From MATLAB to Printed Circuit Board , 2007, J. Intell. Robotic Syst..

[8]  Jiawei Han,et al.  Joint Feature Selection and Subspace Learning , 2011, IJCAI.

[9]  Zhenhua Guo,et al.  Face recognition by sparse discriminant analysis via joint L2, 1-norm minimization , 2014, Pattern Recognit..

[10]  Qi Tian,et al.  Image Classification By The Foley-Sammon Transform , 1986 .

[11]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[12]  D. B. Graham,et al.  Characterising Virtual Eigensignatures for General Purpose Face Recognition , 1998 .

[13]  Nojun Kwak,et al.  Principal Component Analysis Based on L1-Norm Maximization , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Konstantinos N. Plataniotis,et al.  Regularization studies of linear discriminant analysis in small sample size scenarios with application to face recognition , 2005, Pattern Recognit. Lett..

[15]  Jianhua Z. Huang,et al.  Sparse Linear Discriminant Analysis with Applications to High Dimensional Low Sample Size Data , 2009 .

[16]  Wai Keung Wong,et al.  Generalized Robust Regression for Jointly Sparse Subspace Learning , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[18]  Zhenyu He,et al.  Joint sparse principal component analysis , 2017, Pattern Recognit..

[19]  J. Friedman Regularized Discriminant Analysis , 1989 .

[20]  Lunke Fei,et al.  Robust Sparse Linear Discriminant Analysis , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Delin Chu,et al.  Sparse Uncorrelated Linear Discriminant Analysis for Undersampled Problems , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Tao Jiang,et al.  Efficient and robust feature extraction by maximum margin criterion , 2003, IEEE Transactions on Neural Networks.

[23]  Takeo Kanade,et al.  Robust L/sub 1/ norm factorization in the presence of outliers and missing data by alternative convex programming , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[24]  Lei Wang,et al.  Classification for Volume Rendering of Industrial CT Based on Moment of Histogram , 2007, 2007 2nd IEEE Conference on Industrial Electronics and Applications.

[25]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  David Zhang,et al.  A Multiple Maximum Scatter Difference Discriminant Criterion for Facial Feature Extraction , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[27]  Hong Qiao,et al.  Scatter Balance: An Angle-Based Supervised Dimensionality Reduction , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Chris H. Q. Ding,et al.  R1-PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization , 2006, ICML.

[29]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[30]  Zhongfei Zhang,et al.  Linear discriminant analysis using rotational invariant L1 norm , 2010, Neurocomputing.

[31]  David Zhang,et al.  Face recognition based on a novel linear discriminant criterion , 2006, Pattern Analysis and Applications.

[32]  Jiashu Zhang,et al.  Linear Discriminant Analysis Based on L1-Norm Maximization , 2013, IEEE Transactions on Image Processing.

[33]  Larry S. Davis,et al.  Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Jian Yang,et al.  Why can LDA be performed in PCA transformed space? , 2003, Pattern Recognit..

[35]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[36]  Feiping Nie,et al.  Neighborhood MinMax Projections , 2007, IJCAI.

[37]  Feiping Nie,et al.  A New Formulation of Linear Discriminant Analysis for Robust Dimensionality Reduction , 2019, IEEE Transactions on Knowledge and Data Engineering.

[38]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[39]  Shiliang Sun,et al.  Manifold Partition Discriminant Analysis , 2017, IEEE Transactions on Cybernetics.

[40]  Hua Yu,et al.  A direct LDA algorithm for high-dimensional data - with application to face recognition , 2001, Pattern Recognit..

[41]  Ja-Chen Lin,et al.  A new LDA-based face recognition system which can solve the small sample size problem , 1998, Pattern Recognit..

[42]  Trevor J. Hastie,et al.  Sparse Discriminant Analysis , 2011, Technometrics.