Robust Linear Discriminant Analysis Using Ratio Minimization of L1, 2-Norms

As one of the most popular linear subspace learning methods, the Linear Discriminant Analysis (LDA) method has been widely studied in machine learning community and applied to many scientific applications. Traditional LDA minimizes the ratio of squared L2-norms, which is sensitive to outliers. In recent research, many L1-norm based robust Principle Component Analysis methods were proposed to improve the robustness to outliers. However, due to the difficulty of L1-norm ratio optimization, so far there is no existing work to utilize sparsity-inducing norms for LDA objective. In this paper, we propose a novel robust linear discriminant analysis method based on the L1,2-norm ratio minimization. Minimizing the L1,2-norm ratio is a much more challenging problem than the traditional methods, and there is no existing optimization algorithm to solve such non-smooth terms ratio problem. We derive a new efficient algorithm to solve this challenging problem, and provide a theoretical analysis on the convergence of our algorithm. The proposed algorithm is easy to implement, and converges fast in practice. Extensive experiments on both synthetic data and nine real benchmark data sets show the effectiveness of the proposed robust LDA method.

[1]  Hyunsoo Kim,et al.  Adaptive nonlinear discriminant analysis by regularized minimum squared errors , 2006, IEEE Transactions on Knowledge and Data Engineering.

[2]  Jian Yang,et al.  L1-Norm Distance Linear Discriminant Analysis Based on an Effective Iterative Algorithm , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  P. Sabatier A L 1 -norm Pca and a Heuristic Approach , 1996 .

[4]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[6]  Jieping Ye,et al.  Least squares linear discriminant analysis , 2007, ICML '07.

[7]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[8]  Chris H. Q. Ding,et al.  Discriminant Laplacian Embedding , 2010, AAAI.

[9]  Sanja Fidler,et al.  Robust LDA Classification by Subsampling , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[10]  Junbin Gao,et al.  Robust L1 Principal Component Analysis and Its Bayesian Variational Inference , 2008, Neural Computation.

[11]  Nojun Kwak,et al.  Principal Component Analysis Based on L1-Norm Maximization , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jieping Ye,et al.  Characterization of a Family of Algorithms for Generalized Discriminant Analysis on Undersampled Problems , 2005, J. Mach. Learn. Res..

[13]  Peter Filzmoser,et al.  CLASSIFICATION EFFICIENCIES FOR ROBUST LINEAR DISCRIMINANT ANALYSIS , 2008 .

[14]  Dong Xu,et al.  Trace Ratio vs. Ratio Trace for Dimensionality Reduction , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Haesun Park,et al.  A comparison of generalized linear discriminant analysis algorithms , 2008, Pattern Recognit..

[16]  Jiashu Zhang,et al.  Linear Discriminant Analysis Based on L1-Norm Maximization , 2013, IEEE Transactions on Image Processing.

[17]  Chris H. Q. Ding,et al.  R1-PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization , 2006, ICML.

[18]  Stephen P. Boyd,et al.  Robust Fisher Discriminant Analysis , 2005, NIPS.

[19]  Feiping Nie,et al.  A Non-Greedy Algorithm for L1-Norm LDA , 2017, IEEE Transactions on Image Processing.

[20]  Terence Sim,et al.  The CMU Pose, Illumination, and Expression Database , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Yuan-Hai Shao,et al.  Robust L1-norm two-dimensional linear discriminant analysis , 2015, Neural Networks.

[22]  Jaegul Choo,et al.  Hierarchical Linear Discriminant Analysis for Beamforming , 2009, SDM.

[23]  Rong Wang,et al.  Submanifold-Preserving Discriminant Analysis With an Auto-Optimized Graph , 2020, IEEE Transactions on Cybernetics.

[24]  Feiping Nie,et al.  A New Formulation of Linear Discriminant Analysis for Robust Dimensionality Reduction , 2019, IEEE Transactions on Knowledge and Data Engineering.

[25]  Daoqiang Zhang,et al.  Efficient and robust feature extraction by maximum margin criterion , 2003, IEEE Transactions on Neural Networks.

[26]  Delin Chu,et al.  A new and fast implementation for null space based linear discriminant analysis , 2010, Pattern Recognit..

[27]  Takeo Kanade,et al.  Robust L/sub 1/ norm factorization in the presence of outliers and missing data by alternative convex programming , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  K. Joossens Robust discriminant analysis , 2006 .

[29]  Feiping Nie,et al.  Trace Ratio Problem Revisited , 2009, IEEE Transactions on Neural Networks.

[30]  Jieping Ye,et al.  Multi-class Discriminant Kernel Learning via Convex Programming , 2008, J. Mach. Learn. Res..

[31]  Jieping Ye,et al.  Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis , 2006, J. Mach. Learn. Res..

[32]  Feiping Nie,et al.  Optimal Mean Robust Principal Component Analysis , 2014, ICML.

[33]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.