Dimension Reduction by Maximizing Pairwise Discriminations

Dimension reduction is an important pre-processing technique for high-dimensional data analysis. In this paper, we consider the process of linear dimension reduction (LDR) in multiclass problems. We propose a novel feature extraction method based on Minimax Probability Machine (MPM), named MPMbased Dimension Reduction (DR-MPM). Its objective naturally combines the discriminative information of all the class pairs and each pair of classes is well separated in the projected subspace. The algorithm is robust in the sense that it is insensitive to 'outlier' classes which lie far away from other classes. We evaluate DR-MPM on a number of synthetic and real-world data sets, and show that it outperforms other state-of-art feature extraction methods in terms of visual intuition and classification accuracy, especially when the distances between classes are unevenly distributed.

[1]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[2]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[3]  Tong Heng Lee,et al.  Face recognition using recursive Fisher linear discriminant , 2006, IEEE Transactions on Image Processing.

[4]  James M. Rehg,et al.  Fast Asymmetric Learning for Cascade Face Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Michael I. Jordan,et al.  Robust Novelty Detection with Single-Class MPM , 2002, NIPS.

[6]  Ioana Popescu,et al.  Optimal Inequalities in Probability Theory: A Convex Optimization Approach , 2005, SIAM J. Optim..

[7]  Michael I. Jordan,et al.  A Robust Minimax Approach to Classification , 2003, J. Mach. Learn. Res..

[8]  C. R. Rao,et al.  The Utilization of Multiple Measurements in Problems of Biological Classification , 1948 .

[9]  Kimberly A. Dukes Gram–Schmidt Process† , 2014 .

[10]  Kilian Q. Weinberger,et al.  Gradient boosted feature selection , 2014, KDD.

[11]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[12]  Xuelong Li,et al.  Supervised Tensor Learning , 2005, ICDM.

[13]  R. Tibshirani,et al.  Penalized Discriminant Analysis , 1995 .

[14]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[15]  Robert P. W. Duin,et al.  Expected classification error of the Fisher linear classifier with pseudo-inverse covariance matrix , 1998, Pattern Recognit. Lett..

[16]  Lianwen Jin,et al.  A New Facial Expression Recognition Method Based on Local Gabor Filter Bank and PCA plus LDA , 2006 .

[17]  Kilian Q. Weinberger,et al.  Transductive Minimax Probability Machine , 2014, ECML/PKDD.

[18]  Jieping Ye,et al.  Characterization of a Family of Algorithms for Generalized Discriminant Analysis on Undersampled Problems , 2005, J. Mach. Learn. Res..

[19]  Xuesong Lu,et al.  Fisher Discriminant Analysis With L1-Norm , 2014, IEEE Transactions on Cybernetics.