Multiclass linear dimension reduction via a generalized Chernoff bound

In this paper, we consider the problem of linear dimension reduction (LDR) for multiclass classification. Often, a linear projection in which classes are separable may exist, but is hard to find. In the absence of methods that can find such plane, one may unnecessarily resort to nonlinear dimension reduction (DR). Generalization of two-class separation criteria such as Mahalanobis, Bhattacharya, or Chernoff distance are often done in an ad-hoc fashion. In this paper, we propose two algorithms for multiclass LDR that aim at minimizing upper bounds on the probability of misclassification and are based on generalizations of Chernoff distance for the multiclass problem. We present a numerical study and comparison to state-of-the-art LDR methods on datasets from the UCI machine learning repository. We show that our algorithms result in lower classification error rates compared to techniques of the same class.

[1]  Kari Torkkola,et al.  Feature Extraction by Non-Parametric Mutual Information Maximization , 2003, J. Mach. Learn. Res..

[2]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[3]  Zoran Nenadic,et al.  Information Discriminant Analysis: Feature Extraction with an Information-Theoretic Objective , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Martin E. Hellman,et al.  Probability of error, equivocation, and the Chernoff bound , 1970, IEEE Trans. Inf. Theory.

[5]  Chia-Wei Hsu,et al.  A Linear Feature Extraction for Multiclass Classification Problems Based on Class Mean and Covariance Discriminant Information , 2006, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Lawrence K. Saul,et al.  Analysis and extension of spectral methods for nonlinear dimensionality reduction , 2005, ICML.

[7]  Kuldip K. Paliwal,et al.  A Modified Minimum Classification Error (MCE) Training Algorithm for Dimensionality Reduction , 2002, J. VLSI Signal Process..

[8]  Luis Rueda,et al.  A New Approach to Multi-class Linear Dimensionality Reduction , 2006, CIARP.

[9]  Geoffrey E. Hinton,et al.  Neighbourhood Components Analysis , 2004, NIPS.

[10]  Robert P. W. Duin,et al.  Linear dimensionality reduction via a heteroscedastic extension of LDA: the Chernoff criterion , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Luis Rueda,et al.  A New Linear Dimensionality Reduction Technique Based on Chernoff Distance , 2006, IBERAMIA-SBIA.