Nonlinear Discriminant Analysis Using Kernel Functions

Fishers linear discriminant analysis (LDA) is a classical multivariate technique both for dimension reduction and classification. The data vectors are transformed into a low dimensional subspace such that the class centroids are spread out as much as possible. In this subspace LDA works as a simple prototype classifier with linear decision boundaries. However, in many applications the linear boundaries do not adequately separate the classes. We present a nonlinear generalization of discriminant analysis that uses the kernel trick of representing dot products by kernel functions. The presented algorithm allows a simple formulation of the EM-algorithm in terms of kernel functions which leads to a unique concept for unsupervised mixture analysis, supervised discriminant analysis and semi-supervised discriminant analysis with partially unlabelled observations in feature spaces.

[1]  R. Tibshirani,et al.  Discriminant Analysis by Gaussian Mixtures , 1996 .

[2]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[3]  Alexander Gammerman,et al.  Ridge Regression Learning Algorithm in Dual Variables , 1998, ICML.

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[6]  Nello Cristianini,et al.  Advances in Kernel Methods - Support Vector Learning , 1999 .

[7]  Bernard D. Flury,et al.  Why Multivariate Statistics , 1997 .

[8]  R. Tibshirani,et al.  Penalized Discriminant Analysis , 1995 .

[9]  R. Tibshirani,et al.  Flexible Discriminant Analysis by Optimal Scoring , 1994 .

[10]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[11]  Bernhard Schölkopf,et al.  Support vector learning , 1997 .

[12]  Volker Roth,et al.  Pattern Recognition Combining Feature- and Pixel-Based Classification Within a Real World Application , 1999, DAGM-Symposium.

[13]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[14]  R. Stephenson A and V , 1962, The British journal of ophthalmology.