Feature Extraction for One-Class Classification

Feature reduction is often an essential part of solving a classification task. One common approach for doing this, is Principal Component Analysis. There the low variance directions in the data are removed and the high variance directions are retained. It is hoped that these high variance directions contain information about the class differences. For one-class classification or novelty detection, the classification task contains one ill-determined class, for which (almost) no information is available. In this paper we show that for one-class classification, the low-variance directions are most informative, and that in the feature reduction a bias-variance trade-off has to be considered which causes that retaining the high variance directions is often not optimal.

[1]  I. Jolliffe Principal Component Analysis , 2002 .

[2]  David M. J. Tax,et al.  One-class classification , 2001 .

[3]  Tom Heskes,et al.  Bias/Variance Decompositions for Likelihood-Based Estimators , 1998, Neural Computation.

[4]  Don R. Hush,et al.  Network constraints and multi-objective optimization for one-class classification , 1996, Neural Networks.

[5]  David M. Rocke,et al.  The Distribution of Robust Distances , 2005 .

[6]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[7]  Nathalie Japkowicz,et al.  A Novelty Detection Approach to Classification , 1995, IJCAI.

[8]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[9]  Kah Kay Sung,et al.  Learning and example selection for object and pattern detection , 1995 .

[10]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[11]  Gunter Ritter,et al.  Automatic context-sensitive karyotyping of human chromosomes based on elliptically symmetric statistical distributions , 1995, Pattern Recognit..

[12]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[13]  R. F.,et al.  Mathematical Statistics , 1944, Nature.

[14]  Massimiliano Pontil,et al.  Face Detection in Still Gray Images , 2000 .

[15]  Sung-Bae Cho,et al.  Recognition of unconstrained handwritten numerals by doubly self-organizing neural network , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[16]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.