Non-iterative Heteroscedastic Linear Dimension Reduction for Two-Class Data

Linear discriminant analysis (LDA) is a traditional solution to the linear dimension reduction (LDR) problem, which is based on the maximization of the between-class scatter over the within-class scatter. This solution is incapable of dealing with heteroscedastic data in a proper way, because of the implicit assumption that the covariance matrices for all the classes are equal. Hence, discriminatory information in the difference between the covariance matrices is not used and, as a consequence, we can only reduce the data to a single dimension in the two-class case. We propose a fast non-iterative eigenvector-based LDR technique for heteroscedastic two-class data, which generalizes, and improves upon LDAb y dealing with the aforementioned problem. For this purpose, we use the concept of directed distance matrices, which generalizes the between-class covariance matrix such that it captures the differences in (co)variances.

[1]  J. Rice Mathematical Statistics and Data Analysis , 1988 .

[2]  T. W. Anderson,et al.  Classification into two Multivariate Normal Distributions with Different Covariance Matrices , 1962 .

[3]  B. Ginneken,et al.  Automatic segmentation of lung fields in chest radiographs. , 2000 .

[4]  Witold Malina,et al.  On an Extended Fisher Criterion for Feature Selection , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[6]  H. P. Decell,et al.  Linear dimension reduction and Bayes classification , 1981, Pattern Recognit..

[7]  Demetrios Kazakos On the optimal linear feature (Corresp.) , 1978, IEEE Trans. Inf. Theory.

[8]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[9]  M. Loog Approximate Pairwise Accuracy Criteria for Multiclass Linear Dimension Reduction: Generalisations of the Fisher Criterion , 1999 .

[10]  C. R. Rao,et al.  The Utilization of Multiple Measurements in Problems of Biological Classification , 1948 .

[11]  J. Wade Davis,et al.  Statistical Pattern Recognition , 2003, Technometrics.

[12]  H. P. Decell,et al.  Feature combinations and the divergence criterion , 1977 .

[13]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  W. A. Coberly,et al.  Linear dimension reduction and Bayes classification with unknown population parameters , 1982, Pattern Recognit..

[15]  Henry P. Decell,et al.  Feature combinations and the bhattacharyya criterion , 1976 .

[16]  B. M. ter Haar Romeny,et al.  Automatic segmentation of lung fields in chest radiographs. , 2000, Medical physics.

[17]  C. T. Ng,et al.  Measures of distance between probability distributions , 1989 .

[18]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[19]  C. H. Chen,et al.  On information and distance measures, error bounds, and feature selection , 1976, Information Sciences.

[20]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[21]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .