Variable selection for discriminant analysis of fish sounds using matrix correlations

Discriminant analysis is a widely used multivariate technique. In some applications the number of variables available is very large and, as with other multivariate techniques, it is desirable to simplify matters by selecting a subset of the variables in such a way that little useful information is lost in doing so. Many methods have been suggested for variable selection in discriminant analysis; this article introduces a new one, based on matrix correlation, an idea that has proved useful in the context of principal component analysis. The method is illustrated on an example involving fish sounds. It is important to discriminate between the sounds made by different species of fish, and even by individual fish, but the nature of the data is such that many potential variables are available.

[1]  I. Jolliffe,et al.  A simulation study of the use of principal components in linear discriminant analysis , 1996 .

[2]  N. Campbell,et al.  Variable selection techniques in discriminant analysis: I. Description , 1982 .

[3]  Anthony D. Hawkins,et al.  LOCATING SPAWNING HADDOCK BY MEANS OF SOUND , 2002 .

[4]  Geert Molenberghs,et al.  Regression modelling of weighted κ by using generalized estimating equations , 2000 .

[5]  Ian T. Jolliffe,et al.  Variable selection and the interpretation of principal subspaces , 2001 .

[6]  Comparison of two leading multivariate techniques in terms of variable selection for linear discriminant analysis , 2001 .

[7]  B. Silverman,et al.  The Stationary Wavelet Transform and some Statistical Applications , 1995 .

[8]  M. R. Mickey,et al.  Estimation of Error Rates in Discriminant Analysis , 1968 .

[9]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[10]  J. Orestes Cerdeira,et al.  Computational aspects of algorithms for variable selection in the context of principal components , 2004, Comput. Stat. Data Anal..

[11]  T. Sapatinas,et al.  Wavelet Analysis and its Statistical Applications , 2000 .

[12]  A. D. Hawkins,et al.  The calls of gadoid fish , 1978, Journal of the Marine Biological Association of the United Kingdom.

[13]  N. Campbell,et al.  Variable selection techniques in discriminant analysis: II. Allocation , 1982 .

[14]  Hong-Ye Gao,et al.  Applied wavelet analysis with S-plus , 1996 .

[15]  I. Jolliffe Principal Component Analysis , 2002 .

[16]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[17]  Geoffrey J. McLachlan,et al.  Discriminant Analysis and Statistical Pattern Recognition: McLachlan/Discriminant Analysis & Pattern Recog , 2005 .

[18]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[19]  J. Friedman Regularized Discriminant Analysis , 1989 .

[20]  Ramanathan Gnanadesikan,et al.  Methods for statistical data analysis of multivariate observations , 1977, A Wiley publication in applied statistics.

[21]  W. R. Dillon,et al.  On the Use of Component Scores in the Presence of Group Structure , 1989 .

[22]  Ramanathan Gnanadesikan Methods for Statistical Data Analysis of Multivariate Observations: Gnanadesikan/Methods , 1997 .

[23]  Graham W. Horgan,et al.  The statistical analysis of plant part appearance — a review , 2001 .

[24]  Tom Fearn,et al.  Discrimination with Many Variables , 1999 .