Statistical Discriminant Analysis

Canonical discriminant analysis (CDA) and linear discriminant analysis (LDA) are popular classification techniques. Likewise, practitioners, who are familiar with regularized discriminant analysis (RDA), soft modeling by class analogy (SIMCA), principal component analysis (PCA), and partial least squares (PLS) will often use them to perform classification. In this chapter, we will attempt to make some sense out of all of this. We will explain when CDA and LDA are the same and when they are not the same. We will also discuss the relative merits of the various stabilization and dimension reducing methods used, focusing on RDA for numerical stabilization of the inverse of the covariance matrix and PCA and PLS as part of a two-step process for classification when dimensionality reduction is an issue.

[1]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[2]  Trygve Almøy,et al.  ST‐PLS: a multi‐directional nearest shrunken centroid type classifier via PLS , 2008 .

[3]  Charles D. Smith,et al.  Using OrPLS to identify asymptomatic women at risk for Alzheimer's disease , 2008, Journal of chemotherapy.

[4]  William L Grogan,et al.  A new American genus of predaceous midges related to Palpomyia and Bezzia (Diptera: Ceratopogonidae) , 1981 .

[5]  Romà Tauler,et al.  Quality assessment of the results obtained by multivariate curve resolution analysis of multiple runs of gasoline blending processes , 2006 .

[6]  B. Walczak,et al.  About kernel latent variable approaches and SVM , 2005 .

[7]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[8]  Peter C. Jurs,et al.  Pattern recognition studies of complex chromatographic data sets , 1986 .

[9]  M. Bartlett Further aspects of the theory of multiple regression , 1938, Mathematical Proceedings of the Cambridge Philosophical Society.

[10]  S. Wold,et al.  Multivariate Data Analysis in Chemistry , 1984 .

[11]  S. Wold,et al.  SIMCA: A Method for Analyzing Chemical Data in Terms of Similarity and Analogy , 1977 .

[12]  Douglas R. Henry,et al.  Pattern Recognition Studies of Complex Chromatographic Data Sets. , 1986, Journal of research of the National Bureau of Standards.

[13]  Jerome H. Friedman,et al.  Classification: Oldtimers and newcomers , 1989 .

[14]  Barry K. Lavine,et al.  Source identification of underground fuel spills by pattern recognition analysis of high-speed gas chromatograms , 1995 .

[15]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[16]  Erik Johansson,et al.  Four levels of pattern recognition , 1978 .

[17]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[18]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[19]  G Blomquist,et al.  Classification of fungi by means of pyrolysis-gas chromatography-pattern recognition. , 1979, Journal of Chromatography A.

[20]  M. Rantalainen,et al.  OPLS discriminant analysis: combining the strengths of PLS‐DA and SIMCA classification , 2006 .

[21]  Bruce R. Kowalski,et al.  Chemometrics, mathematics and statistics in chemistry , 1984 .

[22]  Sofía Valenzuela,et al.  Multivariate strategies for classification of Eucalyptus globulus genotypes using carbohydrates content and NIR spectra for evaluation of their cold resistance , 2008 .

[23]  William S. Rayens,et al.  PLS and dimension reduction for classification , 2007, Comput. Stat..

[24]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[25]  D. Bertrand,et al.  Application of PLS‐DA in multivariate image analysis , 2006 .

[26]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[27]  Eric R. Ziegel,et al.  Handbook of Chemometrics and Qualimetrics, Part B , 2000, Technometrics.