Comparison of support vector machine classification to partial least squares dimension reduction with logistic descrimination of hyperspectral data

Remote sensing technologies with high spatial and spectral resolution show a great deal of promise in addressing critical environmental monitoring issues, but the ability to analyze and interpret the data lags behind the technology. Robust analytical methods are required before the wealth of data available through remote sensing can be applied to a wide range of environmental problems for which remote detection is the best method. In this study we compare the classification effectiveness of two relatively new techniques on data consisting of leaf-level reflectance from plants that have been exposed to varying levels of heavy metal toxicity. If these methodologies work well on leaf-level data, then there is some hope that they will also work well on data from airborne and space-borne platforms. The classification methods compared were support vector machine classification of exposed and non-exposed plants based on the reflectance data, and partial east squares compression of the reflectance data followed by classification using logistic discrimination (PLS/LD). PLS/LD was performed in two ways. We used the continuous concentration data as the response during compression, and then used the binary response required during logistic discrimination. We also used a binary response during compression followed by logistic discrimination. The statistics we used to compare the effectiveness of the methodologies was the leave-one-out cross validation estimate of the prediction error.

[1]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[2]  G. Carter Ratios of leaf reflectances in narrow wavebands as indicators of plant stress , 1994 .

[3]  A. Gitelson,et al.  Signature Analysis of Leaf Reflectance Spectra: Algorithm Development for Remote Sensing of Chlorophyll , 1996 .

[4]  G. Smith,et al.  Food research and data analysis , 1983 .

[5]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[6]  B. Rock,et al.  Comparison of in situ and airborne spectral measurements of the blue shift associated with forest decline , 1988 .

[7]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[8]  G. Suter,et al.  Toxicological Benchmarks for Screening Potential Contaminants of Concern for Effects on Terrestrial Plants , 1993 .

[9]  Bernhard Schölkopf,et al.  Support vector learning , 1997 .

[10]  Herman Wold,et al.  Soft modelling: The Basic Design and Some Extensions , 1982 .

[11]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[12]  D. M. Moss,et al.  Red edge spectral measurements from sugar maple leaves , 1993 .

[13]  Danh V. Nguyen,et al.  Tumor classification by partial least squares using microarray gene expression data , 2002, Bioinform..

[14]  Alan J. M. Baker,et al.  TERRESTRIAL HIGHER PLANTS WHICH HYPERACCUMULATE METALLIC ELEMENTS. A REVIEW OF THEIR DISTRIBUTION, ECOLOGY AND PHYTOCHEMISTRY , 1989 .

[15]  Curran Pj Estimating green LAI from multispectral aerial photography , 1983 .