Cloud Classification of Satellite Radiance Data by Multicategory Support Vector Machines

Abstract Two-category support vector machines (SVMs) have become very popular in the machine learning community for classification problems and have recently been shown to have good optimality properties for classification purposes. Treating multicategory problems as a series of binary problems is common in the SVM paradigm. However, this approach may fail under a variety of circumstances. The multicategory support vector machine (MSVM), which extends the binary SVM to the multicategory case in a symmetric way, and has good theoretical properties, has recently been proposed. The proposed MSVM in addition provides a unifying framework when there are either equal or unequal misclassification costs, and when there is a possibly nonrepresentative training set. Illustrated herein is the potential of the MSVM as an efficient cloud detection and classification algorithm for use in Earth Observing System models, which require knowledge of whether or not a radiance profile is cloud free. If the profile is not clou...

[1]  G. Wahba,et al.  Smoothing spline ANOVA for exponential families, with application to the Wisconsin Epidemiological Study of Diabetic Retinopathy : the 1994 Neyman Memorial Lecture , 1995 .

[2]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[3]  Chong Gu,et al.  Structured Machine Learning for Soft Classification with Smoothing Spline ANOVA and Stacked Tuning, Testing, and Evaluation , 1993, NIPS.

[4]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[5]  Michael C. Ferris,et al.  Interfaces to PATH 3.0: Design, Implementation and Usage , 1999, Comput. Optim. Appl..

[6]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[7]  W. Paul Menzel,et al.  Cloud Properties inferred from 812-µm Data , 1994 .

[8]  J. Key,et al.  Tools for Atmospheric Radiative Transfer: Streamer and FluxNet. Revised , 1998 .

[9]  G. Wahba Support vector machines, reproducing kernel Hilbert spaces, and randomized GACV , 1999 .

[10]  G. Wahba,et al.  Multicategory Support Vector Machines , Theory , and Application to the Classification of Microarray Data and Satellite Radiance Data , 2004 .

[11]  G. Wahba Spline models for observational data , 1990 .

[12]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[14]  B. Yandell,et al.  Automatic Smoothing of Regression Functions in Generalized Linear Models , 1986 .

[15]  Yi Lin Multicategory Support Vector Machines, Theory, and Application to the Classification of . . . , 2003 .

[16]  Olvi L. Mangasarian,et al.  Nonlinear Programming , 1969 .

[17]  Yoonkyung Lee,et al.  Classification of Multiple Cancer Types by Multicategory Support Vector Machines Using Gene Expression Data , 2003, Bioinform..

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  Nello Cristianini,et al.  Advances in Kernel Methods - Support Vector Learning , 1999 .

[20]  Andrew K. Heidinger,et al.  Using MODIS to Estimate Cloud Contamination of the AVHRR Data Record , 2002 .

[21]  Yi Lin,et al.  Support Vector Machines and the Bayes Rule in Classification , 2002, Data Mining and Knowledge Discovery.

[22]  W. Paul Menzel,et al.  The MODIS cloud products: algorithms and examples from Terra , 2003, IEEE Trans. Geosci. Remote. Sens..

[23]  Grace Wahba,et al.  Soft and hard classification by reproducing kernel Hilbert space methods , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  W. Menzel,et al.  Discriminating clear sky from clouds with MODIS , 1998 .

[25]  Yi Lin,et al.  Support Vector Machines for Classification in Nonstandard Situations , 2002, Machine Learning.