An introduction to smoothing spline ANOVA models in RKHS, with examples in geographical data, medicine, atmospheric sciences and machine learning

Abstract This paper is a brief introduction to smoothing spline ANOVA (SS-ANOVA) models in reproducing kernel Hilbert spaces (RKHS) and some of their applications. These models decompose a function of several variables as sums of functions of one variable plus sums of functions of two variables and so forth, analogous to the ordinary analysis of variance decomposition familiar to students in elementary Statistics classes. This is done in such a way that the individual terms are projections onto orthogonal subspaces in RKHS, and the relevant reproducing kernels may be found in many examples. Given the appropriate RKHS, various kinds of estimation and model fitting problems in several variables given observational data can be solved.

[1]  Yi Lin Multicategory Support Vector Machines, Theory, and Application to the Classification of . . . , 2003 .

[2]  Meta M. Voelker,et al.  Variable Selection and Model Building via Likelihood Basis Pursuit , 2004 .

[3]  G. Wahba,et al.  Smoothing spline ANOVA for exponential families, with application to the Wisconsin Epidemiological Study of Diabetic Retinopathy : the 1994 Neyman Memorial Lecture , 1995 .

[4]  G. Wahba,et al.  Smoothing Spline ANOVA for Multivariate Bernoulli Observations With Application to Ophthalmology Data , 2001 .

[5]  G. Wahba,et al.  Smoothing Spline ANOVA with Component-Wise Bayesian “Confidence Intervals” , 1993 .

[6]  G. Wahba Erratum: Spline Interpolation and Smoothing on the Sphere , 1982 .

[7]  Grace Wahba,et al.  Soft and hard classification by reproducing kernel Hilbert space methods , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Xiwu Lin Smoothing Spline Analysis Of Variance For Polychotomous Response Data , 1998 .

[9]  Yi Lin,et al.  Support Vector Machines and the Bayes Rule in Classification , 2002, Data Mining and Knowledge Discovery.

[10]  G. Wahba,et al.  Multicategory Support Vector Machines , Theory , and Application to the Classification of Microarray Data and Satellite Radiance Data , 2004 .

[11]  G. Wahba Spline models for observational data , 1990 .

[12]  G. Wahba Spline Interpolation and Smoothing on the Sphere , 1981 .

[13]  Grace Wahba,et al.  Spatial-Temporal Analysis of Temperature Using Smoothing Spline ANOVA , 1998 .

[14]  G. Wahba,et al.  Smoothing Spline ANOVA Fits for Very Large, Nearly Regular Data Sets, with Application to Historical , 1995 .

[15]  Michael C. Ferris,et al.  Variable Selection via Basis Pursuit for Non-Gaussian Data , 2001 .