React Scatterplot Smoothers: Superefficiency through Basis Economy

Abstract REACT estimators for the mean of a linear model involve three steps: transforming the model to a canonical form that provides an economical representation of the unknown mean vector, estimating the risks of a class of candidate linear shrinkage estimators, and adaptively selecting the candidate estimator that minimizes estimated risk. Applied to one- or higher-way layouts, the REACT method generates automatic scatterplot smoothers that compete well on standard datasets with the best fits obtained by alternative techniques. Historical precursors to REACT include nested model selection, ridge regression, and nested principal component selection for the linear model. However, REACT's insistence on working with an economical basis greatly increases its superefficiency relative to the least squares fit. This reduction in risk and the possible economy of the discrete cosine basis, of the orthogonal polynomial basis, or of a smooth basis that generalizes the discrete cosine basis are illustrated by fitting scatterplots drawn from the literature. Flexible monotone shrinkage of components rather than nested 1–0 shrinkage achieves a secondary decrease in risk that is visible in these examples. Pinsker bounds on asymptotic minimax risk for the estimation problem express the remarkable role of basis economy in reducing risk.

[1]  S. Efromovich Quasi-Linear Wavelet Estimation , 1999 .

[2]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[3]  Density and Regression Smoothing , 1995 .

[4]  E. Mammen,et al.  Optimal spatial adaptation to inhomogeneous smoothness: an approach based on kernel estimates with variable bandwidth selectors , 1997 .

[5]  Brian D. Ripley,et al.  Modern Applied Statistics with S-Plus Second edition , 1997 .

[6]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[7]  Helge Toutenburg,et al.  Linear models : least squares and alternatives , 1999 .

[8]  R. Tibshirani,et al.  Linear Smoothers and Additive Models , 1989 .

[9]  William N. Venables,et al.  Modern Applied Statistics with S-Plus. , 1996 .

[10]  J. Rice Bandwidth Choice for Nonparametric Regression , 1984 .

[11]  Ker-Chau Li,et al.  Asymptotic Optimality for $C_p, C_L$, Cross-Validation and Generalized Cross-Validation: Discrete Index Set , 1987 .

[12]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[13]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[14]  F. T. Wright,et al.  Order restricted statistical inference , 1988 .

[15]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[16]  James Stephen Marron,et al.  Choosing a Kernel Regression Estimator , 1991 .

[17]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[18]  C. Stein Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution , 1956 .

[19]  R. Beran,et al.  Modulation Estimators and Confidence Sets , 1998 .

[20]  Ker-Chau Li,et al.  From Stein's Unbiased Risk Estimates to the Method of Generalized Cross Validation , 1985 .

[21]  Ker-Chau Li,et al.  The Data-Smoothing Aspect of Stein Estimates , 1984 .

[22]  C. Stein Estimation of the Mean of a Multivariate Normal Distribution , 1981 .

[23]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[24]  A. Kneip Ordered Linear Smoothers , 1994 .

[25]  C. Mallows More comments on C p , 1995 .

[26]  R. Beran Confidence sets centered at Cp-estimators , 1996 .

[27]  C. L. Mallows,et al.  Some Theory of Nonlinear Smoothers , 1980 .

[28]  B. Silverman,et al.  Some Aspects of the Spline Smoothing Approach to Non‐Parametric Regression Curve Fitting , 1985 .

[29]  S. R. Searle Linear Models , 1971 .