Smoothing spline models for the analysis of nested and crossed samples of curves

Abstract We introduce a class of models for an additive decomposition of groups of curves stratified by crossed and nested factors, generalizing smoothing splines to such samples by associating them with a corresponding mixed-effects model. The models are also useful for imputation of missing data and exploratory analysis of variance. We prove that the best linear unbiased predictors (BLUPs) from the extended mixed-effects model correspond to solutions of a generalized penalized regression where smoothing parameters are directly related to variance components, and we show that these solutions are natural cubic splines. The model parameters are estimated using a highly efficient implementation of the EM algorithm for restricted maximum likelihood (REML) estimation based on a preliminary eigenvector decomposition. Variability of computed estimates can be assessed with asymptotic techniques or with a novel hierarchical bootstrap resampling scheme for nested mixed-effects models. Our methods are applied to me...

[1]  H. Scheffé,et al.  The Analysis of Variance , 1960 .

[2]  R. Potthoff,et al.  A generalized multivariate analysis of variance model useful especially for growth curve problems , 1964 .

[3]  Calyampudi R. Rao,et al.  The theory of least squares when the parameters are stochastic and its application to the analysis of growth curves. , 1965, Biometrika.

[4]  S. R. Searle Large Sample Variances of Maximum Likelihood Estimators of Variance Components , 1968 .

[5]  G. Wahba,et al.  A Correspondence Between Bayesian Estimation on Stochastic Processes and Smoothing by Splines , 1970 .

[6]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[7]  C. Reinsch,et al.  Oscillation matrices with spline smoothing , 1975 .

[8]  J. Miller,et al.  Asymptotic Properties of Maximum Likelihood Estimates in the Mixed Model of the Analysis of Variance , 1977 .

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[10]  G. Wahba Improper Priors, Spline Smoothing and the Problem of Guarding Against Model Errors in Regression , 1978 .

[11]  D. Freedman,et al.  Bootstrapping a Regression Equation: Some Empirical Results , 1984 .

[12]  B. Silverman,et al.  Some Aspects of the Spline Smoothing Approach to Non‐Parametric Regression Curve Fitting , 1985 .

[13]  G. Wahba A Comparison of GCV and GML for Choosing the Smoothing Parameter in the Generalized Spline Smoothing Problem , 1985 .

[14]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[15]  T. Louis,et al.  Empirical Bayes Confidence Intervals Based on Bootstrap Samples , 1987 .

[16]  Tj Sweeting Invited discussion of G. Wahba: Partial and interaction spline models , 1988 .

[17]  M. C. Jones,et al.  Spline Smoothing and Nonparametric Regression. , 1989 .

[18]  G. Wahba Spline models for observational data , 1990 .

[19]  E A Thompson,et al.  Pedigree analysis for quantitative traits: variance components without matrix inversion. , 1990, Biometrics.

[20]  J. Ramsay,et al.  Some Tools for Functional Data Analysis , 1991 .

[21]  G. Robinson That BLUP is a Good Thing: The Estimation of Random Effects , 1991 .

[22]  Andrew L. Rukhin,et al.  Tools for statistical inference , 1991 .

[23]  J. Overstreet,et al.  Relationship of serum estradiol and progesterone concentrations to the excretion profiles of their major urinary metabolites as measured by enzyme immunoassay and radioimmunoassay. , 1991, Clinical chemistry.

[24]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[25]  Terry Speed,et al.  [That BLUP is a Good Thing: The Estimation of Random Effects]: Comment , 1991 .

[26]  Martin Abba Tanner,et al.  Tools for Statistical Inference: Observed Data and Data Augmentation Methods , 1993 .

[27]  R. Tibshirani,et al.  An introduction to the bootstrap , 1993 .

[28]  B. Silverman,et al.  Nonparametric regression and generalized linear models , 1994 .

[29]  R H Jones,et al.  Smoothing splines for longitudinal data. , 1995, Statistics in medicine.

[30]  Daniel Barry An Empirical Bayes Approach to Growth Curve Analysis , 1996 .

[31]  D D Baird,et al.  Preimplantation hormonal differences between the conception and non-conception menstrual cycles of 32 normal women. , 1997, Human reproduction.

[32]  S. Yen,et al.  Reproductive Endocrinology: Physiology, Pathophysiology, and Clinical Management , 1999 .