Nonparametric Mixed Effects Models for Unequally Sampled Noisy Curves

Summary. We propose a method of analyzing collections of related curves in which the individual curves are modeled as spline functions with random coefficients. The method is applicable when the individual curves are sampled at variable and irregularly spaced points. This produces a low‐rank, low‐frequency approximation to the covariance structure, which can be estimated naturally by the EM algorithm. Smooth curves for individual trajectories are constructed as best linear unbiased predictor (BLUP) estimates, combining data from that individual and the entire collection. This framework leads naturally to methods for examining the effects of covariates on the shapes of the curves. We use model selection techniques—Akaike information criterion (AIC), Bayesian information criterion (BIC), and cross‐validation—to select the number of breakpoints for the spline approximation. We believe that the methodology we propose provides a simple, flexible, and computationally efficient means of functional data analysis.

[1]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[2]  H. Müller,et al.  Nonparametric Regression Analysis of Growth Curves , 1984 .

[3]  J. Phair,et al.  The Multicenter AIDS Cohort Study: rationale, organization, and selected characteristics of the participants. , 1987, American journal of epidemiology.

[4]  H. Müller Nonparametric regression analysis of longitudinal data , 1988 .

[5]  Richard A. Olshen,et al.  Gait Analysis and the Bootstrap , 1989 .

[6]  Robin Thompson,et al.  [That BLUP is a Good Thing: The Estimation of Random Effects]: Comment , 1991 .

[7]  G. Robinson That BLUP is a Good Thing: The Estimation of Random Effects , 1991 .

[8]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[9]  Richard H. Jones,et al.  Longitudinal Data with Serial Correlation : A State-Space Approach , 1994 .

[10]  W. D. Ray,et al.  Longitudinal Data with Serial Correlation: A State Space Approach. , 1994 .

[11]  P. Diggle,et al.  Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. , 1994, Biometrics.

[12]  P. Diggle,et al.  Analysis of Longitudinal Data , 2003 .

[13]  R. Kohn,et al.  Nonparametric regression using Bayesian variable selection , 1996 .

[14]  E. Vonesh,et al.  Linear and Nonlinear Models for the Analysis of Repeated Measurements , 1996 .

[15]  Young K. Truong,et al.  Polynomial splines and their tensor products in extended linear modeling: 1994 Wald memorial lecture , 1997 .

[16]  Xihong Lin Variance component testing in generalised linear models with random effects , 1997 .

[17]  Philippe Besse,et al.  Simultaneous non-parametric regressions of unbalanced longitudinal data , 1997 .

[18]  Simultaneous non-parametric regressions ofunbalanced longitudinal , 1997 .

[19]  J. Rice,et al.  Smoothing spline models for the analysis of nested and crossed samples of curves , 1998 .

[20]  P J Diggle,et al.  Nonparametric estimation of covariance structure in longitudinal data. , 1998, Biometrics.

[21]  Li Ping Yang,et al.  Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data , 1998 .