An EM Algorithm Fitting First-Order Conditional Autoregressive Models to Longitudinal Data

Abstract An EM algorithm fits a state-space formulation of the longitudinal regression model in which a continuous response depends on the lagged response and both time-dependent and time-independent covariates. The baseline response depends only on covariates. The model handles both missing data and Gaussian measurement error on both response and continuous covariates. The E step uses the Kalman filter and associated filtering algorithms to update the unknown true response and predictor series for the observed data. The M step uses standard closed-form Gaussian results. Standard errors come from the supplemented EM (SEM) algorithm. The model accurately fits 6 years of pulmonary function measurements on 158 children with many missing observations.

[1]  K Y Liang,et al.  An overview of methods for the analysis of longitudinal data. , 1992, Statistics in medicine.

[2]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[3]  S. R. Searle,et al.  Matrix Algebra Useful for Statistics , 1982 .

[4]  Nozer D. Singpurwalla,et al.  Understanding the Kalman Filter , 1983 .

[5]  F. Speizer,et al.  The use of an autoregressive model for the analysis of longitudinal data in epidemiologic studies. , 1985, Statistics in medicine.

[6]  R. Kohn,et al.  A geometrical derivation of the fixed interval smoothing algorithm , 1982 .

[7]  R. E. Kalman,et al.  New Results in Linear Filtering and Prediction Theory , 1961 .

[8]  R. E. Kalman,et al.  A New Approach to Linear Filtering and Prediction Problems , 2002 .

[9]  R. Shumway,et al.  AN APPROACH TO TIME SERIES SMOOTHING AND FORECASTING USING THE EM ALGORITHM , 1982 .

[10]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[11]  S. Redline,et al.  The relationship between longitudinal change in pulmonary function and nonspecific airway responsiveness in children and young adults. , 1989, The American review of respiratory disease.

[12]  Cheng Hsiao,et al.  Formulation and estimation of dynamic models using panel data , 1982 .

[13]  L L Kupper,et al.  Effects of the use of unreliable surrogate variables on the validity of epidemiologic research studies. , 1984, American journal of epidemiology.

[14]  R. Jennrich,et al.  Unbalanced repeated-measures models with structured covariance matrices. , 1986, Biometrics.

[15]  M. Segal,et al.  A parametric family of correlation structures for the analysis of longitudinal data. , 1992, Biometrics.

[16]  Richard H. Jones Longitudinal data with serial correlation , 1993 .

[17]  David J. Hand,et al.  Analysis of Repeated Measures , 1990 .

[18]  Richard H. Jones,et al.  Maximum Likelihood Fitting of ARMA Models to Time Series With Missing Observations , 1980 .

[19]  Robert Kohn,et al.  On the estimation of ARIMA Models with Missing Values , 1984 .

[20]  B Rosner,et al.  Autoregressive modelling for the analysis of longitudinal data with unequally spaced examinations. , 1988, Statistics in medicine.

[21]  Christopher H. Schmid,et al.  Incorporating measurement error in the estimation of autoregressive models for longitudinal data , 1994 .

[22]  Piet de Jong,et al.  Covariances for smoothed estimates in state space models , 1988 .

[23]  S. Redline,et al.  Longitudinal variability in airway responsiveness in a population-based sample of children and young adults. Intrinsic and extrinsic contributing factors. , 1989, The American review of respiratory disease.

[24]  B Rosner,et al.  Effect of parental cigarette smoking on the pulmonary function of children. , 1979, American journal of epidemiology.

[25]  Andrew P. Sage,et al.  Estimation theory with applications to communications and control , 1979 .

[26]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[27]  B Rosner,et al.  A Bayesian approach to logistic regression models having measurement error following a mixture distribution. , 1993, Statistics in medicine.

[28]  Xiao-Li Meng,et al.  Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm , 1991 .