Multilevel models for censored and latent responses

Multilevel models were originally developed to allow linear regression or ANOVA models to be applied to observations that are not mutually independent. This lack of independence commonly arises from the clustering of units of observation into 'higher-level units', such as patients in hospitals. In linear mixed models, the within-cluster correlations are modelled by including random effects in a linear model. In this paper, we discuss generalizations of linear mixed models suitable for responses subject to systematic and random measurement error and to interval censoring. The first example uses data from two cross-sectional surveys of schoolchildren to investigate risk factors for early first experimentation with cigarettes. Here the recalled times of the children's first cigarette are likely to be subject to both systematic and random measurement errors, as well as being interval censored. We describe multilevel models for interval-censored survival times as special cases of generalized linear mixed models and discuss methods of estimating systematic recall bias. The second example is a longitudinal study of mental health problems of patients nested in clinics. Here the outcome is measured by multiple questionnaires, allowing the measurement errors to be modelled within a linear latent growth curve model. The resulting model is a multilevel structural equation model. We briefly discuss such models both as extensions of linear mixed models and as extensions of structural equation models. Several different model structures are examined. An important goal of the paper is to place a number of methods that readers may have considered distinct within a single overall modelling framework.
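
As a minimal sketch of the two building blocks mentioned above, the following writes out a random-intercept linear mixed model and the likelihood contribution of an interval-censored survival time. The notation (units i in clusters j, covariates x_ij, random intercept u_j, censoring interval (a_ij, b_ij], conditional survivor function S) is assumed here for illustration and is not taken from the paper itself.

```latex
% Random-intercept linear mixed model for unit i in cluster j (assumed notation)
\begin{align}
  y_{ij} &= \mathbf{x}_{ij}'\boldsymbol{\beta} + u_j + \varepsilon_{ij},
  \qquad u_j \sim N(0, \sigma_u^2), \quad \varepsilon_{ij} \sim N(0, \sigma_e^2), \\
  % Within-cluster (intraclass) correlation induced by the shared u_j
  \operatorname{Corr}(y_{ij}, y_{i'j} \mid \mathbf{x}_{ij}, \mathbf{x}_{i'j})
  &= \frac{\sigma_u^2}{\sigma_u^2 + \sigma_e^2}, \qquad i \neq i'.
\end{align}

% Interval-censored survival time T_{ij}, known only to lie in (a_{ij}, b_{ij}]
\begin{align}
  \Pr(a_{ij} < T_{ij} \le b_{ij} \mid u_j)
  &= S(a_{ij} \mid \mathbf{x}_{ij}, u_j) - S(b_{ij} \mid \mathbf{x}_{ij}, u_j), \\
  % Marginal likelihood for cluster j: integrate the random effect out
  L_j &= \int \prod_{i} \bigl[ S(a_{ij} \mid \mathbf{x}_{ij}, u)
        - S(b_{ij} \mid \mathbf{x}_{ij}, u) \bigr] \,
        \phi(u; 0, \sigma_u^2) \, \mathrm{d}u .
\end{align}
```

Under these assumptions, the shared random intercept is what produces the within-cluster correlation, and the integral over u generally has no closed form, so it would typically be approximated numerically, for example by Gauss-Hermite quadrature.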
