The analysis longitudinal binary data.

Longitudinal data modelling is complicated by the necessity to deal appropriately with the correlation between observations made on the same individual. A thorough examination of popular approaches to longitudinal analysis establishes the essential features of an effective longitudinal model. Building upon an earlier non-robust version proposed by Heagerty [20], our robust marginally specified generalized linear mixed model (ROBMS-GLMM) is successful in exhibiting such features. This type of model is one of the first to allow both population-averaged and individual specific inference. As well, this type of model adopts the flexibility and interpretability of generalized linear models for introducing dependence, but builds regression structure for the marginal mean, allowing valid application with time-independent and time-dependent covariates. These new estimators are obtained as solutions of a robustified likelihood equation involving Huber's least favorable distribution and a collection of weights. Huber's least favorable distribution produces estimates which are resistant to deviations from the random effects distributional assumptions. Innovative weighting strategies enable the ROBMS-GLMM to perform well when faced with outlying observations both in the response and covariates. A simulation study allows us to investigate the sampling properties of the ROBMS-GLMM estimates. We illustrate the methodology with an analysis of a prospective longitudinal study of laryngoscopic endotracheal intubation, a skill which numerous health care professionals are expected to acquire. We also look at data collected on pregnancies and births in Nova Scotia with interest in the smoking habits of the expectant mothers. Psychiatric data concerning an anti-depression drug is also used for demonstrative purposes. The principal goal of our research is to achieve robust inference in longitudinal analyses. Robust model testing strategies and asymptotics properties of the ROBMS-GLMMs are also of interest. A concurrent goal is to investigate and potentially alleviate some of the difficulties with current model fitting software.

[1]  S. P. Pederson,et al.  On Robustness in the Logistic Regression Model , 1993 .

[2]  E. Ronchetti,et al.  Robust Estimators for Simultaneous Equations Models , 1997 .

[3]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[4]  N. Laird,et al.  A likelihood-based method for analysing longitudinal binary responses , 1993 .

[5]  Ke-Hai Yuan,et al.  Asymptotics of Estimating Equations under Natural Conditions , 1998 .

[6]  J. Hebel,et al.  Dose-response of birth weight to various measures of maternal smoking during pregnancy. , 1988, Journal of clinical epidemiology.

[7]  Elvezio Ronchetti,et al.  Robustness Aspects of Model Choice , 1997 .

[8]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[9]  J. Kalbfleisch,et al.  A Comparison of Cluster-Specific and Population-Averaged Approaches for Analyzing Correlated Binary Data , 1991 .

[10]  P. McCullagh Quasi-Likelihood Functions , 1983 .

[11]  P. Heagerty,et al.  Misspecified maximum likelihood estimates and generalised linear mixed models , 2001 .

[12]  S. Morgenthaler Least-Absolute-Deviations Fits for Generalized Linear Models , 1992 .

[13]  L. Dodds,et al.  Prevalence of smoking among pregnant women in Nova Scotia from 1988 to 1992. , 1995, CMAJ : Canadian Medical Association journal = journal de l'Association medicale canadienne.

[14]  R. Huggins,et al.  A Robust Approach to the Analysis of Repeated Measures , 1993 .

[15]  R. Prentice,et al.  Correlated binary regression with covariates specific to each binary observation. , 1988, Biometrics.

[16]  P. Heagerty Marginally Specified Logistic‐Normal Models for Longitudinal Binary Data , 1999, Biometrics.

[17]  Fred T. Krogh,et al.  Algorithm 699: a new representation of Patterson's quadrature formulae , 1991, TOMS.

[18]  A. Welsh,et al.  Robust Restricted Maximum Likelihood in Mixed Linear Models , 1995 .

[19]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[20]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[21]  J S Preisser,et al.  Robust Regression for Clustered Data with Application to Binary Responses , 1999, Biometrics.

[22]  T. Patterson,et al.  The optimum addition of points to quadrature formulae. , 1968 .

[23]  Roy E. Welsch,et al.  Handouts for the Instructional Meeting on "Robust Statistical Methods" , 1982 .

[24]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[25]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[26]  A. Agresti,et al.  Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical Responses , 1994 .

[27]  A. Azzalini Logistic regression for autocorrelated data with application to repeated measures , 1994 .

[28]  Scott L. Zeger,et al.  Marginalized Multilevel Models and Likelihood Inference , 2000 .

[29]  Scott L. Zeger,et al.  Generalized linear models with random e ects: a Gibbs sampling approach , 1991 .

[30]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[31]  J. Kalbfleisch,et al.  The effects of mixture distribution misspecification when fitting mixed-effects logistic models , 1992 .

[32]  J. Kalbfleisch,et al.  Between- and within-cluster covariate effects in the analysis of clustered data. , 1998, Biometrics.

[33]  C. McCulloch Maximum Likelihood Algorithms for Generalized Linear Mixed Models , 1997 .

[34]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[35]  Stuart R. Lipsitz,et al.  Review of Software to Fit Generalized Estimating Equation Regression Models , 1999 .

[36]  J. Dale Global cross-ratio models for bivariate, discrete, ordered responses. , 1986, Biometrics.

[37]  J. Booth,et al.  Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm , 1999 .

[38]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[39]  F. Hampel The Influence Curve and Its Role in Robust Estimation , 1974 .

[40]  David R. Cox,et al.  Role of Models in Statistical Analysis , 1990 .

[41]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .