Fast estimation algorithm for likelihood-based analysis of repeated categorical responses

Likelihood-based marginal regression modelling for repeated, or otherwise clustered, categorical responses is computationally demanding. This is because the number of measures needed to describe the associations within a cluster increase geometrically with increasing cluster size. The proposed estimation methods typically describe the associations using odds ratios, which result in computationally unfeasible solutions for large cluster sizes. An alternative method for joint modelling of the regression, association, and dropout mechanism for clustered categorical responses is presented. The joint distribution of a multivariate categorical response is described by utilizing the mean parameterization, which facilitates maximum likelihood estimation in two important respects. The models are illustrated by analyses of the presence and absence of schizophrenia symptoms on 86 patients at 12 repeated time-points, and a survey of opinions of 607 adults regarding government spending on nine different targets, measured on a common 3-level ordinal scale. Free software is available.

[1]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[2]  M. Haber Maximum likelihood methods for linear and log-linear models in categorical data , 1985 .

[3]  W. Deming,et al.  On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known , 1940 .

[4]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[5]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[6]  A. Agresti,et al.  Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical Responses , 1994 .

[7]  Jukka Jokinen,et al.  Combining regression and association modelling for longitudinal data on bacterial carriage. , 2002, Statistics in medicine.

[8]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[9]  Joseph B. Lang,et al.  Multinomial-Poisson homogeneous models for contingency tables , 2003 .

[10]  C. Borror Generalized Linear Models and Extensions, Second Edition , 2008 .

[11]  P W Smith,et al.  Association models for a multivariate binary response. , 2000, Biometrics.

[12]  N. Laird,et al.  A likelihood-based method for analysing longitudinal binary responses , 1993 .

[13]  Christian Heumann Marginal regression modeling of correlated multicategorical response: A likelihood approach , 1996 .

[14]  Geert Verbeke,et al.  MEANINGFUL STATISTICAL MODEL FORMULATIONS FOR REPEATED MEASURES , 2004 .

[15]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[16]  Jukka Jokinen,et al.  Meaningful Regression and Association Models for Clustered Ordinal Data , 2006 .

[17]  P. McCullagh,et al.  Multivariate Logistic Models , 1995 .

[18]  P. J. Lindsey,et al.  Multivariate distributions with correlation matrices for nonlinear repeated measurements , 2006, Comput. Stat. Data Anal..

[19]  J. Dale Global cross-ratio models for bivariate, discrete, ordered responses. , 1986, Biometrics.

[20]  A. Agresti,et al.  Modeling Clustered Ordered Categorical Data: A Survey , 2001 .

[21]  Geert Molenberghs,et al.  Models for the association between ordinal variables , 1998 .

[22]  C. Kastner,et al.  MAREG and WinMAREG A tool for marginal regression models , 1997 .

[23]  Gunnar Rosenqvist,et al.  Statistics, Econometrics and society: Essays in Honour of Leif Nordberg , 2003 .

[24]  P. Diggle Analysis of Longitudinal Data , 1995 .

[25]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[26]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[27]  Jukka Jokinen,et al.  Joint Regression and Association Modeling of Longitudinal Ordinal Data , 2003, Biometrics.

[28]  G. Molenberghs,et al.  Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution , 1994 .

[29]  Jozef L. Teugels,et al.  Algebraic Descriptions of Nominal Multivariate Discrete Data , 1998 .

[30]  R. Plackett A Class of Bivariate Distributions , 1965 .

[31]  John W. McDonald,et al.  Marginal regression analysis of a multivariate binary response , 1995 .

[32]  G. Glonek A class of regression models for multivariate categorical responses , 1996 .