Estimating equations for association structures

This paper investigates generalized estimating equations for association parameters, which are frequently of interest in family studies, with emphasis on covariance estimation. Separate link functions connect the mean, the scale, and the correlation to linear predictors that may involve different sets of covariates, and separate estimating equations are proposed for the three sets of parameters. Simulations show that the robust 'sandwich' variance estimator and the jackknife variance estimator for the correlation parameters are generally close to the empirical variance for a sample size of 50 clusters. These results contradict those of Ziegler et al. and of Kastner and Ziegler, where the 'sandwich' estimator obtained from the software MAREG was shown to be unsuitable for practical use. The problem appears to arise because the MAREG variance estimator does not account for variability in the estimation of the scale parameters, although it may be valid when the scale is fixed. We also find that the formula for the approximate jackknife variance estimator in Ziegler et al. is deficient, resulting in systematic deviations from the fully iterated jackknife variance estimator. A general jackknife formula is provided and performs well in numerical studies. Data from a study on the genetics of alcoholism are used to illustrate the importance of reliable variance estimation in biomedical applications.
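To make the two variance estimators named above concrete, the following is a minimal sketch that compares a cluster-robust 'sandwich' variance with a fully iterated delete-one-cluster jackknife variance. It is not the paper's method: it assumes an identity link, an independence working correlation, and mean parameters only, so the estimating equation reduces to ordinary least squares. The function names, the simulated data, and the jackknife scaling factor (K - 1) / K with centering at the full-sample estimate are all assumptions chosen for illustration.

```python
import numpy as np


def fit_mean(X, y):
    # Solve sum_i X_i^T (y_i - X_i beta) = 0, i.e. the independence-working
    # estimating equation with identity link (assumed for this sketch).
    return np.linalg.solve(X.T @ X, X.T @ y)


def sandwich_variance(X, y, cluster):
    # Robust variance: bread * meat * bread, with the meat built from
    # per-cluster score contributions.
    beta = fit_mean(X, y)
    resid = y - X @ beta
    bread = np.linalg.inv(X.T @ X)
    meat = np.zeros((X.shape[1], X.shape[1]))
    for c in np.unique(cluster):
        idx = cluster == c
        score = X[idx].T @ resid[idx]
        meat += np.outer(score, score)
    return bread @ meat @ bread


def jackknife_variance(X, y, cluster):
    # Fully iterated delete-one-cluster jackknife: refit without each cluster
    # and accumulate squared deviations from the full-sample estimate.
    ids = np.unique(cluster)
    K = len(ids)
    beta_full = fit_mean(X, y)
    V = np.zeros((X.shape[1], X.shape[1]))
    for c in ids:
        keep = cluster != c
        d = fit_mean(X[keep], y[keep]) - beta_full
        V += np.outer(d, d)
    return (K - 1) / K * V  # scaling convention assumed here


# Hypothetical data: 50 clusters of size 4 with a shared random intercept,
# which induces within-cluster correlation.
rng = np.random.default_rng(0)
n_clusters, size = 50, 4
cluster = np.repeat(np.arange(n_clusters), size)
x = rng.normal(size=n_clusters * size)
b = np.repeat(rng.normal(size=n_clusters), size)
y = 1.0 + 0.5 * x + b + rng.normal(size=n_clusters * size)
X = np.column_stack([np.ones_like(x), x])

V_sand = sandwich_variance(X, y, cluster)
V_jack = jackknife_variance(X, y, cluster)
print("sandwich SEs: ", np.sqrt(np.diag(V_sand)))
print("jackknife SEs:", np.sqrt(np.diag(V_jack)))
```

In this simplified setting the two sets of standard errors would be expected to be of similar size. The estimators for correlation parameters studied in the paper additionally involve the scale and correlation estimating equations, which this sketch leaves out.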

[1] J. Shults, et al. On eliminating the asymptotic bias in the quasi-least squares estimate of the correlation parameter, 1999.

[2] R. Prentice, et al. Correlated binary regression with covariates specific to each binary observation, 1988, Biometrics.

[3] C. Kastner, et al. The Generalised Estimating Equations: An Annotated Bibliography, 1998.

[4] M. C. Paik, et al. Parametric variance function estimation for nonnormal repeated measurement data, 1992, Biometrics.

[5] S. R. Lipsitz, et al. Jackknife estimators of variance for parameter estimates from estimating equations with applications to clustered survival data, 1994, Biometrics.

[6] A. Ziegler, et al. Familial associations of lipid profiles: a generalized estimating equations approach, 2000, Statistics in Medicine.

[7] S. Zeger, et al. Marginal Regression Models for Clustered Ordinal Measurements, 1996.

[8] S. Zeger, et al. Multivariate Regression Analyses for Categorical Data, 1992.

[9] Thomas A. Severini, et al. Extended Generalized Estimating Equations for Clustered Data, 1998.

[10] C. Kastner, et al. MAREG and WinMAREG: A tool for marginal regression models, 1997.

[11] J. Rice, et al. Genome-wide search for genes affecting the risk for alcohol dependence, 1998, American Journal of Medical Genetics.

[12] S. Zeger, et al. Longitudinal data analysis using generalized linear models, 1986.

[13] Myunghee C. Paik, et al. Repeated measurement analysis for nonnormal data in small samples, 1988.

[14] L. Zhao, et al. Estimating equations for parameters in means and covariances of multivariate discrete and continuous responses, 1991, Biometrics.

[15] A. Ziegler. Practical considerations of the jackknife estimator of variance for generalized estimating equations, 1997.