Generalized Estimating Equations for Ordinal Categorical Data: Arbitrary Patterns of Missing Responses and Missingness in a Key Covariate

We propose methods for regression analysis of repeatedly measured ordinal categorical data when there is nonmonotone missingness in these responses and when a key covariate is missing depending on observables. The methods use ordinal regression models in conjunction with generalized estimating equations (GEEs). We extend the GEE methodology to accommodate arbitrary patterns of missingness in the responses when this missingness is independent of the unobserved responses. We further extend the methodology to provide correction for possible bias when missingness in knowledge of a key covariate may depend on observables. The approach is illustrated with the analysis of data from a study in diagnostic oncology in which multiple correlated receiver operating characteristic curves are estimated and corrected for possible verification bias when the true disease status is missing depending on observables.

[1]  X H Zhou,et al.  A nonparametric maximum likelihood estimator for the receiver operating characteristic curve area in the presence of verification bias. , 1996, Biometrics.

[2]  John Uebersax,et al.  Statistical Modeling of Expert Ratings on Medical Treatment Appropriateness , 1993 .

[3]  C B Begg,et al.  A General Regression Methodology for ROC Curve Estimation , 1988, Medical decision making : an international journal of the Society for Medical Decision Making.

[4]  R A Greenes,et al.  Construction of Receiver Operating Characteristic Curves when Disease Verification Is Subject to Selection Bias , 1984, Medical decision making : an international journal of the Society for Medical Decision Making.

[5]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[6]  C A Gatsonis,et al.  Regression analysis of correlated receiver operating characteristic data. , 1995, Academic radiology.

[7]  D. Pierce The Asymptotic Effect of Substituting Estimators for Parameters in Certain Types of Statistics , 1982 .

[8]  J. Swets,et al.  Assessment of diagnostic technologies. , 1979, Science.

[9]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[10]  S. Zeger,et al.  Multivariate Regression Analyses for Categorical Data , 1992 .

[11]  D. Dorfman,et al.  Maximum-likelihood estimation of parameters of signal-detection theory and determination of confidence intervals—Rating-method data , 1969 .

[12]  R A Greenes,et al.  Assessment of diagnostic tests when disease verification is subject to selection bias. , 1983, Biometrics.

[13]  A. Toledano,et al.  Ordinal regression methodology for ROC curves derived from correlated data. , 1996, Statistics in medicine.

[14]  S. Lipsitz,et al.  Analysis of repeated categorical data using generalized estimating equations. , 1994, Statistics in medicine.

[15]  J. R. Landis,et al.  The analysis of longitudinal polytomous data: generalized estimating equations and connections with weighted least squares. , 1993, Biometrics.

[16]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[17]  X H Zhou,et al.  Effect of verification bias on positive and negative predictive values. , 1994, Statistics in medicine.

[18]  R A Greenes,et al.  Assessment of Diagnostic Technologies: Methodology for Unbiased Estimation from Samples of Selectively Verified Patients , 1985, Investigative radiology.

[19]  B J McNeil,et al.  CT and MR imaging in staging non-small cell bronchogenic carcinoma: report of the Radiologic Diagnostic Oncology Group. , 1991, Radiology.

[20]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .