Conditional logistic regression with sandwich estimators: application to a meta-analysis.

Motivated by a meta-analysis of animal experiments on the effect of dietary fat and total caloric intake on mammary tumorigenesis, we explore the use of sandwich estimators of variance with conditional logistic regression. Classical conditional logistic regression assumes that the parameters are fixed effects across all clusters, while the sandwich estimator gives appropriate inferences for either fixed effects or random effects. However, inference using the standard Wald test with the sandwich estimator requires that each parameter is estimated using information from a large number of clusters. Since our example violates this condition, we introduce two modifications to the standard Wald test. First, we reduce the bias of the empirical variance estimator (the middle of the sandwich) by using standardized residuals. Second, we approximately account for the variance of these estimators by using the t-distribution instead of the normal distribution, where the degrees of freedom are estimated using Satterthwaite's approximation. Through simulations, we show that these sandwich estimators perform almost as well as classical estimators when the true effects are fixed and much better than the classical estimators when the true effects are random. We achieve simulated nominal coverage for these sandwich estimators even when some parameters are estimated from a small number of clusters.

[1]  D. Binder On the variances of asymptotically normal estimators from complex surveys , 1983 .

[2]  Nitin R. Patel,et al.  Exact logistic regression: theory and examples. , 1995, Statistics in medicine.

[3]  K. Carroll,et al.  Effects of dietary fat and dose level of 7,12-dimethylbenz(alpha)-anthracene on mammary tumor incidence in rats. , 1970, Cancer research.

[4]  R. Royall Model robust confidence intervals using maximum likelihood estimators , 1986 .

[5]  T A Louis,et al.  Random effects models with non-parametric priors. , 1992, Statistics in medicine.

[6]  William G. Cochran,et al.  Sampling Techniques, 3rd Edition , 1963 .

[7]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[8]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[9]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[10]  M. Gail,et al.  Likelihood calculations for matched case-control studies and survival studies with tied death times , 1981 .

[11]  D. Ruppert,et al.  Transformation and Weighting in Regression , 1988 .

[12]  T J O'Neill,et al.  Truncated logistic regression. , 1995, Biometrics.

[13]  M. Kelly,et al.  Effect of Caloric Restriction on Mammary-Tumor Formation in Strain C3H Mice and on the Response of Strain DBA to Painting with Methylcholanthrene , 1944 .

[14]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .

[15]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[16]  C. McCulloch Maximum Likelihood Algorithms for Generalized Linear Mixed Models , 1997 .

[17]  E. Korn,et al.  Regression analysis with clustered data. , 1994, Statistics in medicine.

[18]  J. Ware,et al.  Random-effects models for serial observations with binary response. , 1984, Biometrics.

[19]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[20]  D. R. Cox,et al.  The analysis of binary data , 1971 .

[21]  K Y Liang,et al.  Extended Mantel-Haenszel estimating procedure for multivariate logistic regression models. , 1987, Biometrics.

[22]  L. Freedman,et al.  Analysis of dietary fat, calories, body weight, and the development of mammary tumors in rats and mice: a review. , 1990, Cancer research.

[23]  R. Carroll,et al.  Variance Function Estimation , 1987 .

[24]  T C Chalmers,et al.  A comparison of statistical methods for combining event rates from clinical trials. , 1989, Statistics in medicine.

[25]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[26]  David A. Binder,et al.  Fitting Cox's proportional hazards models from survey data , 1992 .

[27]  J. Neuhaus Estimation efficiency and tests of covariate effects with clustered binary data. , 1993, Biometrics.

[28]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .