Optimal experimental designs for multilevel logistic models

When designing experiments in multilevel populations the following questions arise: what is the optimal level of randomization, and what is the optimal allocation of units? In this paper these questions will be dealt with for populations with two levels of nesting and binary outcomes. The multilevel logistic model, which is used to describe the relationship between treatment condition and outcome, is linearized. The variance of the regression coefficient associated with treatment condition in the linearized model is used to find the optimal level of randomization and the optimal allocation of units. An analytical expression for this variance can only be obtained for the first-order marginal quasi-likelihood linearization method, which is known to be biased. A simulation study shows that penalized quasi-likelihood linearization and numerical integration of the likelihood lead to conclusions about the optimal design that are similar to those from the analytical derivations for first-order marginal quasi-likelihood.

[1]  Joop J. Hox,et al.  Applied Multilevel Analysis. , 1995 .

[2]  A. Donner,et al.  Randomization by cluster. Sample size requirements and analysis. , 1981, American journal of epidemiology.

[3]  B. Woolf ON ESTIMATING THE RELATION BETWEEN BLOOD GROUP AND DISEASE , 1955, Annals of human genetics.

[4]  Harvey Goldstein,et al.  Improved Approximations for Multilevel Models with Binary Responses , 1996 .

[5]  Allan Donner,et al.  Some aspects of the design and analysis of cluster randomization trials , 2002 .

[6]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[7]  D. Holt,et al.  Planning and Analysis of Observational Studies. , 1983 .

[8]  Roel Bosker,et al.  Multilevel analysis : an introduction to basic and advanced multilevel modeling , 1999 .

[9]  W. Shih,et al.  Sample Size and Power Calculations for Periodontal and Other Studies with Clustered Samples Using the Method of Generalized Estimating Equations , 1997 .

[10]  P. Laycock,et al.  Optimum Experimental Designs , 1995 .

[11]  K Y Liang,et al.  Sample size calculations for studies with correlated observations. , 1997, Biometrics.

[12]  D. Hedeker,et al.  Random effects probit and logistic regression models for three-level data. , 1997, Biometrics.

[13]  D. Hedeker,et al.  MIXOR: a computer program for mixed-effects ordinal regression analysis. , 1996, Computer methods and programs in biomedicine.

[14]  D. Hedeker,et al.  A random-effects ordinal regression model for multilevel analysis. , 1994, Biometrics.

[15]  S. Lipsitz,et al.  Sample Size Calculations for Non‐Randomized Studies , 1995 .

[16]  W. Haenszel,et al.  Statistical aspects of the analysis of data from retrospective studies of disease. , 1959, Journal of the National Cancer Institute.

[17]  H. Goldstein Nonlinear multilevel models, with an application to discrete response data , 1991 .

[18]  Martijn P. F. Berger,et al.  Design Issues for Experiments in Multilevel Populations , 2000 .

[19]  Z Feng,et al.  Correlated binomial variates: properties of estimator of intraclass correlation and its effect on sample size calculation. , 1992, Statistics in medicine.

[20]  Anthony C. Atkinson,et al.  Optimum Experimental Designs , 1992 .

[21]  Martijn P. F. Berger,et al.  OPTIMAL EXPERIMENTAL DESIGNS FOR MULTILEVEL MODELS WITH COVARIATES , 2001 .

[22]  Y. Plétan,et al.  Rufloxacin once daily versus ciprofloxacin twice daily in the treatment of patients with acute uncomplicated pyelonephritis. , 1995, The Journal of urology.

[23]  Noreen Goldman,et al.  An assessment of estimation procedures for multilevel models with binary responses , 1995 .

[24]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[25]  N Dubin,et al.  Estimation and sample size considerations for clustered binary responses. , 1994, Statistics in medicine.

[26]  D. Bloch,et al.  A simple method of sample size calculation for linear and logistic regression. , 1998, Statistics in medicine.

[27]  H. Goldstein Multilevel mixed linear model analysis using iterative generalized least squares , 1986 .

[28]  A Donner,et al.  Adjustments to the Mantel-Haenszel chi-square statistic and odds ratio variance estimator when the data are clustered. , 1987, Statistics in medicine.

[29]  F. Hsieh,et al.  Sample size formulae for intervention studies with the cluster as unit of randomization. , 1988, Statistics in medicine.

[30]  Jan de Leeuw,et al.  Introducing Multilevel Modeling , 1998 .

[31]  H. Goldstein Restricted unbiased iterative generalized least-squares estimation , 1989 .

[32]  Donald Hedeker,et al.  Analysis of clustered data in community psychology: With an example from a worksite smoking cessation project , 1994, American journal of community psychology.

[33]  G. Kok,et al.  A Dutch social influence smoking prevention approach for vocational school students , 1994 .

[34]  A Donner,et al.  A methodological review of non-therapeutic intervention trials employing cluster randomization, 1979-1989. , 1990, International journal of epidemiology.

[35]  Roel Bosker,et al.  Standard Errors and Sample Sizes for Two-Level Research , 1993 .