Bayesian sample size determination for binary regression with a misclassified covariate and no gold standard

Covariate misclassification is a common problem in epidemiology, genetics, and other biomedical areas. Because this form of misclassification is known to bias estimators, accounting for it at the design stage is of high importance. In this paper, we extend on previous work applied to response misclassification by developing a Bayesian approach to sample size determination for a covariate misclassification model with no gold standard. Our procedure considers both conditionally independent tests and tests in which dependence exists between classifiers. We specifically consider a Bayesian power criterion for the sample size determination scheme, and we demonstrate the improvement in model power for our dual classifier approach compared to a naive single classifier approach.

[1]  Edward J. Bedrick,et al.  Bayesian Binomial Regression: Predicting Survival at a Trauma Center , 1997 .

[2]  L. Joseph,et al.  Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. , 1995, American journal of epidemiology.

[3]  W. Blischke Moment Estimators for the Parameters of a Mixture of Two Binomial Distributions , 1960 .

[4]  Fan Zhang,et al.  Combining Internal and External Validation Data to Correct for Exposure Misclassification: A Case Study , 2007, Epidemiology.

[5]  Nandini Dendukuri,et al.  Bayesian Sample Size Determination for Prevalence and Diagnostic Test Studies in the Absence of a Gold Standard Test , 2004, Biometrics.

[6]  James D. Stamey,et al.  A Bayesian Model to Assess a Binary Measurement System When No Gold Standard System Is Available , 2011 .

[7]  P M Vacek,et al.  The effect of conditional dependence on the evaluation of diagnostic tests. , 1985, Biometrics.

[8]  Wessel N. van Wieringen,et al.  On identifiability of certain latent class models , 2005 .

[9]  John W Seaman,et al.  Bayesian sample‐size determination for inference on two binomial populations with no gold standard classifier , 2005, Statistics in medicine.

[10]  L. Joseph,et al.  Bayesian Approaches to Modeling the Conditional Dependence Between Multiple Diagnostic Tests , 2001, Biometrics.

[11]  Adam J. Branscum,et al.  Sample size calculations for studies designed to evaluate diagnostic test accuracy , 2007 .

[12]  R. Stone,et al.  A Bayesian Adjustment for Covariate Misclassification with Correlated Binary Outcome Data , 2007 .

[13]  B C Gladen,et al.  Estimating disease rates from a diagnostic test. , 1984, American journal of epidemiology.

[14]  P. Müller,et al.  Determining the Effective Sample Size of a Parametric Prior , 2008, Biometrics.

[15]  P. Holmans,et al.  Bayesian trio models for association in the presence of genotyping errors , 2004, Genetic epidemiology.

[16]  B. Craig,et al.  Estimating disease prevalence in the absence of a gold standard , 2002, Statistics in medicine.

[17]  Fei Wang,et al.  A simulation-based approach to Bayesian sample size determination for performance under a given model and for separating models , 2002 .

[18]  R. Christensen,et al.  A New Perspective on Priors for Generalized Linear Models , 1996 .

[19]  Wei Pan,et al.  Does it always help to adjust for misclassification of a binary outcome in logistic regression? , 2005, Statistics in medicine.

[20]  Pierpaolo Brutti,et al.  Robust Bayesian sample size determination in clinical trials , 2008, Statistics in medicine.

[21]  和泉 志津恵 Inference about misclassification probabilities from repeated binary responses , 2000 .

[22]  S. Walter,et al.  Estimating the error rates of diagnostic tests. , 1980, Biometrics.

[23]  N. Le,et al.  Impact of genotype misclassification on genetic association estimates and the bayesian adjustment. , 2009, American journal of epidemiology.

[24]  D. Spiegelman,et al.  Quantifying bias due to allele misclassification in case‐control studies of haplotypes , 2006, Genetic epidemiology.

[25]  Adam J Branscum,et al.  Bayesian approach to average power calculations for binary regression models with misclassified outcomes , 2009, Statistics in medicine.

[26]  P. Garthwaite,et al.  A Bayesian approach to prospective binary outcome studies with misclassification in a binary risk factor , 2005, Statistics in medicine.

[27]  Yosef Hochberg,et al.  On the Use of Double Sampling Schemes in Analyzing Categorical Data with Misclassification Errors , 1977 .