Adaptive Validation Design

Supplemental Digital Content is available in the text. An internal validation substudy compares an imperfect measurement of a variable with a gold-standard measurement in a subset of the study population. Validation data permit calculation of a bias-adjusted estimate, which has the same expected value as the association that would have been observed had the gold-standard measurement been available for the entire study population. Existing guidance on optimal sampling for validation substudies assumes complete enrollment and follow-up of the target cohort. No guidance exists for validation substudy design while cohort data are actively being collected. In this article, we use the framework of Bayesian monitoring methods to develop an adaptive approach to validation study design. This method monitors whether sufficient validation data have been collected to meet predefined criteria for estimation of the positive and negative predictive values. We demonstrate the utility of this method using the Study of Transition, Outcomes and Gender—a cohort study of transgender and gender nonconforming people. We demonstrate the method’s ability to determine efficacy (when sufficient validation data have accumulated to obtain estimates of the predictive values that fall above a threshold value) and futility (when sufficient validation data have accumulated to conclude the mismeasured variable is an untenable substitute for the gold-standard measurement). This proposed method can be applied within the context of any parent epidemiologic study design and modified to meet alternative criteria given specific study or validation study objectives. Our method provides a novel approach to effective and efficient estimation of classification parameters as validation data accrue.

[1]  P M Fayers,et al.  Tutorial in biostatistics Bayesian data monitoring in clinical trials. , 1997, Statistics in medicine.

[2]  D Spiegelman,et al.  Design of Validation Studies for Estimating the Odds Ratio of Exposure–Disease Relationships When Exposure Is Misclassified , 1999, Biometrics.

[3]  A. Olshan,et al.  EPIDEMIOLOGY Announces the "Validation Study" Submission Category. , 2016, Epidemiology.

[4]  T R Holford,et al.  Study design for epidemiologic studies with measurement error , 1995, Statistical methods in medical research.

[5]  Roger Logan,et al.  Estimation and Inference for Logistic Regression with Covariate Misclassification and Measurement Error in Main Study/Validation Study Designs , 2000 .

[6]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[7]  F. Collins,et al.  Policy: NIH plans to enhance reproducibility , 2014, Nature.

[8]  Rebecca Nash,et al.  Cohort profile: Study of Transition, Outcomes and Gender (STRONG) to assess health status of transgender people , 2017, BMJ Open.

[9]  Timothy L. Lash,et al.  Applying Quantitative Bias Analysis to Epidemiologic Data , 2009, Statistics for Biology and Health.

[10]  Liam Smeeth,et al.  Helping everyone do better: a call for validation studies of routinely recorded health data , 2016, Clinical epidemiology.

[11]  D Spiegelman,et al.  Cost-efficient study designs for binary response data with Gaussian covariate measurement error. , 1991, Biometrics.

[12]  T. Ahern,et al.  Validity of Danish Breast Cancer Group (DBCG) registry data used in the predictors of breast cancer recurrence (ProBeCaRe) premenopausal breast cancer cohort study , 2017, Acta oncologica.

[13]  Sander Greenland,et al.  Bias Analysis , 2011, International Encyclopedia of Statistical Science.

[14]  S Greenland,et al.  Variance estimation for epidemiologic effect estimates under misclassification. , 1988, Statistics in medicine.