A Cautionary Note on the Robustness of Latent Class Models for Estimating Diagnostic Error without a Gold Standard

Summary.  Modeling diagnostic error without a gold standard has been an active area of biostatistical research. In a majority of the approaches, model‐based estimates of sensitivity, specificity, and prevalence are derived from a latent class model in which the latent variable represents an individual's true unobserved disease status. For simplicity, initial approaches assumed that the diagnostic test results on the same subject were independent given the true disease status (i.e., the conditional independence assumption). More recently, various authors have proposed approaches for modeling the dependence structure between test results given true disease status. This note discusses a potential problem with these approaches. Namely, we show that when the conditional dependence between tests is misspecified, estimators of sensitivity, specificity, and prevalence can be biased. Importantly, we demonstrate that with small numbers of tests, likelihood comparisons and other model diagnostics may not be able to distinguish between models with different dependence structures. We present asymptotic results that show the generality of the problem. Further, data analysis and simulations demonstrate the practical implications of model misspecification. Finally, we present some guidelines about the use of these models for practitioners.

[1]  S. Walter,et al.  Estimating the error rates of diagnostic tests. , 1980, Biometrics.

[2]  P M Vacek,et al.  The effect of conditional dependence on the evaluation of diagnostic tests. , 1985, Biometrics.

[3]  S. Handelman,et al.  Clinical radiographic evaluation of sealed carious and sound tooth surfaces. , 1986, Journal of the American Dental Association.

[4]  M. Espeland,et al.  Using latent class models to characterize and assess relative error in discrete measurements. , 1989, Biometrics.

[5]  C B Begg,et al.  Consensus Diagnoses and "Gold Standards" , 1990, Medical decision making : an international journal of the Society for Medical Decision Making.

[6]  M. Tan,et al.  Random effects models in latent class analysis for evaluating accuracy of diagnostic tests. , 1996, Biometrics.

[7]  I Yang,et al.  Latent variable modeling of diagnostic accuracy. , 1997, Biometrics.

[8]  S D Walter,et al.  Effects of dependent errors in the assessment of diagnostic test performance. , 1997, Statistics in medicine.

[9]  S. Hui,et al.  Evaluation of diagnostic tests without gold standards , 1998, Statistical methods in medical research.

[10]  M S Pepe,et al.  Using a combination of reference tests to assess the accuracy of a new diagnostic test. , 1999, Statistics in medicine.

[11]  L. Joseph,et al.  Bayesian Approaches to Modeling the Conditional Dependence Between Multiple Diagnostic Tests , 2001, Biometrics.

[12]  W Miller,et al.  Using a combination of reference tests to assess the accuracy of a diagnostic test by A. Alonzo and M. Pepe, Statistics in Medicine 1999; 18: 2987–3003 , 2001, Statistics in medicine.

[13]  P. Heagerty,et al.  Misspecified maximum likelihood estimates and generalised linear mixed models , 2001 .

[14]  P S Albert,et al.  Latent Class Modeling Approaches for Assessing Diagnostic Error without a Gold Standard: With Applications to p53 Immunohistochemical Assays in Bladder Tumors , 2001, Biometrics.

[15]  B. Craig,et al.  Estimating disease prevalence in the absence of a gold standard , 2002, Statistics in medicine.

[16]  A. Hadgu,et al.  A biomedical application of latent class models with random effects , 2002 .