Comparison of approaches to estimate confidence intervals of post-test probabilities of diagnostic test results in a nested case-control study

BackgroundNested case–control studies become increasingly popular as they can be very efficient for quantifying the diagnostic accuracy of costly or invasive tests or (bio)markers. However, they do not allow for direct estimation of the test’s predictive values or post-test probabilities, let alone for their confidence intervals (CIs). Correct estimates of the predictive values itself can easily be obtained using a simple correction by the (inverse) sampling fractions of the cases and controls. But using this correction to estimate the corresponding standard error (SE), falsely increases the number of patients that are actually studied, yielding too small CIs. We compared different approaches for estimating the SE and thus CI of predictive values or post-test probabilities of diagnostic test results in a nested case–control study.MethodsWe created datasets based on a large, previously published diagnostic study on 2 different tests (D-dimer test and calf difference test) with a nested case–control design. We compared six different approaches; the approaches were: 1. the standard formula for the SE of a proportion, 2. adaptation of the standard formula with the sampling fraction, 3. A bootstrap procedure, 4. A approach, which uses the sensitivity, the specificity and the prevalence, 5. Weighted logistic regression, and 6. Approach 4 on the log odds scale. The approaches were compared with respect to coverage of the CI and CI-width.ResultsThe bootstrap procedure (approach 3) showed good coverage and relatively small CI widths. Approaches 4 and 6 showed some undercoverage, particularly for the D-dimer test with frequent positive results (positive results around 70%). Approaches 1, 2 and 5 showed clear overcoverage at low prevalences of 0.05 and 0.1 in the cohorts for all case–control ratios.ConclusionThe results from our study suggest that a bootstrap procedure is necessary to assess the confidence interval for the predictive values or post-test probabilities of diagnostic tests results in studies using a nested case–control design.

[1]  Samy Suissa,et al.  The nested case-control study in cardiology. , 2003, American heart journal.

[2]  Patrick Royston,et al.  The design of simulation studies in medical statistics , 2006, Statistics in medicine.

[3]  Diederick E Grobbee,et al.  Test research versus diagnostic research. , 2004, Clinical chemistry.

[4]  Yvonne Vergouwe,et al.  Bmc Medical Research Methodology Open Access Advantages of the Nested Case-control Design in Diagnostic Research , 2022 .

[5]  N. Mantel Synthetic retrospective studies and related topics. , 1973, Biometrics.

[6]  Johannes B Reitsma,et al.  Case-control and two-gate designs in diagnostic accuracy studies. , 2005, Clinical chemistry.

[7]  W. Ageno,et al.  The Wells rule was not useful in ruling out deep venous thrombosis in a primary care setting. , 2006, Evidence-based medicine.

[8]  Xiao-Hua Zhou,et al.  Confidence intervals for predictive values with an emphasis to case–control studies , 2007, Statistics in medicine.

[9]  V. Ernster,et al.  Nested case-control studies. , 1994, Preventive medicine.

[10]  Karel Moons,et al.  The Wells Rule Does Not Adequately Rule Out Deep Venous Thrombosis in Primary Care Patients , 2005, Annals of Internal Medicine.

[11]  Christian Gluud,et al.  Evidence based diagnostics , 2005, BMJ : British Medical Journal.

[12]  Dennis G. Fryback,et al.  The Efficacy of Diagnostic Imaging , 1991, Medical decision making : an international journal of the Society for Medical Decision Making.

[13]  Holly Janes,et al.  Pivotal Evaluation of the Accuracy of a Biomarker Used for Classification or Prediction: Standards for Study Design , 2008, Journal of the National Cancer Institute.

[14]  A. Hoes,et al.  Limited value of patient history and physical examination in diagnosing deep vein thrombosis in primary care. , 2004, Family practice.

[15]  A. Hoes,et al.  Different cut-off values for two D-dimer assays to exclude deep venous thrombosis in primary care , 2006, Thrombosis and Haemostasis.

[16]  Sudhir Srivastava,et al.  Markers for early detection of cancer: Statistical guidelines for nested case-control studies , 2002, BMC medical research methodology.

[17]  A. Dixon,et al.  Measuring the effects of imaging: an evaluative framework. , 1995, Clinical radiology.

[18]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .