Comparing two correlated C indices with right‐censored survival outcome: a one‐shot nonparametric approach

The area under the receiver operating characteristic curve is often used as a summary index of the diagnostic ability in evaluating biomarkers when the clinical outcome (truth) is binary. When the clinical outcome is right-censored survival time, the C index, motivated as an extension of area under the receiver operating characteristic curve, has been proposed by Harrell as a measure of concordance between a predictive biomarker and the right-censored survival outcome. In this work, we investigate methods for statistical comparison of two diagnostic or predictive systems, of which they could either be two biomarkers or two fixed algorithms, in terms of their C indices. We adopt a U-statistics-based C estimator that is asymptotically normal and develop a nonparametric analytical approach to estimate the variance of the C estimator and the covariance of two C estimators. A z-score test is then constructed to compare the two C indices. We validate our one-shot nonparametric method via simulation studies in terms of the type I error rate and power. We also compare our one-shot method with resampling methods including the jackknife and the bootstrap. Simulation results show that the proposed one-shot method provides almost unbiased variance estimations and has satisfactory type I error control and power. Finally, we illustrate the use of the proposed method with an example from the Framingham Heart Study.

[1]  Margaret Sullivan Pepe,et al.  The sensitivity and specificity of markers for event times. , 2005, Biostatistics.

[2]  K. Zou,et al.  Smooth non-parametric receiver operating characteristic (ROC) curves for continuous diagnostic tests. , 1997, Statistics in medicine.

[3]  N. T. Smith,et al.  A measure of association for assessing prediction accuracy that is a generalization of non-parametric ROC area. , 1996, Statistics in medicine.

[4]  F. Harrell,et al.  Regression modelling strategies for improved prognostic prediction. , 1984, Statistics in medicine.

[5]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[6]  Michael J Pencina,et al.  Quantifying discrimination of Framingham risk functions with different survival C statistics , 2012, Statistics in medicine.

[7]  D. Dorfman,et al.  Maximum-likelihood estimation of parameters of signal-detection theory and determination of confidence intervals—Rating-method data , 1969 .

[8]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[9]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[10]  W. R. Buckland Elements of Nonparametric Statistics , 1967 .

[11]  P. Heagerty,et al.  Survival Model Predictive Accuracy and ROC Curves , 2005, Biometrics.

[12]  M. Pencina,et al.  On the C‐statistics for evaluating overall adequacy of risk prediction procedures with censored survival data , 2011, Statistics in medicine.

[13]  C. Begg,et al.  Comparing tumour staging and grading systems: a case study and a review of the issues, using thymoma as a model. , 2000, Statistics in medicine.

[14]  T. Dawber,et al.  Epidemiological approaches to heart disease: the Framingham Study. , 1951, American journal of public health and the nation's health.

[15]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[16]  M. Gonen,et al.  Concordance probability and discriminatory power in proportional hazards regression , 2005 .

[17]  D. Levy,et al.  Genome-Wide Scan for Pulse Pressure in the National Heart, Lung and Blood Institute’s Framingham Heart Study , 2004, Hypertension.

[18]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[19]  R. Larsen,et al.  An introduction to mathematical statistics and its applications (2nd edition) , by R. J. Larsen and M. L. Marx. Pp 630. £17·95. 1987. ISBN 13-487166-9 (Prentice-Hall) , 1987, The Mathematical Gazette.

[20]  Berkman Sahiner,et al.  On the assessment of the added value of new predictive biomarkers , 2013, BMC Medical Research Methodology.

[21]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[22]  N. Cliff,et al.  Variances and Covariances of Kendall's Tau and Their Estimation. , 1991, Multivariate behavioral research.

[23]  J. Neaton,et al.  Serum cholesterol, blood pressure, cigarette smoking, and death from coronary heart disease. Overall findings and differences by age for 316,099 white men. Multiple Risk Factor Intervention Trial Research Group. , 1992, Archives of internal medicine.

[24]  Byung-Ho Nam,et al.  Discrimination Index, the Area Under the ROC Curve , 2002 .

[25]  W. Castelli,et al.  Epidemiology of coronary heart disease: the Framingham study. , 1984, The American journal of medicine.

[26]  R Henderson,et al.  Problems and prediction in survival-data analysis. , 1995, Statistics in medicine.

[27]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[28]  Enrique F Schisterman,et al.  Maximum Likelihood Ratio Tests for Comparing the Discriminatory Ability of Biomarkers Subject to Limit of Detection , 2008, Biometrics.

[29]  Nancy A Obuchowski,et al.  An ROC‐type measure of diagnostic accuracy when the gold standard is continuous‐scale , 2006, Statistics in medicine.

[30]  J. Koziol,et al.  The Concordance Index C and the Mann–Whitney Parameter Pr(X>Y) with Randomly Censored Data , 2009, Biometrical journal. Biometrische Zeitschrift.

[31]  C. Begg,et al.  One statistical test is sufficient for assessing new predictive markers , 2011, BMC Medical Research Methodology.

[32]  Laura Antolini,et al.  Inference on Correlated Discrimination Measures in Survival Analysis: A Nonparametric Approach , 2004 .

[33]  Pranab Kumar Sen,et al.  On Some Convergence Properties of UStatistics , 1960 .

[34]  C. Metz,et al.  A New Approach for Testing the Significance of Differences Between ROC Curves Measured from Correlated Data , 1984 .

[35]  Jae-On Kim,et al.  Predictive Measures of Ordinal Association , 1971, American Journal of Sociology.

[36]  K. Zou,et al.  Receiver-Operating Characteristic Analysis for Evaluating Diagnostic Tests and Predictive Models , 2007, Circulation.

[37]  John A. Swets,et al.  Evaluation of diagnostic systems : methods from signal detection theory , 1982 .

[38]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[39]  James D. Neaton,et al.  Serum Cholesterol, Blood Pressure, Cigarette Smoking, and Death From Coronary Heart Disease Overall Findings and Differences by Age for 316099 White Men , 1992 .

[40]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[41]  D. Levy,et al.  The epidemiology of heart failure: the Framingham Study. , 1993, Journal of the American College of Cardiology.

[42]  R Simon,et al.  Measures of explained variation for survival data. , 1990, Statistics in medicine.

[43]  J. Neaton,et al.  Blood pressure, systolic and diastolic, and cardiovascular risks. US population data. , 1993, Archives of internal medicine.

[44]  A. Belanger,et al.  The Framingham study. , 1976, British medical journal.

[45]  J Stare,et al.  Explained variation in survival analysis. , 1996, Statistics in medicine.

[46]  Ralph B D'Agostino,et al.  Misuse of DeLong test to compare AUCs for nested models , 2012, Statistics in medicine.

[47]  M. Greiner,et al.  Principles and practical application of the receiver-operating characteristic analysis for diagnostic tests. , 2000, Preventive veterinary medicine.

[48]  M. Pencina,et al.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation , 2004, Statistics in medicine.

[49]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[50]  M C Weinstein,et al.  Performance of screening and diagnostic tests. Application of receiver operating characteristic analysis. , 1987, Archives of general psychiatry.

[51]  Ronald M. Krauss,et al.  American Heart Association Call to Action: Obesity as a Major Risk Factor for Coronary Heart Disease , 1998 .

[52]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[53]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[54]  W. Kannel,et al.  Systolic versus diastolic blood pressure and risk of coronary heart disease. The Framingham study. , 1971, The American journal of cardiology.

[55]  Guoqing Diao,et al.  Estimation of time‐dependent area under the ROC curve for long‐term risk prediction , 2006, Statistics in medicine.

[56]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[57]  Related Topics,et al.  Survival analysis : state of the art , 1992 .

[58]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .