Robust combination of multiple diagnostic tests for classifying censored event times.

Recent advancement in technology promises to yield a multitude of tests for disease diagnosis and prognosis. When there are multiple sources of information available, it is often of interest to construct a composite score that can provide better classification accuracy than any individual measurement. In this paper, we consider robust procedures for optimally combining tests when test results are measured prior to disease onset and disease status evolves over time. To account for censoring of disease onset time, the most commonly used approach to combining tests to detect subsequent disease status is to fit a proportional hazards model (Cox, 1972) and use the estimated risk score. However, simulation studies suggested that such a risk score may have poor accuracy when the proportional hazards assumption fails. We propose the use of a nonparametric transformation model (Han, 1987) as a working model to derive an optimal composite score with theoretical justification. We demonstrate that the proposed score is the optimal score when the model holds and is optimal "on average" among linear scores even if the model fails. Time-dependent sensitivity, specificity, and receiver operating characteristic curve functions are used to quantify the accuracy of the resulting composite score. We provide consistent and asymptotically Gaussian estimators of these accuracy measures. A simple model-free resampling procedure is proposed to obtain all consistent variance estimators. We illustrate the new proposals with simulation studies and an analysis of a breast cancer gene expression data set.

[1]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[2]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[3]  Anthony N. Pettitt,et al.  Proportional Odds Models for Survival Data and Estimates Using Ranks , 1984 .

[4]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[5]  Aaron K. Han Non-parametric analysis of a generalized regression model: the maximum rank correlation estimator , 1987 .

[6]  D. Pollard,et al.  $U$-Processes: Rates of Convergence , 1987 .

[7]  D. Pollard Empirical Processes: Theory and Applications , 1990 .

[8]  Zhiliang Ying,et al.  Linear regression analysis of censored survival data based on rank tests , 1990 .

[9]  R. Sherman The Limiting Distribution of the Maximum Rank Correlation Estimator , 1993 .

[10]  Z. Ying,et al.  A resampling method based on pivotal estimating functions , 1994 .

[11]  Z. Ying,et al.  Analysis of transformation models with censored data , 1995 .

[12]  Daniel B. Mark,et al.  TUTORIAL IN BIOSTATISTICS MULTIVARIABLE PROGNOSTIC MODELS: ISSUES IN DEVELOPING MODELS, EVALUATING ASSUMPTIONS AND ADEQUACY, AND MEASURING AND REDUCING ERRORS , 1996 .

[13]  J Stare,et al.  Explained variation in survival analysis. , 1996, Statistics in medicine.

[14]  F. Harrell,et al.  Prognostic/Clinical Prediction Models: Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors , 2005 .

[15]  J. Horowitz,et al.  Semiparametric Estimation of a Censored Regression Model with an Unknown Transformation of the Dependent Variable , 1999 .

[16]  Susan A. Murphy,et al.  Maximum Likelihood Estimation in the Proportional Odds Model , 1997 .

[17]  J. Robins,et al.  Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. , 1997, Statistics in medicine.

[18]  D. Levy,et al.  Prediction of coronary heart disease using risk factor categories. , 1998, Circulation.

[19]  Anastasios A. Tsiatis,et al.  Semiparametric Efficient Estimation in the Generalized Odds-Rate Class of Regression Models for Right-Censored Time-to-Event Data , 1998, Lifetime data analysis.

[20]  R. Sherman,et al.  Rank estimators for monotonic index models , 1998 .

[21]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[22]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[23]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[24]  Zhiliang Ying,et al.  Semiparametric Transformation Models for Point Processes , 2001 .

[25]  Zhiliang Ying,et al.  Semiparametric analysis of transformation models with censored data , 2002 .

[26]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[27]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[28]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[29]  Margaret Sullivan Pepe,et al.  Combining Several Screening Tests: Optimality of the Risk Score , 2002, Biometrics.

[30]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[31]  Songnian Chen,et al.  RANK ESTIMATION OF TRANSFORMATION MODELS , 2002 .

[32]  Z. Ying,et al.  Rank-based inference for the accelerated failure time model , 2003 .

[33]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[34]  Jason Abrevaya Pairwise-Difference Rank Estimation of the Transformation Model , 2003 .

[35]  Laurence L. George,et al.  The Statistical Analysis of Failure Time Data , 2003, Technometrics.

[36]  Werner Dubitzky,et al.  A Practical Approach to Microarray Data Analysis , 2003, Springer US.

[37]  Luis Mateus Rocha,et al.  Singular value decomposition and principal component analysis , 2003 .

[38]  M. Pepe,et al.  Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker. , 2004, American journal of epidemiology.

[39]  M. Pencina,et al.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation , 2004, Statistics in medicine.

[40]  Chaya S Moskowitz,et al.  Quantifying and comparing the accuracy of binary biomarkers when predicting a failure time outcome. , 2004, Statistics in medicine.

[41]  P. Heagerty,et al.  Survival Model Predictive Accuracy and ROC Curves , 2005, Biometrics.

[42]  Margaret Sullivan Pepe,et al.  The sensitivity and specificity of markers for event times. , 2005, Biostatistics.

[43]  T. Cai,et al.  Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve , 2006, Biometrics.

[44]  Tianxi Cai,et al.  Application of the Time‐Dependent ROC Curves for Prognostic Accuracy with Multiple Biomarkers , 2006, Biometrics.

[45]  Elie Tamer,et al.  Partial rank estimation of duration models with general forms of censoring , 2007 .