Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data

BackgroundPrognostic studies of time-to-event data, where researchers aim to develop or validate multivariable prognostic models in order to predict survival, are commonly seen in the medical literature; however, most are performed retrospectively and few consider sample size prior to analysis. Events per variable rules are sometimes cited, but these are based on bias and coverage of confidence intervals for model terms, which are not of primary interest when developing a model to predict outcome. In this paper we aim to develop sample size recommendations for multivariable models of time-to-event data, based on their prognostic ability.MethodsWe derive formulae for determining the sample size required for multivariable prognostic models in time-to-event data, based on a measure of discrimination, D, developed by Royston and Sauerbrei. These formulae fall into two categories: either based on the significance of the value of D in a new study compared to a previous estimate, or based on the precision of the estimate of D in a new study in terms of confidence interval width. Using simulation we show that they give the desired power and type I error and are not affected by random censoring. Additionally, we conduct a literature review to collate published values of D in different disease areas.ResultsWe illustrate our methods using parameters from a published prognostic study in liver cancer. The resulting sample sizes can be large, and we suggest controlling study size by expressing the desired accuracy in the new study as a relative value as well as an absolute value. To improve usability we use the values of D obtained from the literature review to develop an equation to approximately convert the commonly reported Harrell’s c-index to D. A flow chart is provided to aid decision making when using these methods.ConclusionWe have developed a suite of sample size calculations based on the prognostic ability of a survival model, rather than the magnitude or significance of model coefficients. We have taken care to develop the practical utility of the calculations and give recommendations for their use in contemporary clinical research.

[1]  P Royston,et al.  A simulation study of predictive ability measures in a survival model II: explained randomness and predictive accuracy , 2012, Statistics in medicine.

[2]  Douglas G Altman,et al.  Prognostic Models: A Methodological Framework and Review of Models for Breast Cancer , 2009, Cancer investigation.

[3]  R. C. Jinks,et al.  Sample size for multivariable prognostic models , 2012 .

[4]  E. Steyerberg,et al.  Prognosis Research Strategy (PROGRESS) 2: Prognostic Factor Research , 2013, PLoS medicine.

[5]  John T. Kent,et al.  Measures of dependence for censored survival data , 1988 .

[6]  R Z Omar,et al.  An evaluation of penalised survival methods for developing prognostic models with rare events , 2012, Statistics in medicine.

[7]  Yvonne Vergouwe,et al.  Prognosis and prognostic research: what, why, and how? , 2009, BMJ : British Medical Journal.

[8]  Ralf Bender,et al.  Generating survival times to simulate Cox proportional hazards models , 2005, Statistics in medicine.

[9]  Patrick Royston,et al.  Reporting methods in studies developing prognostic models in cancer: a review , 2010, BMC medicine.

[10]  Patrick Royston,et al.  Explained Variation for Survival Models , 2006 .

[11]  P W Lavori,et al.  Sample-size calculations for the Cox proportional hazards regression model with nonbinary covariates. , 2000, Controlled clinical trials.

[12]  M Schumacher,et al.  Sample size considerations for the evaluation of prognostic factors in survival analysis. , 2000, Statistics in medicine.

[13]  Yvonne Vergouwe,et al.  Prognosis and prognostic research: Developing a prognostic model , 2009, BMJ : British Medical Journal.

[14]  Sample size calculations for failure time random variables in non-randomized studies , 2000 .

[15]  Yvonne Vergouwe,et al.  External validity of risk models: Use of benchmark values to disentangle a case-mix effect from incorrect coefficients. , 2010, American journal of epidemiology.

[16]  X. Paoletti,et al.  Prognosis of advanced hepatocellular carcinoma: comparison of three staging systems in two French clinical trials. , 2008, Annals of oncology : official journal of the European Society for Medical Oncology.

[17]  Patrick Royston,et al.  A simulation study of predictive ability measures in a survival model I: Explained variation measures , 2012, Statistics in medicine.

[18]  E. Steyerberg Clinical Prediction Models , 2008, Statistics for Biology and Health.

[19]  P Peduzzi,et al.  Importance of events per independent variable in proportional hazards analysis. I. Background, goals, and general strategy. , 1995, Journal of clinical epidemiology.

[20]  F. Harrell,et al.  Regression modelling strategies for improved prognostic prediction. , 1984, Statistics in medicine.

[21]  M. Gonen,et al.  Concordance probability and discriminatory power in proportional hazards regression , 2005 .

[22]  A. Raftery,et al.  Bayesian Information Criterion for Censored Survival Models , 2000, Biometrics.

[23]  Yulia V. Marchenko Estimating Variance Components in Stata , 2006 .

[24]  D. Schoenfeld,et al.  Sample-size formula for the proportional-hazards regression model. , 1983, Biometrics.

[25]  Patrick Royston,et al.  A new measure of prognostic separation in survival data , 2004, Statistics in medicine.

[26]  Wenyaw Chan,et al.  Statistical Methods in Medical Research , 2013, Model. Assist. Stat. Appl..

[27]  D. Purdie Statistical Methods in Medical Research, 4th edn , 2003 .

[28]  Yvonne Vergouwe,et al.  Substantial effective sample sizes were required for external validation studies of predictive logistic regression models. , 2005, Journal of clinical epidemiology.

[29]  W. L. McGuire,et al.  Breast cancer prognostic factors: evaluation guidelines. , 1991, Journal of the National Cancer Institute.

[30]  Charles E McCulloch,et al.  Relaxing the rule of ten events per variable in logistic and Cox regression. , 2007, American journal of epidemiology.

[31]  J. Copas Regression, Prediction and Shrinkage , 1983 .

[32]  J. Concato,et al.  Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. , 1995, Journal of clinical epidemiology.

[33]  Douglas G. Altman,et al.  Methodological challenges in the evaluation of prognostic factors in breast cancer , 2004, Breast Cancer Research and Treatment.