Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves

BackgroundThe results of Randomized Controlled Trials (RCTs) on time-to-event outcomes that are usually reported are median time to events and Cox Hazard Ratio. These do not constitute the sufficient statistics required for meta-analysis or cost-effectiveness analysis, and their use in secondary analyses requires strong assumptions that may not have been adequately tested. In order to enhance the quality of secondary data analyses, we propose a method which derives from the published Kaplan Meier survival curves a close approximation to the original individual patient time-to-event data from which they were generated.MethodsWe develop an algorithm that maps from digitised curves back to KM data by finding numerical solutions to the inverted KM equations, using where available information on number of events and numbers at risk. The reproducibility and accuracy of survival probabilities, median survival times and hazard ratios based on reconstructed KM data was assessed by comparing published statistics (survival probabilities, medians and hazard ratios) with statistics based on repeated reconstructions by multiple observers.ResultsThe validation exercise established there was no material systematic error and that there was a high degree of reproducibility for all statistics. Accuracy was excellent for survival probabilities and medians, for hazard ratios reasonable accuracy can only be obtained if at least numbers at risk or total number of events are reported.ConclusionThe algorithm is a reliable tool for meta-analysis and cost-effectiveness analyses of RCTs reporting time-to-event data. It is recommended that all RCTs should report information on numbers at risk and total number of events alongside KM curves.

[1]  Christopher U. Jones,et al.  Radiotherapy plus cetuximab for squamous-cell carcinoma of the head and neck. , 2006, The New England journal of medicine.

[2]  H Putter,et al.  Meta‐analysis of pairs of survival curves under heterogeneity: A Poisson correlated gamma‐frailty approach , 2009, Statistics in medicine.

[3]  Tsiporah Shore,et al.  A meta‐analysis of stages I and II Hodgkin's disease , 1990, Cancer.

[4]  N. Black CONSORT , 1996, The Lancet.

[5]  D. Moher,et al.  CONSORT 2010 Explanation and Elaboration: updated guidelines for reporting parallel group randomised trials , 2010, BMJ : British Medical Journal.

[6]  Max Wolf,et al.  Rituximab maintenance improves clinical outcome of relapsed/resistant follicular non-Hodgkin lymphoma in patients both with and without rituximab during induction: results of a prospective randomized phase 3 intergroup trial. , 2006, Blood.

[7]  Timothy L. Lash,et al.  Comprar Modern Epidemiology | Timothy L. Lash | 9781451190052 | Lippincott Williams & Wilkins , 2012 .

[8]  Z. Philips,et al.  Network meta‐analysis of parametric survival curves , 2010, Research synthesis methods.

[9]  Stefan Michiels,et al.  Meta-analysis when only the median survival times are known: A comparison with individual patient data results , 2005, International Journal of Technology Assessment in Health Care.

[10]  Paula R Williamson,et al.  Aggregate data meta‐analysis with time‐to‐event outcomes , 2002, Statistics in medicine.

[11]  Gaëtan MacGrogan,et al.  Variables with time-varying effects and the Cox model: Some statistical concepts illustrated with a prognostic factor study in breast cancer , 2010, BMC medical research methodology.

[12]  Christopher Cox,et al.  The generalized F distribution: An umbrella for parametric survival analysis , 2008, Statistics in medicine.

[13]  K. Dear,et al.  Iterative generalized least squares for meta-analysis of survival data at multiple times. , 1994, Biometrics.

[14]  J L Hutton,et al.  Individual patient data meta-analysis of randomized anti-epileptic drug monotherapy trials. , 2000, Journal of evaluation in clinical practice.

[15]  K. Abrams,et al.  Assessing methods for dealing with treatment switching in randomised controlled trials: a simulation study , 2011, BMC medical research methodology.

[16]  P. Royston,et al.  Flexible parametric proportional‐hazards and proportional‐odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects , 2002, Statistics in medicine.

[17]  JEH-NAN PAN Evaluating the Gauge Repeatability and Reproducibility for Different Industries , 2006 .

[18]  J B Wong,et al.  Meta-analysis of failure-time Data with Adjustment for Covariates , 1994, Medical decision making : an international journal of the Society for Medical Decision Making.

[19]  W. D. Ray 4. Modelling Survival Data in Medical Research , 1995 .

[20]  Richard J Stephens,et al.  Different strategies of sequential and combination chemotherapy for patients with poor prognosis advanced colorectal cancer (MRC FOCUS): a randomised controlled trial , 2007, The Lancet.

[21]  George A. Wells,et al.  An Assessment of Methods to Combine Published Survival Curves , 2000, Medical decision making : an international journal of the Society for Medical Decision Making.

[22]  James B. McDonald,et al.  A generalization of the beta distribution with applications , 1995 .

[23]  N. Welton,et al.  Survival time outcomes in randomized, controlled trials and meta-analyses: the parallel universes of efficacy and cost-effectiveness. , 2011, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[24]  L. Stewart,et al.  To IPD or not to IPD? , 2002, Evaluation & the health professions.

[25]  J. Neijt,et al.  A meta-analysis of prognostic factors in advanced ovarian cancer with median survival and overall survival (measured with the log (relative risk)) as main objectives. , 1989, European journal of cancer & clinical oncology.

[26]  Robert B Livingston,et al.  Randomized trial of letrozole following tamoxifen as extended adjuvant therapy in receptor-positive breast cancer: updated findings from NCIC CTG MA.17. , 2005, Journal of the National Cancer Institute.

[27]  M. Parmar,et al.  Extracting summary statistics to perform meta-analyses of the published literature for survival endpoints. , 1998, Statistics in medicine.

[28]  D. Moher,et al.  CONSORT 2010 Explanation and Elaboration: updated guidelines for reporting parallel group randomised trials , 2010, BMJ : British Medical Journal.

[29]  Fu-Kwun Wang,et al.  Confidence intervals in repeatability and reproducibility using the Bootstrap method , 2003 .

[30]  Theo Stijnen,et al.  Interim analysis on survival data: its potential bias and how to repair it , 2005, Statistics in medicine.

[31]  A Whitehead,et al.  A general parametric approach to the meta-analysis of randomized clinical trials. , 1991, Statistics in medicine.

[32]  Theo Stijnen,et al.  Meta‐analysis of summary survival curve data , 2008, Statistics in medicine.

[33]  J. Jansen Network meta-analysis of survival data with fractional polynomials , 2011, BMC medical research methodology.