On the use of flexible excess hazard regression models for describing long-term breast cancer survival: a case-study using population-based cancer registry data

BackgroundBreast cancer prognosis has dramatically improved over 40 years. There is, however, no proof of population ‘cure’. This research aimed to examine the pattern of long-term excess mortality due to breast cancer and evaluate its determinants in the context of cancer registry data.MethodsWe used data from the Geneva Cancer Registry to identify women younger than 75 years diagnosed with invasive, localised and operated breast cancer between 1995 and 2002. Flexible modelling of excess mortality hazard, including time-dependent (TD) regression parameters, was used to estimate mortality related to breast cancer. We derived a single “final” model using a backward selection procedure and evaluated its stability through sensitivity analyses using a bootstrap technique.ResultsWe analysed data from 1574 breast cancer women including 351 deaths (22.3%). The model building strategy retained age at diagnosis (TD), tumour size and grade (TD), chemotherapy and hormonal treatment (TD) as prognostic factors, while the sensitivity analysis on bootstrap samples identified nodes involvement and hormone receptors (TD) as additional long-term prognostic factors but did not identify chemotherapy and hormonal treatment as important prognostic factors.ConclusionsTwo main issues were observed when describing the determinants of long-term survival. First, the modelling strategy presented a lack of robustness, probably due to the limited number of events observed in our study. The second was the misspecification of the model, probably due to confounding by indication. Our results highlight the need for more detailed data and the use of causal inference methods.

[1]  E. Crocetti,et al.  Net survival: comparison between relative and cause-specific survival estimates. , 2001, Epidemiologia e prevenzione.

[2]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[3]  Bernard Rachet,et al.  A multilevel excess hazard model to estimate net survival on hierarchical data allowing for non‐linear and non‐proportional effects of covariates , 2016, Statistics in medicine.

[4]  P. Austin The use of propensity score methods with survival or time-to-event outcomes: reporting measures of effect similar to those used in randomized experiments , 2013, Statistics in medicine.

[5]  Patrick Royston,et al.  Multivariable Model-Building: A Pragmatic Approach to Regression Analysis based on Fractional Polynomials for Modelling Continuous Variables , 2008 .

[6]  Bernard Rachet,et al.  Analysing population-based cancer survival – settling the controversies , 2016, BMC Cancer.

[7]  J. Estève,et al.  Relative survival and the estimation of net survival: elements for further discussion. , 1990, Statistics in medicine.

[8]  F. Berrino,et al.  Time trends of breast cancer survival in Europe in relation to incidence and mortality , 2006, International journal of cancer.

[9]  J. Coebergh,et al.  Recent trends in cancer survival across Europe between 2000 and 2004: a model-based period analysis from 12 cancer registries. , 2008, European journal of cancer.

[10]  E. Sellier,et al.  Analysis of the effect of age on the prognosis of breast cancer , 2009, Breast Cancer Research and Treatment.

[11]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[12]  P. Royston,et al.  Multivariable model-building , 2008 .

[13]  M. Coleman,et al.  'Cure' from breast cancer among two populations of women followed for 23 years after diagnosis. , 2009, Annals of oncology : official journal of the European Society for Medical Oncology.

[14]  P Peduzzi,et al.  Importance of events per independent variable in proportional hazards analysis. I. Background, goals, and general strategy. , 1995, Journal of clinical epidemiology.

[15]  B. Bodai,et al.  Breast cancer survivorship: a comprehensive review of long-term medical issues and lifestyle recommendations. , 2015, The Permanente journal.

[16]  M. J. Laan,et al.  Targeted Learning: Causal Inference for Observational and Experimental Data , 2011 .

[17]  James M Robins,et al.  Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. , 2016, American journal of epidemiology.

[18]  F. Bray,et al.  Trends in cancer survival in 11 European populations from 1990 to 2009: a model-based analysis. , 2009, Annals of oncology : official journal of the European Society for Medical Oncology.

[19]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[20]  B. Rachet,et al.  No ‘cure’ within 12 years of diagnosis among breast cancer patients who are diagnosed via mammographic screening: women diagnosed in the West Midlands region of England 1989–2011 , 2016, Annals of oncology : official journal of the European Society for Medical Oncology.

[21]  Hermann Brenner,et al.  Are patients diagnosed with breast cancer before age 50 years ever cured? , 2004, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[22]  Willy Wynant,et al.  Impact of the model‐building strategy on inference about nonlinear and time‐dependent covariate effects in survival analysis , 2014, Statistics in medicine.

[23]  D. A. Kenny,et al.  Correlation and Causation. , 1982 .

[24]  P. Mahadevan,et al.  An overview , 2007, Journal of Biosciences.

[25]  M Vaeth,et al.  Simple parametric and nonparametric models for excess and relative mortality. , 1989, Biometrics.

[26]  Yun-Jiu Cheng,et al.  Long‐Term Cardiovascular Risk After Radiotherapy in Women With Breast Cancer , 2017, Journal of the American Heart Association.

[27]  J. Coebergh,et al.  An overview of prognostic factors for long-term survivors of breast cancer , 2007, Breast Cancer Research and Treatment.

[28]  K. Rygiel Cardiotoxic effects of radiotherapy and strategies to reduce them in patients with breast cancer: An overview , 2017, Journal of cancer research and therapeutics.

[29]  N. Wolmark,et al.  Time-Varying Effects of Breast Cancer Adjuvant Systemic Therapy. , 2016, Journal of the National Cancer Institute.

[30]  Stephen R Cole,et al.  The Parametric g-Formula for Time-to-event Data: Intuition and a Worked Example , 2014, Epidemiology.

[31]  Liang Li,et al.  On the propensity score weighting analysis with survival outcome: Estimands, estimation, and inference , 2018, Statistics in medicine.

[32]  Ruth H Keogh,et al.  Estimating long‐term treatment effects in observational data: A comparison of the performance of different methods under real‐world uncertainty , 2018, Statistics in medicine.

[33]  J. Gietema,et al.  Long-term outcome of cardiac function in a population-based cohort of breast cancer survivors: A cross-sectional study. , 2017, European journal of cancer.

[34]  L. Remontet,et al.  New insights into survival trend analyses in cancer population-based studies: the SUDCAN methodology , 2017, European journal of cancer prevention : the official journal of the European Cancer Prevention Organisation.

[35]  G. Launoy,et al.  Survival of cancer patients in France: a population-based study from The Association of the French Cancer Registries (FRANCIM). , 2007, European journal of cancer.

[36]  J. Estève,et al.  An overall strategy based on regression models to estimate relative survival and model the effects of prognostic factors in cancer survival studies , 2007, Statistics in medicine.

[37]  J. Concato,et al.  Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. , 1995, Journal of clinical epidemiology.

[38]  Stijn Vansteelandt,et al.  Structural nested models and G-estimation: the partially realized promise , 2014, 1503.01589.

[39]  Jong-Hyeon Jeong,et al.  Breast cancer adjuvant therapy: time to consider its time-dependent effects. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[40]  Douglas G Altman,et al.  Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study , 2010, BMC medical research methodology.

[41]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[42]  Helena Carreira,et al.  Global surveillance of cancer survival 1995–2009: analysis of individual data for 25 676 887 patients from 279 population-based registries in 67 countries (CONCORD-2) , 2015, The Lancet.