Flexible and structured survival model for a simultaneous estimation of non-linear and non-proportional effects and complex interactions between continuous variables: Performance of this multidimensional penalized spline approach in net survival trend analysis

Cancer survival trend analyses are essential to describe accurately the way medical practices impact patients’ survival according to the year of diagnosis. To this end, survival models should be able to account simultaneously for non-linear and non-proportional effects and for complex interactions between continuous variables. However, in the statistical literature, there is no consensus yet on how to build such models that should be flexible but still provide smooth estimates of survival. In this article, we tackle this challenge by smoothing the complex hypersurface (time since diagnosis, age at diagnosis, year of diagnosis, and mortality hazard) using a multidimensional penalized spline built from the tensor product of the marginal bases of time, age, and year. Considering this penalized survival model as a Poisson model, we assess the performance of this approach in estimating the net survival with a comprehensive simulation study that reflects simple and complex realistic survival trends. The bias was generally small and the root mean squared error was good and often similar to that of the true model that generated the data. This parametric approach offers many advantages and interesting prospects (such as forecasting) that make it an attractive and efficient tool for survival trend analyses.

[1]  A. F. Militino,et al.  ON PREDICTING CANCER MORTALITY USING ANOVA-TYPE P-SPLINE MODELS , 2015 .

[2]  B. Marx,et al.  Multivariate calibration with temperature interaction using two-dimensional penalized signal regression , 2003 .

[3]  Willy Wynant,et al.  Impact of the model‐building strategy on inference about nonlinear and time‐dependent covariate effects in survival analysis , 2014, Statistics in medicine.

[4]  Paul H. C. Eilers,et al.  Multidimensional Penalized Signal Regression , 2005, Technometrics.

[5]  M. D. Ugarte,et al.  Projections of cancer mortality risks using spatio-temporal P-spline models , 2012, Statistical methods in medical research.

[6]  Carmen Cadarso-Suárez,et al.  Model building in nonproportional hazard regression , 2013, Statistics in medicine.

[7]  Giampiero Marra,et al.  Penalised regression splines: theory and application to medical research , 2010, Statistical methods in medical research.

[8]  Paul H. C. Eilers,et al.  Smoothing and forecasting mortality rates , 2004 .

[9]  L. Remontet,et al.  Probabilities of dying from cancer and other causes in French cancer patients based on an unbiased estimator of net survival: a study of five common cancers. , 2013, Cancer epidemiology.

[10]  C. J. Stone,et al.  Hazard Regression , 2022 .

[11]  Michael J Crowther,et al.  Simulating biologically plausible complex survival data , 2013, Statistics in medicine.

[12]  David Ruppert,et al.  Semiparametric regression during 2003-2007. , 2009, Electronic journal of statistics.

[13]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[14]  Douglas Nychka,et al.  Bayesian Confidence Intervals for Smoothing Splines , 1988 .

[15]  J. Estève,et al.  An overall strategy based on regression models to estimate relative survival and model the effects of prognostic factors in cancer survival studies , 2007, Statistics in medicine.

[16]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[17]  A. Bouvier,et al.  Survie des personnes atteintes de cancers solides en France métropolitaine, 1989–2013 , 2016 .

[18]  Virgilio Gómez-Rubio,et al.  Generalized Additive Models: An Introduction with R (2nd Edition) , 2018 .

[19]  Willi Sauerbrei,et al.  Comparison of procedures to assess non‐linear and time‐varying effects in multivariable models for survival data , 2011, Biometrical journal. Biometrische Zeitschrift.

[20]  M. Coleman,et al.  Cancer survival in Europe 1999-2007 by country and age: results of EUROCARE--5-a population-based study. , 2014, The Lancet. Oncology.

[21]  Bernard Rachet,et al.  A multilevel excess hazard model to estimate net survival on hierarchical data allowing for non‐linear and non‐proportional effects of covariates , 2016, Statistics in medicine.

[22]  P. Royston,et al.  A New Proposal for Multivariable Modelling of Time‐Varying Effects in Survival Data Based on Fractional Polynomial Time‐Transformation , 2007, Biometrical journal. Biometrische Zeitschrift.

[23]  Mike Quinn,et al.  Standard cancer patient population for age standardising survival ratios. , 2004, European journal of cancer.

[24]  G. Launoy,et al.  Survival of cancer patients in France: a population-based study from The Association of the French Cancer Registries (FRANCIM). , 2007, European journal of cancer.

[25]  H. Becher,et al.  Using Penalized Splines to Model Age‐ and Season‐of‐Birth‐Dependent Effects of Childhood Mortality Risk Factors in Rural Burkina Faso , 2009, Biometrical journal. Biometrische Zeitschrift.

[26]  Laurent Remontet,et al.  Estimating net survival: the importance of allowing for informative censoring , 2012, Statistics in medicine.

[27]  Mats Lambe,et al.  Estimating the loss in expectation of life due to cancer using flexible parametric survival models , 2013, Statistics in medicine.

[28]  M. Wand,et al.  Semiparametric Regression: Parametric Regression , 2003 .

[29]  Frank E. Harrell,et al.  The restricted cubic spline hazard model , 1990 .

[30]  Giampiero Marra,et al.  Practical variable selection for generalized additive models , 2011, Comput. Stat. Data Anal..

[31]  S. Wood,et al.  GAMs with integrated model selection using penalized regression splines and applications to environmental modelling , 2002 .

[32]  R. De Angelis,et al.  Changes in dynamics of excess mortality rates and net survival after diagnosis of follicular lymphoma or diffuse large B-cell lymphoma: comparison between European population-based data (EUROCARE-5). , 2015, The Lancet. Haematology.

[33]  Paul W Dickman,et al.  Regression models for relative survival , 2004, Statistics in medicine.

[34]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[35]  P. Royston,et al.  Flexible parametric proportional‐hazards and proportional‐odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects , 2002, Statistics in medicine.

[36]  L. Remontet,et al.  New insights into survival trend analyses in cancer population-based studies: the SUDCAN methodology , 2017, European journal of cancer prevention : the official journal of the European Cancer Prevention Organisation.

[37]  Robert Gray,et al.  Flexible Methods for Analyzing Survival Data Using Splines, with Applications to Breast Cancer Prognosis , 1992 .

[38]  F. Ederer,et al.  The relative survival rate: a statistical methodology. , 1961, National Cancer Institute monograph.

[39]  B. Rachet,et al.  Development of a cross-cultural deprivation index in five European countries , 2015, Journal of Epidemiology and Community Health.

[40]  Patrick Royston,et al.  Multivariable Model-Building: A Pragmatic Approach to Regression Analysis based on Fractional Polynomials for Modelling Continuous Variables , 2008 .

[41]  Helena Carreira,et al.  Global surveillance of cancer survival 1995–2009: analysis of individual data for 25 676 887 patients from 279 population-based registries in 67 countries (CONCORD-2) , 2015, The Lancet.

[42]  Yudi Pawitan,et al.  Parametric and penalized generalized survival models , 2018, Statistical methods in medical research.

[43]  Göran Kauermann,et al.  Penalized spline smoothing in multivariable survival models with varying coefficients , 2005, Comput. Stat. Data Anal..

[44]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[45]  F E Harrell,et al.  The restricted cubic spline as baseline hazard in the proportional hazards model with step function time-dependent covariables. , 1995, Statistics in medicine.

[46]  S. Wood Low‐Rank Scale‐Invariant Tensor Product Smooths for Generalized Additive Mixed Models , 2006, Biometrics.

[47]  Michael J Crowther,et al.  A general framework for parametric survival analysis , 2014, Statistics in medicine.

[48]  Catherine Quantin,et al.  A relative survival regression model using B‐spline functions to model non‐proportional hazards , 2003, Statistics in medicine.

[49]  M. Abrahamowicz,et al.  Joint estimation of time‐dependent and non‐linear effects of continuous covariates on survival , 2007, Statistics in medicine.

[50]  Janez Stare,et al.  On Estimation in Relative Survival , 2012, Biometrics.

[51]  P. Grosclaude,et al.  Changes in the risk of death from cancer up to five years after diagnosis in elderly patients: A study of five common cancers , 2010, International journal of cancer.