Multivariate modeling of complications with data driven variable selection: guarding against overfitting and effects of data set size.

PURPOSE Multivariate modeling of complications after radiotherapy is frequently used in conjunction with data driven variable selection. This study quantifies the risk of overfitting in a data driven modeling method using bootstrapping for data with typical clinical characteristics, and estimates the minimum amount of data needed to obtain models with relatively high predictive power. MATERIALS AND METHODS To facilitate repeated modeling and cross-validation with independent datasets for the assessment of true predictive power, a method was developed to generate simulated data with statistical properties similar to real clinical data sets. Characteristics of three clinical data sets from radiotherapy treatment of head and neck cancer patients were used to simulate data with set sizes between 50 and 1000 patients. A logistic regression method using bootstrapping and forward variable selection was used for complication modeling, resulting for each simulated data set in a selected number of variables and an estimated predictive power. The true optimal number of variables and true predictive power were calculated using cross-validation with very large independent data sets. RESULTS For all simulated data set sizes the number of variables selected by the bootstrapping method was on average close to the true optimal number of variables, but showed considerable spread. Bootstrapping is more accurate in selecting the optimal number of variables than the AIC and BIC alternatives, but this did not translate into a significant difference of the true predictive power. The true predictive power asymptotically converged toward a maximum predictive power for large data sets, and the estimated predictive power converged toward the true predictive power. More than half of the potential predictive power is gained after approximately 200 samples. Our simulations demonstrated severe overfitting (a predicative power lower than that of predicting 50% probability) in a number of small data sets, in particular in data sets with a low number of events (median: 7, 95th percentile: 32). Recognizing overfitting from an inverted sign of the estimated model coefficients has a limited discriminative value. CONCLUSIONS Despite considerable spread around the optimal number of selected variables, the bootstrapping method is efficient and accurate for sufficiently large data sets, and guards against overfitting for all simulated cases with the exception of some data sets with a particularly low number of events. An appropriate minimum data set size to obtain a model with high predictive power is approximately 200 patients and more than 32 events. With fewer data samples the true predictive power decreases rapidly, and for larger data set sizes the benefit levels off toward an asymptotic maximum predictive power.

[1]  Johannes A Langendijk,et al.  Bath and shower effects in the rat parotid gland explain increased relative risk of parotid gland dysfunction after intensity-modulated radiotherapy. , 2009, International journal of radiation oncology, biology, physics.

[2]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[3]  Joseph O Deasy,et al.  Radiotherapy dose-volume effects on salivary gland function. , 2010, International journal of radiation oncology, biology, physics.

[4]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[5]  Johannes A Langendijk,et al.  The impact of heart irradiation on dose-volume effects in the rat lung. , 2007, International journal of radiation oncology, biology, physics.

[6]  Joseph O Deasy,et al.  Predicting radiotherapy outcomes using statistical learning techniques , 2009, Physics in medicine and biology.

[7]  Joseph O Deasy,et al.  Quantitative Analyses of Normal Tissue Effects in the Clinic (QUANTEC): an introduction to the scientific issues. , 2010, International journal of radiation oncology, biology, physics.

[8]  H. Akaike A new look at the statistical model identification , 1974 .

[9]  Johannes A Langendijk,et al.  Potential benefits of scanned intensity-modulated proton therapy versus advanced photon therapy with regard to sparing of the salivary glands in oropharyngeal cancer. , 2011, International journal of radiation oncology, biology, physics.

[10]  T. Alonzo Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating By Ewout W. Steyerberg , 2009 .

[11]  M. Gapany Radiation-induced hypothyroidism in head and neck cancer patients: A systematic review , 2012 .

[12]  J. Deasy,et al.  Multivariable modeling of radiotherapy outcomes, including dose-volume and clinical factors. , 2006, International journal of radiation oncology, biology, physics.

[13]  J. Habbema,et al.  Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small data sets. , 2000, Statistics in medicine.

[14]  Johannes A Langendijk,et al.  Impact of statistical learning methods on the predictive power of multivariate normal tissue complication probability models. , 2012, International journal of radiation oncology, biology, physics.

[15]  Riccardo Calandrino,et al.  Intensity-modulated proton therapy versus helical tomotherapy in nasopharynx cancer: planning comparison and NTCP evaluation. , 2008, International journal of radiation oncology, biology, physics.

[16]  A Brahme,et al.  Tumour and normal tissue responses to fractionated non-uniform dose delivery. , 1992, International journal of radiation biology.

[17]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[18]  Riccardo Calandrino,et al.  Helical tomotherapy and intensity modulated proton therapy in the treatment of early stage prostate cancer: a treatment planning comparison. , 2011, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[19]  J. Deasy,et al.  Modeling radiation pneumonitis risk with clinical, dosimetric, and spatial parameters. , 2006, International journal of radiation oncology, biology, physics.

[20]  I El Naqa,et al.  Dose response explorer: an integrated open-source tool for exploring and modelling radiotherapy dose–volume outcome relationships , 2006, Physics in medicine and biology.

[21]  L. Marks,et al.  A genetic algorithm for variable selection in logistic regression analysis of radiotherapy treatment outcomes. , 2008, Medical physics.

[22]  G. Bedogni,et al.  Clinical Prediction Models—a Practical Approach to Development, Validation and Updating , 2009 .

[23]  Johannes A Langendijk,et al.  Ipsilateral irradiation for oral and oropharyngeal carcinoma treated with primary surgery and postoperative radiotherapy. , 2010, International journal of radiation oncology, biology, physics.

[24]  S. Bentzen Preventing or reducing late side effects of radiation therapy: radiobiology meets molecular pathology , 2006, Nature Reviews Cancer.

[25]  R. Lane,et al.  A test of the claim that plan rankings are determined by relative complication and tumor-control probabilities. , 1998, International journal of radiation oncology, biology, physics.

[26]  Laura Marucci,et al.  A novel dose constraint to reduce xerostomia in head-and-neck cancer patients treated with intensity-modulated radiotherapy. , 2010, International journal of radiation oncology, biology, physics.

[27]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[28]  Johannes A Langendijk,et al.  Development of NTCP models for head and neck cancer patients treated with three-dimensional conformal radiotherapy for xerostomia and sticky saliva: the role of dosimetric and clinical factors. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[29]  J M Slater,et al.  Normal tissue complication probability (NTCP) calculations as a means to compare proton and photon plans and evaluation of clinical appropriateness of calculated values , 2000, International journal of cancer.

[30]  Ulf Isacsson,et al.  Potential outcomes of modalities and techniques in radiotherapy for patients with hypopharyngeal carcinoma. , 2004, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[31]  M. Poulsen,et al.  Predictors of acute grade 4 swallowing toxicity in patients with stages III and IV squamous carcinoma of the head and neck treated with radiotherapy alone. , 2008, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[32]  Masumi Kadoya,et al.  Validation of the Total Dysphagia Risk Score (TDRS) as a predictive measure for acute swallowing dysfunction induced by chemoradiotherapy for head and neck cancers. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[33]  C. Burman,et al.  Calculation of complication probability factors for non-uniform normal tissue irradiation: the effective volume method. , 1989, International journal of radiation oncology, biology, physics.

[34]  J. Lyman Complication probability as assessed from dose-volume histograms. , 1985, Radiation research. Supplement.

[35]  Johannes A. Langendijk,et al.  A predictive model for swallowing dysfunction after curative radiotherapy in head and neck cancer , 2011 .

[36]  Issam El Naqa,et al.  Heart irradiation as a risk factor for radiation pneumonitis , 2011, Acta oncologica.

[37]  Hao Wu,et al.  Open Access Research , 2022 .

[38]  Johannes A Langendijk,et al.  Predictive modelling for swallowing dysfunction after primary (chemo)radiation: results of a prospective observational study. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[39]  Harm Meertens,et al.  Volume effects and region-dependent radiosensitivity of the parotid gland. , 2005, International journal of radiation oncology, biology, physics.

[40]  Harm Meertens,et al.  Radiation damage to the heart enhances early radiation-induced lung function loss. , 2005, Cancer research.