Missing covariate data within cancer prognostic studies: a review of current reporting and proposed guidelines

Prognostic models play a crucial role in the clinical decision-making process. Unfortunately, missing covariate data impede the construction of valid and reliable models, potentially introducing bias, if handled inappropriately. The extent of missing covariate data within reported cancer prognostic studies, the current handling and the quality of reporting this missing covariate data are unknown. Therefore, a review was conducted of 100 articles reporting multivariate survival analyses to assess potential prognostic factors, published within seven cancer journals in 2002. Missing covariate data is a common occurrence in studies performing multivariate survival analyses, being apparent in 81 of the 100 articles reviewed. The percentage of eligible cases with complete data was obtainable in 39 articles, and was <90% in 17 of these articles. The methods used to handle incomplete covariates were obtainable in 32 of the 81 articles with known missing data and the most commonly reported approaches were complete case and available case analysis. This review has highlighted deficiencies in the reporting of missing covariate data. Guidelines for presenting prognostic studies with missing covariate data are proposed, which if followed should clarify and standardise the reporting in future articles.

[1]  W Vach,et al.  Some issues in estimating the effect of prognostic factors from incomplete covariate data. , 1997, Statistics in medicine.

[2]  J. Concato,et al.  Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. , 1995, Journal of clinical epidemiology.

[3]  S Greenland,et al.  A critical look at methods for handling missing covariates in epidemiologic regression analyses. , 1995, American journal of epidemiology.

[4]  P C Lambert,et al.  Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future , 2003, British Journal of Cancer.

[5]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[6]  F. Harrell,et al.  USING FULL PROBABILITY MODELS TO COMPUTE PROBABILITIES OF ACTUAL INTEREST TO DECISION MAKERS , 2001, International Journal of Technology Assessment in Health Care.

[7]  D. Osoba,et al.  Guidelines for reporting results of quality of life assessments in clinical trials , 1996, Quality of Life Research.

[8]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[9]  D. G. Altman,et al.  Statistical aspects of prognostic factor studies in oncology. , 1994, British Journal of Cancer.

[10]  D. G. Altman,et al.  Review of survival analyses published in cancer journals. , 1995, British Journal of Cancer.

[11]  Douglas G. Altman,et al.  Methodological challenges in the evaluation of prognostic factors in breast cancer , 2004, Breast Cancer Research and Treatment.

[12]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[13]  Z Lou,et al.  Tree-structured prediction for censored survival data and the Cox model. , 1995, Journal of clinical epidemiology.

[14]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.