Assessing the goodness of fit of personal risk models

We describe a flexible family of tests for evaluating the goodness of fit (calibration) of a pre-specified personal risk model to the outcomes observed in a longitudinal cohort. Such evaluation involves using the risk model to assign each subject an absolute risk of developing the outcome within a given time from cohort entry and comparing subjects' assigned risks with their observed outcomes. This comparison involves several issues. For example, subjects followed only for part of the risk period have unknown outcomes. Moreover, existing tests do not reveal the reasons for poor model fit when it occurs, which can reflect misspecification of the model's hazards for the competing risks of outcome development and death. To address these issues, we extend the model-specified hazards for outcome and death, and use score statistics to test the null hypothesis that the extensions are unnecessary. Simulated cohort data applied to risk models whose outcome and mortality hazards agreed and disagreed with those generating the data show that the tests are sensitive to poor model fit, provide insight into the reasons for poor fit, and accommodate a wide range of model misspecification. We illustrate the methods by examining the calibration of two breast cancer risk models as applied to a cohort of participants in the Breast Cancer Family Registry. The methods can be implemented using the Risk Model Assessment Program, an R package freely available at http://stanford.edu/~ggong/rmap/.

[1]  N. Breslow,et al.  Analysis of Survival Data under the Proportional Hazards Model , 1975 .

[2]  M. Gail,et al.  Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. , 1989, Journal of the National Cancer Institute.

[3]  Stephen W Duffy,et al.  A breast cancer prediction model incorporating familial and personal risk factors , 2004, Hereditary Cancer in Clinical Practice.

[4]  Konstantin Strauch,et al.  Breast cancer risk assessment across the risk continuum: genetic and nongenetic risk factors contributing to differential model performance , 2012, Breast Cancer Research.

[5]  Melissa Bondy,et al.  Projecting individualized absolute invasive breast cancer risk in African American women. , 2007, Journal of the National Cancer Institute.

[6]  J Benichou,et al.  Validation studies for models projecting the risk of invasive and total breast cancer incidence. , 1999, Journal of the National Cancer Institute.

[7]  D. Cox Regression Models and Life-Tables , 1972 .

[8]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data: Kalbfleisch/The Statistical , 2002 .

[9]  Norman Boyd,et al.  The Breast Cancer Family Registry: an infrastructure for cooperative multinational, interdisciplinary and translational studies of the genetic epidemiology of breast cancer , 2004, Breast Cancer Research.

[10]  D. Hosmer,et al.  Goodness of fit tests for the multiple logistic regression model , 1980 .