Calibration and Validation of the Colorectal Cancer and Adenoma Incidence and Mortality (CRC-AIM) Microsimulation Model Using Deep Neural Networks

Objectives Machine learning (ML)–based emulators improve the calibration of decision-analytical models, but their performance in complex microsimulation models is yet to be determined. Methods We demonstrated the use of an ML-based emulator with the Colorectal Cancer (CRC)-Adenoma Incidence and Mortality (CRC-AIM) model, which includes 23 unknown natural history input parameters to replicate the CRC epidemiology in the United States. We first generated 15,000 input combinations and ran the CRC-AIM model to evaluate CRC incidence, adenoma size distribution, and the percentage of small adenoma detected by colonoscopy. We then used this data set to train several ML algorithms, including deep neural network (DNN), random forest, and several gradient boosting variants (i.e., XGBoost, LightGBM, CatBoost) and compared their performance. We evaluated 10 million potential input combinations using the selected emulator and examined input combinations that best estimated observed calibration targets. Furthermore, we cross-validated outcomes generated by the CRC-AIM model with those made by CISNET models. The calibrated CRC-AIM model was externally validated using the United Kingdom Flexible Sigmoidoscopy Screening Trial (UKFSST). Results The DNN with proper preprocessing outperformed other tested ML algorithms and successfully predicted all 8 outcomes for different input combinations. It took 473 s for the trained DNN to predict outcomes for 10 million inputs, which would have required 190 CPU-years without our DNN. The overall calibration process took 104 CPU-days, which included building the data set, training, selecting, and hyperparameter tuning of the ML algorithms. While 7 input combinations had acceptable fit to the targets, a combination that best fits all outcomes was selected as the best vector. Almost all of the predictions made by the best vector laid within those from the CISNET models, demonstrating CRC-AIM’s cross-model validity. Similarly, CRC-AIM accurately predicted the hazard ratios of CRC incidence and mortality as reported by UKFSST, demonstrating its external validity. Examination of the impact of calibration targets suggested that the selection of the calibration target had a substantial impact on model outcomes in terms of life-year gains with screening. Conclusions Emulators such as a DNN that is meticulously selected and trained can substantially reduce the computational burden of calibrating complex microsimulation models. Highlights Calibrating a microsimulation model, a process to find unobservable parameters so that the model fits observed data, is computationally complex. We used a deep neural network model, a popular machine learning algorithm, to calibrate the Colorectal Cancer Adenoma Incidence and Mortality (CRC-AIM) model. We demonstrated that our approach provides an efficient and accurate method to significantly speed up calibration in microsimulation models. The calibration process successfully provided cross-model validation of CRC-AIM against 3 established CISNET models and also externally validated against a randomized controlled trial.

[1]  A. Jemal,et al.  Annual report to the nation on the status of cancer, part 1: National cancer statistics , 2022, Cancer.

[2]  Timothy M Wolock,et al.  Seq2Seq Surrogates of Epidemic Models to Facilitate Bayesian Inference , 2022, AAAI.

[3]  T. Ayer,et al.  Cost-Effectiveness and Value-of-Information Analysis Using Machine Learning–Based Metamodeling: A Case of Hepatitis C Treatment , 2022, Medical decision making : an international journal of the Society for Medical Decision Making.

[4]  Jiong Cai,et al.  Microsimulation Model for Prevention and Intervention of Coloretal Cancer in China (MIMIC-CRC): Development, Calibration, Validation, and Application , 2022, Frontiers in Oncology.

[5]  S. Kudo,et al.  Cost-effectiveness of artificial intelligence for screening colonoscopy: a modelling study. , 2022, The Lancet. Digital health.

[6]  S. Chrysanthopoulou,et al.  Microsimulation Model Calibration with Approximate Bayesian Computation in R: A Tutorial , 2022, Medical decision making : an international journal of the Society for Medical Decision Making.

[7]  Canberra,et al.  Bayesian calibration of simulation models: A tutorial and an Australian smoking behaviour model , 2022, 2202.02923.

[8]  Jonathan Ozik,et al.  Sequentially calibrating a Bayesian microsimulation model to incorporate new information and assumptions , 2022, BMC Medical Informatics and Decision Making.

[9]  S. Whyte,et al.  Calibrating Natural History of Cancer Models in the Presence of Data Incompatibility: Problems and Solutions , 2022, PharmacoEconomics.

[10]  P. Nyasulu,et al.  Inferring HIV Transmission Network Determinants Using Agent-Based Models Calibrated to Multi-Data Sources , 2021, Mathematics.

[11]  L. Breiman Random Forests , 2001, Machine Learning.

[12]  D. T. Volpatto,et al.  A generalised SEIRD model with implicit social distancing mechanism: A Bayesian approach for the identification of the spread of COVID-19 with applications in Brazil and Rio de Janeiro state , 2021, J. Simulation.

[13]  H. Koffijberg,et al.  Choosing a Metamodel of a Simulation Model for Uncertainty Quantification , 2021, Medical decision making : an international journal of the Society for Medical Decision Making.

[14]  Amy B. Knudsen,et al.  Colorectal Cancer Screening: An Updated Modeling Study for the US Preventive Services Task Force. , 2021, JAMA.

[15]  M. Cabana,et al.  Screening for Colorectal Cancer: US Preventive Services Task Force Recommendation Statement. , 2021, JAMA.

[16]  Charles J. Kahi,et al.  ACG Clinical Guidelines: Colorectal Cancer Screening 2021. , 2021, The American journal of gastroenterology.

[17]  Thomas A. Smith,et al.  Emulator-based Bayesian optimization for efficient multi-objective calibration of an individual-based model of malaria , 2021, Nature Communications.

[18]  T. Hladish,et al.  Calibration of models to data: a comparison of methods , 2020, bioRxiv.

[19]  Fernando Alarid-Escudero,et al.  BayCANN: Streamlining Bayesian Calibration With Artificial Neural Network Metamodeling , 2020, Frontiers in Physiology.

[20]  Rushil Anirudh,et al.  Accurate Calibration of Agent-based Epidemiological Models with Neural Network Surrogates , 2020, Healthcare AI and COVID-19 Workshop.

[21]  J. Lew,et al.  Validation of Microsimulation Models against Alternative Model Predictions and Long-Term Colorectal Cancer Incidence and Mortality Outcomes of Randomized Controlled Trials , 2020, Medical decision making : an international journal of the Society for Medical Decision Making.

[22]  Douglas K Owens,et al.  Methods for Model Calibration under High Uncertainty: Modeling Cholera in Bangladesh , 2020, Medical decision making : an international journal of the Society for Medical Decision Making.

[23]  C. Angione,et al.  Using Machine Learning to Emulate Agent-Based Simulations , 2020, ArXiv.

[24]  M. Weinstein,et al.  Novel microsimulation model of tobacco use behaviours and outcomes: calibration and validation in a US population , 2020, BMJ Open.

[25]  S. Whyte,et al.  External Validation of a Colorectal Cancer Model Against Screening Trial Long-Term Follow-Up Data. , 2019, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[26]  G. Zarkin,et al.  Estimating Long-Term Drinking Patterns for People with Lifetime Alcohol Use Disorder , 2019, Medical decision making : an international journal of the Society for Medical Decision Making.

[27]  Wim Delva,et al.  Calibration of individual-based models to epidemiological data: A systematic review , 2019, PLoS Comput. Biol..

[28]  Boyang Li,et al.  NormLime: A New Feature Importance Metric for Explaining Deep Neural Networks , 2019, ArXiv.

[29]  Nan Kong,et al.  Multiobjective Calibration of Disease Simulation Models Using Gaussian Processes , 2019, Medical decision making : an international journal of the Society for Medical Decision Making.

[30]  E. Arias United States Life Tables, 2017. , 2019, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[31]  Zhenhua Wang,et al.  Development of an Empirically Calibrated Model of Esophageal Squamous Cell Carcinoma in High-Risk Regions , 2019, BioMed research international.

[32]  D. Dowdy,et al.  Impact of Providing Preexposure Prophylaxis for Human Immunodeficiency Virus at Clinics for Sexually Transmitted Infections in Baltimore City: An Agent-based Model , 2018, Sexually transmitted diseases.

[33]  Brandon D. L. Marshall,et al.  Effect of Racial Inequities in Pre-exposure Prophylaxis Use on Racial Disparities in HIV Incidence Among Men Who Have Sex With Men: A Modeling Study , 2018, Journal of acquired immune deficiency syndromes.

[34]  Anna Veronika Dorogush,et al.  CatBoost: gradient boosting with categorical features support , 2018, ArXiv.

[35]  Fernando Alarid-Escudero,et al.  Nonidentifiability in Model Calibration and Implications for Medical Decision Making , 2018, Medical decision making : an international journal of the Society for Medical Decision Making.

[36]  A. Galvani,et al.  Potential effectiveness of long-acting injectable pre-exposure prophylaxis for HIV prevention in men who have sex with men: a modelling study. , 2018, The lancet. HIV.

[37]  S. Goodreau,et al.  Development of an Agent-Based Model to Investigate the Impact of HIV Self-Testing Programs on Men Who Have Sex With Men in Atlanta and Seattle , 2018, JMIR public health and surveillance.

[38]  C. Flowers,et al.  Colorectal cancer screening for average‐risk adults: 2018 guideline update from the American Cancer Society , 2018, CA: a cancer journal for clinicians.

[39]  Jonathan Ozik,et al.  MICROSIMULATION MODEL CALIBRATION USING INCREMENTAL MIXTURE APPROXIMATE BAYESIAN COMPUTATION. , 2018, The annals of applied statistics.

[40]  Sanku Dey,et al.  The Fréchet distribution: Estimation and application - An overview , 2018, Journal of Statistics and Management Systems.

[41]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[42]  M. Lipsitch,et al.  Displacement of sexual partnerships in trials of sexual behavior interventions: A model-based assessment of consequences. , 2017, Epidemics.

[43]  Philip S Rosenberg,et al.  Colorectal Cancer Incidence Patterns in the United States, 1974–2013 , 2017, Journal of the National Cancer Institute.

[44]  J. Lew,et al.  Long-term evaluation of benefits, harms, and cost-effectiveness of the National Bowel Cancer Screening Program in Australia: a modelling study. , 2017, The Lancet. Public health.

[45]  I. Kralj-Hans,et al.  Long term effects of once-only flexible sigmoidoscopy screening after 17 years of follow-up: the UK Flexible Sigmoidoscopy Screening randomised controlled trial , 2017, The Lancet.

[46]  Nicolas A. Menzies,et al.  Bayesian Methods for Calibrating Health Policy Models: A Tutorial , 2017, PharmacoEconomics.

[47]  Alexander M. Hainen,et al.  Comparative study of simulated annealing, tabu search, and the genetic algorithm for calibration of the microsimulation model , 2017, Simul..

[48]  Oguzhan Alagoz,et al.  Using Active Learning for Speeding up Calibration in Simulation Models , 2016, Medical decision making : an international journal of the Society for Medical Decision Making.

[49]  Karen M Kuntz,et al.  Estimation of Benefits, Burden, and Harms of Colorectal Cancer Screening Strategies: Modeling Study for the US Preventive Services Task Force. , 2016, JAMA.

[50]  M. Phipps,et al.  Screening for Colorectal Cancer: US Preventive Services Task Force Recommendation Statement. , 2016, JAMA.

[51]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[52]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[53]  Tracey L Marsh,et al.  Validation of Models Used to Inform Colorectal Cancer Screening Guidelines , 2016, Medical decision making : an international journal of the Society for Medical Decision Making.

[54]  Virginia Moyer,et al.  The US Preventive Services Task Force , 2015, North Carolina Medical Journal.

[55]  M. Vieth,et al.  Serrated lesions in colorectal cancer screening: detection, resection, pathology and surveillance , 2015, Gut.

[56]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[57]  N. Segnan,et al.  Multitarget stool DNA testing for colorectal-cancer screening. , 2014, The New England journal of medicine.

[58]  E. Kuipers,et al.  Modeling the Adenoma and Serrated Pathway to Colorectal CAncer (ASCCA) , 2014, Risk analysis : an official publication of the Society for Risk Analysis.

[59]  Alois Knoll,et al.  Gradient boosting machines, a tutorial , 2013, Front. Neurorobot..

[60]  Eric J Feuer,et al.  Secular trends in colon and rectal cancer relative survival. , 2013, Journal of the National Cancer Institute.

[61]  Joseph A Hill United States Life Tables , 2013 .

[62]  James L. Beck,et al.  Bayesian inversion for finite fault earthquake source models I—theory and algorithm , 2013 .

[63]  T. Smyrk,et al.  Serrated colon polyps as precursors to colorectal cancer. , 2013, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[64]  Bruce H Fireman,et al.  Variation of adenoma prevalence by age, sex, race, and colon location in a large population: implications for screening and quality programs. , 2013, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[65]  H. Brenner,et al.  Sojourn time of preclinical colorectal cancer by sex and age: estimates from the German national screening colonoscopy database. , 2011, American journal of epidemiology.

[66]  E. Tjørve,et al.  A unified approach to the Richards-model family for use in growth analyses: why we need only two model forms. , 2010, Journal of theoretical biology.

[67]  A. Levy,et al.  The cost-effectiveness of screening for colorectal cancer , 2010, Canadian Medical Association Journal.

[68]  Cancer,et al.  Once-only flexible sigmoidoscopy screening in prevention of colorectal cancer: a multicentre randomised controlled trial , 2010, The Lancet.

[69]  G. Gazelle,et al.  Cost-effectiveness of breast MR imaging and screen-film mammography for screening BRCA1 gene mutation carriers. , 2010, Radiology.

[70]  A. Jemal,et al.  Annual report to the nation on the status of cancer, 1975‐2006, featuring colorectal cancer trends and impact of interventions (risk factors, screening, and treatment) to reduce future rates , 2010, Cancer.

[71]  J. Savarino,et al.  Bayesian Calibration of Microsimulation Models , 2009, Journal of the American Statistical Association.

[72]  G Scott Gazelle,et al.  Calibration of disease simulation model using an engineering approach. , 2009, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[73]  D. Lieberman,et al.  Polyp size and advanced histology in patients undergoing colonoscopy screening: implications for CT colonography. , 2008, Gastroenterology.

[74]  Perry J Pickhardt,et al.  The Cost-Effectiveness of CT Colonography in Screening for Colorectal Neoplasia , 2007, The American Journal of Gastroenterology.

[75]  P. Tappenden,et al.  Option appraisal of population-based colorectal cancer screening programmes in England , 2006, Gut.

[76]  Y. Yamaji,et al.  The Malignant Potential of Freshly Developed Colorectal Polyps According to Age , 2006, Cancer Epidemiology Biomarkers & Prevention.

[77]  U. Ladabaum,et al.  Projected national impact of colorectal cancer screening on clinical and economic outcomes and health services demand. , 2005, Gastroenterology.

[78]  R. Wolff,et al.  Evaluation of a large, population-based sample supports a CpG island methylator phenotype in colon cancer. , 2005, Gastroenterology.

[79]  Yen Lin Chia,et al.  Simulation-based parameter estimation for complex models: a breast cancer natural history modelling illustration , 2004, Statistical methods in medical research.

[80]  Ann G Zauber,et al.  National Polyp Study data: Evidence for regression of adenomas , 2004, International journal of cancer.

[81]  J. Church Clinical significance of small colorectal polyps , 2004, Diseases of the colon and rectum.

[82]  P. Pickhardt,et al.  Computed tomographic virtual colonoscopy to screen for colorectal neoplasia in asymptomatic adults. , 2003, The New England journal of medicine.

[83]  F. J. Davis,et al.  Latin hypercube sampling and the propagation of uncertainty in analyses of complex systems , 2002, Reliab. Eng. Syst. Saf..

[84]  Milton C Weinstein,et al.  Empirically calibrated model of hepatitis C virus infection in the United States. , 2002, American journal of epidemiology.

[85]  R. Ward,et al.  CpG island methylation in sporadic colorectal cancers and its relationship to microsatellite instability. , 2002, Gastroenterology.

[86]  J. Wardle,et al.  Single flexible sigmoidoscopy screening to prevent colorectal cancer: baseline findings of a UK multicentre randomised trial , 2002, The Lancet.

[87]  J. Friedman Stochastic gradient boosting , 2002 .

[88]  Grace Huang,et al.  Aspirin as an Adjunct to Screening for Prevention of Sporadic Colorectal Cancer: A Cost-Effectiveness Analysis , 2001, Annals of Internal Medicine.

[89]  D. Ransohoff,et al.  Risk of advanced proximal neoplasms in asymptomatic adults according to the distal colorectal findings. , 2000, The New England journal of medicine.

[90]  D C McCrory,et al.  Mathematical model for the natural history of human papillomavirus infection and cervical carcinogenesis. , 2000, American journal of epidemiology.

[91]  T. Prevost,et al.  Evaluation of a selective screening for colorectal carcinoma , 1999, Cancer.

[92]  G. David Garson,et al.  Interpreting neural-network connection weights , 1991 .

[93]  Balaji Krishnamurthy,et al.  Differentiable agent-based epidemiological modeling for end-to-end learning , 2022 .

[94]  Colorectal Cancer Screening in Average-Risk Adults , 2019, Annals of Internal Medicine.

[95]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[96]  Wolfram C. Rinke,et al.  An Algorithm to Transform an Artificial Neural Network into its Open Equation Form and its Potential Applications , 2015 .

[97]  Natasha K. Stout,et al.  Calibration Methods Used in Cancer Simulation Models and Suggested Reporting Guidelines , 2012, PharmacoEconomics.

[98]  E. G. Hahn,et al.  Risk of invasive carcinoma in colorectal adenomas assessed by size and site , 1997, International Journal of Colorectal Disease.

[99]  A. T. C. Goh,et al.  Back-propagation neural networks for modeling complex systems , 1995, Artif. Intell. Eng..

[100]  S Kamen,et al.  The task force. , 1976, Journal of hospital dental practice.