Applying reinforcement learning techniques to detect hepatocellular carcinoma under limited screening capacity

We investigate the problem faced by a healthcare system that must allocate constrained screening resources across a population at risk of developing a disease. A patient's risk of developing the disease depends on the patient's biomedical dynamics, which are not known in advance and must be learned by the system over time. We design three classes of reinforcement learning policies to address this problem of simultaneously gathering and using information across multiple patients. In a case study based on screening for hepatocellular carcinoma (HCC), we optimize each of the three classes of policies using the indifference zone method. We build a simulation to gauge the performance of these policies and compare it with current practice. We then demonstrate how the benefits of learning-based screening policies differ across levels of resource scarcity and provide metrics of policy performance.
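As a rough illustration of the trade-off between gathering and using information under a screening capacity limit, the Python sketch below uses a generic epsilon-greedy rule. It is not one of the three policy classes studied here; the function names, the `epsilon` and `step_size` parameters, and the scalar risk estimates are all assumptions made for the example.

```python
import random


def select_patients(risk_estimates, capacity, epsilon, rng):
    """Choose which patients to screen this period (hypothetical helper).

    For each of the `capacity` screening slots, explore with probability
    `epsilon` by picking a random unscreened patient; otherwise exploit by
    picking the unscreened patient with the highest current risk estimate.
    """
    remaining = list(range(len(risk_estimates)))
    chosen = []
    for _ in range(min(capacity, len(remaining))):
        if rng.random() < epsilon:
            pick = rng.choice(remaining)  # explore: learn about any patient
        else:
            pick = max(remaining, key=lambda i: risk_estimates[i])  # exploit
        remaining.remove(pick)
        chosen.append(pick)
    return chosen


def update_estimate(old_estimate, observation, step_size=0.2):
    """Move a patient's risk estimate toward the latest screening observation.

    This is a plain exponential-smoothing update, assumed for illustration.
    """
    return old_estimate + step_size * (observation - old_estimate)


if __name__ == "__main__":
    rng = random.Random(0)
    estimates = [0.1, 0.9, 0.5, 0.3]  # made-up starting risk estimates
    screened = select_patients(estimates, capacity=2, epsilon=0.1, rng=rng)
    print("screened patients:", screened)
```

With `epsilon` set to 0 the rule always screens the patients currently believed to be at highest risk; a larger `epsilon` spends more of the limited capacity on learning about patients whose risk is poorly known.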
