Predicting colorectal polyp recurrence using time-to-event analysis of medical records

Identifying patient characteristics that influence the rate of colorectal polyp recurrence can provide important insights into which patients are at higher risk for recurrence. We used natural language processing to extract polyp morphological characteristics from 953 polyp-presenting patients' electronic medical records. We used subsequent colonoscopy reports to examine how the time to polyp recurrence (731 patients experienced recurrence) is influenced by these characteristics as well as anthropometric features using Kaplan-Meier curves, Cox proportional hazards modeling, and random survival forest models. We found that the rate of recurrence differed significantly by polyp size, number, and location and patient smoking status. Additionally, right-sided colon polyps increased recurrence risk by 30% compared to left-sided polyps. History of tobacco use increased polyp recurrence risk by 20% compared to never-users. A random survival forest model showed an AUC of 0.65 and identified several other predictive variables, which can inform development of personalized polyp surveillance plans.

[1]  A. Dreher Modeling Survival Data Extending The Cox Model , 2016 .

[2]  P. Albert,et al.  The Association Between Cigarette Smoking and Colorectal Polyp Recurrence (United States) , 2005, Cancer Causes & Control.

[3]  J. Viel,et al.  Predictors of Colorectal Polyp Recurrence after the First Polypectomy in Private Practice Settings: A Cohort Study , 2012, PloS one.

[4]  M. Wallace,et al.  The Effect of Polyp Location and Patient Gender on the Presence of Dysplasia in Colonic Polyps , 2012, Clinical and Translational Gastroenterology.

[5]  M. Ebert,et al.  Risk Factors for Local Recurrence of Large, Flat Colorectal Polyps after Endoscopic Mucosal Resection , 2016, Digestion.

[6]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[7]  D. Ahnen,et al.  Adenomatous Polyps of the Colon , 2006 .

[8]  E. Kuipers,et al.  Features of adenoma and colonoscopy associated with recurrent colorectal neoplasia based on a large community-based study. , 2013, Gastroenterology.

[9]  D. Alberts,et al.  Smoking exposure as a risk factor for prevalent and recurrent colorectal adenomas. , 2003, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[10]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[11]  Rupert G. Miller,et al.  Survival Analysis , 2022, The SAGE Encyclopedia of Research Design.

[12]  J. Hurley,et al.  The molecular genetics of colorectal cancer , 2013, Frontline Gastroenterology.

[13]  A. Papalambros,et al.  Predictors of survival in stage IV metastatic colorectal cancer. , 2010, Anticancer research.

[14]  S. Baek Laterality: Right-Sided and Left-Sided Colon Cancer , 2017, Annals of coloproctology.

[15]  Thea D. Tlsty,et al.  Benign breast disease and the risk of breast cancer. , 2005 .

[16]  M. Pagano,et al.  Survival analysis. , 1996, Nutrition.

[17]  Douglas K Rex,et al.  Guidelines for colonoscopy surveillance after screening and polypectomy: a consensus update by the US Multi-Society Task Force on Colorectal Cancer. , 2012, Gastroenterology.

[18]  N. Hyman,et al.  Hyperplastic Polyposis and the Risk of Colorectal Cancer , 2004, Diseases of the colon and rectum.

[19]  Tomohiro Shinozaki,et al.  Colonoscopy reduces colorectal cancer mortality: A multicenter, long-term, colonoscopy-based cohort study , 2017, PloS one.

[20]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[21]  W. Sha,et al.  Colonoscopy surveillance of colorectal polyp recurrence in two years after the first polypectomy , 2016 .

[22]  T. Kim,et al.  Risk Factors for Recurrent High-Risk Polyps after the Removal of High-Risk Polyps at Initial Colonoscopy , 2015, Yonsei medical journal.

[23]  Y. Baskın,et al.  Difference Between Left-Sided and Right-Sided Colorectal Cancer: A Focused Review of Literature , 2018, Gastroenterology research.

[24]  Víctor Urrea,et al.  Letter to the Editor: Stability of Random Forest importance measures , 2011, Briefings Bioinform..

[25]  F. Moy,et al.  Survival rates and predictors of survival among colorectal cancer patients in a Malaysian tertiary hospital , 2017, BMC Cancer.

[26]  R. Lewis,et al.  Time-to-Event Analysis. , 2016, JAMA.

[27]  Mark R. Segal,et al.  Regression Trees for Censored Data , 1988 .

[28]  H. Friess,et al.  Right Sided Colon Cancer as a Distinct Histopathological Subtype with Reduced Prognosis , 2016, Digestive Surgery.

[29]  M. Amonkar,et al.  Surveillance Patterns and Polyp Recurrence following Diagnosis and Excision of Colorectal Polyps in a Medicare Population , 2005, Cancer Epidemiology Biomarkers & Prevention.

[30]  P. Stang,et al.  Colon polyp recurrence in a managed care population. , 2003, Archives of internal medicine.