Prognosticating Colorectal Cancer Recurrence using Machine Learning Techniques

Colorectal cancer is among the top three most commonly occurring cancers worldwide, and around 30-40% of patients treated by curative intent surgery will experience cancer recurrence. Proactive prognostication would enable clinicians to better plan treatment modality and intensity, and follow-up frequency to reduce recurrence. Here, we study the application of machine learning models to predict cancer recurrence in a cohort of 904 post-resection colorectal cancer patients. We employ heterogeneous structured and temporal clinical features including demographic and diagnostic information, tumour stage and location details, biochemistry and molecular typing results, as well as surgical details and treatment parameters. We characterize the performance of multiple machine learning classifiers including logistic regression, support vector machine, gradient boosting and multi-layer perceptron on structured data. Our best model achieved a sensitivity of 80.7% and a specificity of 88.2%. This is comparable to and even exceeding the performance of carcinoembryonic antigen (CEA), a tumour marker commonly used in the clinic for colorectal cancer monitoring. We also demonstrate feasibility for accurate forecasting of recurrence up to 4 months in advance, as well as the possibility of predicting recurrence as early as 6 months post-surgery. Our results have positive implications for better management of colorectal cancer patients in the post-resection setting.

[1]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[2]  J. Wind,et al.  Symptomatic and Asymptomatic Colon Cancer Recurrence: A Multicenter Cohort Study , 2016, The Annals of Family Medicine.

[3]  A. Papalambros,et al.  Predictors of survival in stage IV metastatic colorectal cancer. , 2010, Anticancer research.

[4]  Neofytos Dimitriou,et al.  A principled machine learning framework improves accuracy of stage II colorectal cancer prognosis , 2018, npj Digital Medicine.

[5]  S. P. Akpabio World Health Organisation , 1983, British Dental Journal.

[6]  R. Gillies,et al.  Deep learning for lung cancer prognostication: A retrospective multi-cohort radiomics study , 2018, PLoS medicine.

[7]  M. Fararouei,et al.  Risk Factors Predicting Colorectal Cancer Recurrence Following Initial Treatment: A 5-year Cohort Study , 2017, Asian Pacific journal of cancer prevention : APJCP.

[8]  R. A. Baker,et al.  The value of serum carcinoembryonic antigen in predicting recurrent disease following curative resection of colorectal cancer , 1994, Diseases of the colon and rectum.

[9]  Fernando Nogueira,et al.  Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning , 2016, J. Mach. Learn. Res..

[10]  Z. Hall Cancer , 1906, The Hospital.

[11]  Noam Auslander,et al.  Predicting Complete Remission of Acute Myeloid Leukemia: Machine Learning Applied to Gene Expression , 2019, Cancer informatics.

[12]  M. Gonen,et al.  Association of Preoperative and Postoperative Serum Carcinoembryonic Antigen and Colon Cancer Outcome , 2017, JAMA oncology.

[13]  Brian D Nicholson,et al.  The diagnostic accuracy of a single CEA blood test in detecting colorectal cancer recurrence: Results from the FACS trial , 2017, PloS one.

[14]  Hans-Ulrich Prokosch,et al.  Predicting Clinical Outcomes in Colorectal Cancer Using Machine Learning , 2018, MIE.

[15]  Andreas W. Kempa-Liehr,et al.  Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python package) , 2018, Neurocomputing.

[16]  M. Fakih,et al.  CEA monitoring in colorectal cancer. What you should know. , 2006, Oncology.

[17]  R. Lamerz,et al.  Multivariate analysis of the prognostic value of CEA and CA 19-9 serum levels in colorectal cancer. , 2000, Anticancer research.

[18]  M. Mai,et al.  Preoperative CEA and PPD values as prognostic factors for immunochemotherapy using PSK and 5-FU. , 2005, Anticancer research.

[19]  Rae Woong Park,et al.  Development of Novel Breast Cancer Recurrence Prediction Model Using Support Vector Machine , 2012, Journal of breast cancer.

[20]  T. Treasure,et al.  The CEA Second-Look Trial: a randomised controlled trial of carcinoembryonic antigen prompted reoperation for recurrent colorectal cancer , 2014, BMJ Open.

[21]  N. Armitage,et al.  Preoperative carcinoembryonic antigen is related to tumour stage and long-term survival in colorectal cancer. , 1998, British Journal of Cancer.

[22]  M. Duffy,et al.  Carcinoembryonic antigen as a marker for colorectal cancer: is it clinically useful? , 2001, Clinical chemistry.

[23]  Jianjun Yang,et al.  Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection , 2020, Scientific Reports.

[24]  G. Yoon,et al.  Predictive factors and the prognosis of recurrence of colorectal cancer within 2 years after curative resection , 2014, Annals of surgical treatment and research.

[25]  N. Tahri,et al.  [Sensitivity, specificity and prognostic value of CEA in colorectal cancer: results of a Tunisian series and literature review]. , 2001, La Tunisie medicale.

[26]  Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth - Proceedings of MIE 2018, Medical Informatics Europe, Gothenburg, Sweden, April 24-26, 2018 , 2018, MIE.

[27]  G. Choi,et al.  Serum Carcinoembryonic Antigen Monitoring After Curative Resection for Colorectal Cancer: Clinical Significance of the Preoperative Level , 2009, Annals of Surgical Oncology.

[28]  M. Kanda,et al.  Hazard rate of tumor recurrence over time in patients with colon cancer: implications for postoperative surveillance from three Japanese Foundation for Multidisciplinary Treatment of Cancer (JFMC) clinical trials , 2017, Journal of Cancer.