Predictive models for endoscopic disease activity in patients with ulcerative colitis: Practical machine learning-based modeling and interpretation

Background Endoscopic disease activity monitoring is important for the long-term management of patients with ulcerative colitis (UC), there is currently no widely accepted non-invasive method that can effectively predict endoscopic disease activity. We aimed to develop and validate machine learning (ML) models for predicting it, which are desired to reduce the frequency of endoscopic examinations and related costs. Methods The patients with a diagnosis of UC in two hospitals from January 2016 to January 2021 were enrolled in this study. Thirty nine clinical and laboratory variables were collected. All patients were divided into four groups based on MES or UCEIS scores. Logistic regression (LR) and four ML algorithms were applied to construct the prediction models. The performance of models was evaluated in terms of accuracy, sensitivity, precision, F1 score, and area under the receiver-operating characteristic curve (AUC). Then Shapley additive explanations (SHAP) was applied to determine the importance of the selected variables and interpret the ML models. Results A total of 420 patients were entered into the study. Twenty four variables showed statistical differences among the groups. After synthetic minority oversampling technique (SMOTE) oversampling and RFE variables selection, the random forests (RF) model with 23 variables in MES and the extreme gradient boosting (XGBoost) model with 21 variables in USEIS, had the greatest discriminatory ability (AUC = 0.8192 in MES and 0.8006 in UCEIS in the test set). The results obtained from SHAP showed that albumin, rectal bleeding, and CRP/ALB contributed the most to the overall model. In addition, the above three variables had a more balanced contribution to each classification under the MES than the UCEIS according to the SHAP values. Conclusion This proof-of-concept study demonstrated that the ML model could serve as an effective non-invasive approach to predicting endoscopic disease activity for patients with UC. RF and XGBoost, which were first introduced into data-based endoscopic disease activity prediction, are suitable for the present prediction modeling.

[1]  A. Iwashita,et al.  A review on the current status and definitions of activity indices in inflammatory bowel disease: how to use indices for precise evaluation , 2022, Journal of Gastroenterology.

[2]  Yuntao Liu,et al.  Serum Albumin to Globulin Ratio is Associated with the Presence and Severity of Inflammatory Bowel Disease , 2022, Journal of inflammation research.

[3]  G. Radford-Smith,et al.  Markers of Systemic Inflammation in Acute Attacks of Ulcerative Colitis: What Level of C-reactive Protein Constitutes Severe Colitis? , 2022, Journal of Crohn's & colitis.

[4]  Futian Weng,et al.  Differentiation of intestinal tuberculosis and Crohn’s disease through an explainable machine learning method , 2022, Scientific Reports.

[5]  Tatjana Loncar-Turukalo,et al.  Machine Learning Based Metagenomic Prediction of Inflammatory Bowel Disease , 2021, pHealth.

[6]  J. Miyoshi,et al.  Machine learning using clinical data at baseline predicts the efficacy of vedolizumab at week 22 in patients with ulcerative colitis , 2021, Scientific Reports.

[7]  Jun Shen,et al.  Artificial Intelligence Enhances Studies on Inflammatory Bowel Disease , 2021, Frontiers in Bioengineering and Biotechnology.

[8]  R. Pai,et al.  Utilizing Deep Learning to Analyze Whole Slide Images of Colonic Biopsies for Associations Between Eosinophil Density and Clinicopathologic Features in Active Ulcerative Colitis. , 2021, Inflammatory bowel diseases.

[9]  Jun Kato,et al.  Leucine-rich alpha-2 glycoprotein as a marker of mucosal healing in inflammatory bowel disease , 2021, Scientific Reports.

[10]  S. Sinha,et al.  Artificial intelligence applications in inflammatory bowel disease: Emerging technologies and future directions , 2021, World journal of gastroenterology.

[11]  J. Miyoshi,et al.  Extent of disease affects the usefulness of fecal biomarkers in ulcerative colitis , 2021, BMC Gastroenterology.

[12]  B. Sands,et al.  New Therapeutics for Ulcerative Colitis. , 2021, Annual review of medicine.

[13]  R. Leal,et al.  Assessment of disease activity in inflammatory bowel diseases: Non-invasive biomarkers and endoscopic scores , 2020, World journal of gastrointestinal endoscopy.

[14]  Jun Shen,et al.  Development of a susceptibility gene based novel predictive model for the diagnosis of ulcerative colitis using random forest and artificial neural network , 2020, Aging.

[15]  Andrew Q. Ninh,et al.  Central reading of ulcerative colitis clinical trial videos using neural networks. , 2020, Gastroenterology.

[16]  B. Xing,et al.  Development and Interpretation of Multiple Machine Learning Models for Predicting Postoperative Delayed Remission of Acromegaly Patients During Long-Term Follow-Up , 2020, Frontiers in Endocrinology.

[17]  A. Cifu,et al.  Ulcerative Colitis in Adults. , 2020, JAMA.

[18]  A. Bernabé-Ortiz,et al.  A systematic review of population-based studies on lipid profiles in Latin America and the Caribbean , 2020, eLife.

[19]  S. Ng,et al.  Worldwide Management of Inflammatory Bowel Disease During the COVID-19 Pandemic: An International Survey , 2020, Inflammatory bowel diseases.

[20]  A. C. Gomes,et al.  Endoscopic and histologic activity assessment considering disease extent and prediction of treatment failure in ulcerative colitis , 2020, Scandinavian journal of gastroenterology.

[21]  X. Teng,et al.  Evaluation of inflammatory bowel disease activity in children using serum trefoil factor peptide , 2020, Pediatric Research.

[22]  S. Bonovas,et al.  Review article: faecal calprotectin and histologic remission in ulcerative colitis , 2020, Alimentary pharmacology & therapeutics.

[23]  K. Ohtsuka,et al.  Development and Validation of a Deep Neural Network for Accurate Evaluation of Endoscopic Images From Patients with Ulcerative Colitis. , 2020, Gastroenterology.

[24]  A. Schoepfer,et al.  Treatment Algorithm for Mild and Moderate-to-Severe Ulcerative Colitis: An Update , 2020, Digestion.

[25]  Jürgen Bajorath,et al.  Interpretation of Compound Activity Predictions from Complex Machine Learning Models Using Local Approximations and Shapley Values. , 2020, Journal of medicinal chemistry.

[26]  K. Faber,et al.  Serum Free Thiols Are Superior to Fecal Calprotectin in Reflecting Endoscopic Disease Activity in Inflammatory Bowel Disease , 2019, Antioxidants.

[27]  P. Higgins,et al.  Development and Validation of Machine Learning Models in Prediction of Remission in Patients With Moderate to Severe Crohn Disease , 2019, JAMA network open.

[28]  Ashwin N. Ananthakrishnan,et al.  ACG Clinical Guideline: Ulcerative Colitis in Adults , 2019, The American journal of gastroenterology.

[29]  Jie Ma,et al.  A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. , 2019, Journal of clinical epidemiology.

[30]  P. de Vos,et al.  A Combined Set of Four Serum Inflammatory Biomarkers Reliably Predicts Endoscopic Disease Activity in Inflammatory Bowel Disease , 2019, Front. Med..

[31]  B. Xing,et al.  Prediction of Recurrence after Transsphenoidal Surgery for Cushing’s Disease: The Use of Machine Learning Algorithms , 2019, Neuroendocrinology.

[32]  H. Ogata,et al.  Efficacy of Therapeutic Intervention for Patients With an Ulcerative Colitis Mayo Endoscopic Score of 1. , 2018, Inflammatory bowel diseases.

[33]  P. Munkholm,et al.  The sensitivity of fecal calprotectin in predicting deep remission in ulcerative colitis , 2018, Scandinavian journal of gastroenterology.

[34]  C. Dai,et al.  Fecal immunochemical test for predicting mucosal healing in ulcerative colitis patients: A systematic review and meta‐analysis , 2018, Journal of gastroenterology and hepatology.

[35]  Nima Hamidi,et al.  Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies , 2017, The Lancet.

[36]  M. Delgado-Rodríguez,et al.  Systematic review and meta-analysis. , 2017, Medicina intensiva.

[37]  Chao Ding,et al.  Ulcerative Colitis Endoscopic Index of Severity (UCEIS) versus Mayo Endoscopic Score (MES) in guiding the need for colectomy in patients with acute severe colitis , 2017, Gastroenterology report.

[38]  V Kishore Ayyadevara,et al.  Gradient Boosting Machine , 2018 .

[39]  B. Feagan,et al.  Endoscopic scoring indices for evaluation of disease activity in ulcerative colitis. , 2018, The Cochrane database of systematic reviews.

[40]  Patrick Felicia,et al.  An International Survey , 2018 .

[41]  Namrata Singh,et al.  Evaluating clinical, dietary, and psychological risk factors for relapse of ulcerative colitis in clinical, endoscopic, and histological remission , 2017, Journal of gastroenterology and hepatology.

[42]  I. Ordás,et al.  Usefulness of Transcriptional Blood Biomarkers as a Non-invasive Surrogate Marker of Mucosal Healing and Endoscopic Response in Ulcerative Colitis , 2017, Journal of Crohn's & colitis.

[43]  L. Peyrin-Biroulet,et al.  Ulcerative colitis , 2017, The Lancet.

[44]  S. Ng,et al.  The Worldwide Incidence and Prevalence of Inflammatory Bowel Disease in the 21 st Century: A Systematic Review of Population-Based Studies , 2017 .

[45]  B. Sands,et al.  Mucosal Healing Is Associated With Improved Long-term Outcomes of Patients With Ulcerative Colitis: A Systematic Review and Meta-analysis. , 2016, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[46]  G. delle Fave,et al.  Endoscopic scores for inflammatory bowel disease in the era of 'mucosal healing': Old problem, new perspectives. , 2016, Digestive and liver disease : official journal of the Italian Society of Gastroenterology and the Italian Association for the Study of the Liver.

[47]  S. Osawa,et al.  The Ulcerative Colitis Endoscopic Index of Severity More Accurately Reflects Clinical Outcomes and Long-term Prognosis than the Mayo Endoscopic Score. , 2016, Journal of Crohn's & colitis.

[48]  Jamal Shahrabi,et al.  The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes , 2016, Medical decision making : an international journal of the Society for Medical Decision Making.

[49]  Hui-mao Zhang,et al.  Diffusion-weighted magnetic resonance imaging without bowel preparation for detection of ulcerative colitis. , 2015, World journal of gastroenterology.

[50]  M. Silverberg,et al.  Predictors of Outcome in Ulcerative Colitis , 2015, Inflammatory bowel diseases.

[51]  Ying Wei,et al.  Computer-Aided Lung Nodule Recognition by SVM Classifier Based on Combination of Random Undersampling and SMOTE , 2015, Comput. Math. Methods Medicine.

[52]  S. Travis,et al.  Association between the ulcerative colitis endoscopic index of severity (UCEIS) and outcomes in acute severe ulcerative colitis. , 2013, Journal of Crohn's & colitis.

[53]  G. D'Haens,et al.  A Systematic Review of the Measurement of Endoscopic Healing in Ulcerative Colitis Clinical Trials: Recommendations and Implications for Future Research , 2014, Inflammatory bowel diseases.

[54]  M. Färkkilä,et al.  Surrogate markers and clinical indices, alone or combined, as indicators for endoscopic remission in anti-TNF-treated luminal Crohn's disease , 2012, Scandinavian journal of gastroenterology.

[55]  Joyce A. Mitchell,et al.  Countering imbalanced datasets to improve adverse drug event predictive models in labor and delivery , 2009, J. Biomed. Informatics.

[56]  J. Ellenberg,et al.  Use of the noninvasive components of the mayo score to assess clinical response in Ulcerative Colitis , 2008, Inflammatory bowel diseases.

[57]  O. Maeda,et al.  Value of colonoscopy for prediction of prognosis in patients with ulcerative colitis. , 2008, World journal of gastroenterology.

[58]  A. Michalsen,et al.  Noninvasive Markers in the Assessment of Intestinal Inflammation in Inflammatory Bowel Diseases: Performance of Fecal Lactoferrin, Calprotectin, and PMN-Elastase, CRP, and Clinical Indices , 2008, The American Journal of Gastroenterology.

[59]  P. Rutgeerts,et al.  Mucosal healing in inflammatory bowel disease: impossible ideal or therapeutic target? , 2007, Gut.

[60]  Philippe Marteau,et al.  A review of activity indices and efficacy end points for clinical trials of medical therapy in adults with ulcerative colitis. , 2007, Gastroenterology.

[61]  Charles X. Ling,et al.  AUC: A Better Measure than Accuracy in Comparing Learning Algorithms , 2003, Canadian Conference on AI.

[62]  Nikolas P. Galatsanos,et al.  A support vector machine approach for detection of microcalcifications , 2002, IEEE Transactions on Medical Imaging.

[63]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[64]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[65]  R. Pounder,et al.  A simple clinical colitis activity index , 1998, Gut.

[66]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[67]  K. Johnson An Update. , 1984, Journal of food protection.