Automatic estimation of ulcerative colitis severity from endoscopy videos using ordinal multi-instance learning

Ulcerative colitis (UC) is a chronic inflammatory bowel disease characterized by relapsing inflammation of the large intestine. The severity of UC is often represented by the Mayo Endoscopic Subscore (MES) which quantifies mucosal disease activity from endoscopy videos. In clinical trials, an endoscopy video is assigned an MES based upon the most severe disease activity observed in the video. For this reason, severe inflammation spread throughout the colon will receive the same MES as an otherwise healthy colon with severe inflammation restricted to a small, localized segment. Therefore, the extent of disease activity throughout the large intestine, and overall response to treatment, may not be completely captured by the MES. In this work, we aim to automatically estimate UC severity for each frame in an endoscopy video to provide a higher resolution assessment of disease activity throughout the colon. Because annotating severity at the frame-level is expensive, labor-intensive, and highly subjective, we propose a novel weakly supervised, ordinal classification method to estimate frame severity from video MES labels alone. Using clinical trial data, we first achieved 0.92 and 0.90 AUC for predicting mucosal healing and remission of UC, respectively. Then, for severity estimation, we demonstrate that our models achieve substantial Cohen’s Kappa agreement with ground truth MES labels, comparable to the inter-rater agreement of expert clinicians. These findings indicate that our framework could serve as a foundation for novel clinical endpoints, based on a more localized scoring system, to better evaluate UC drug efficacy in clinical trials.

[1]  Ming Y. Lu,et al.  Data-efficient and weakly supervised computational pathology on whole-slide images , 2020, Nature Biomedical Engineering.

[2]  Snehashis Roy,et al.  Extracting 2D weak labels from volume labels using multiple instance learning in CT hemorrhage detection , 2019, Medical Imaging: Image Processing.

[3]  Thomas J. Fuchs,et al.  Clinical-grade computational pathology using weakly supervised deep learning on whole slide images , 2019, Nature Medicine.

[4]  Tsuyoshi Ozawa,et al.  Novel computer-assisted diagnosis system for endoscopic disease activity in patients with ulcerative colitis. , 2019, Gastrointestinal endoscopy.

[5]  Ming Dong,et al.  Using Ranking-CNN for Age Estimation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  F. Arcadu,et al.  Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data , 2021, Therapeutic advances in gastrointestinal endoscopy.

[7]  A. Di Leo,et al.  Inter-Observer Agreement of a New Endoscopic Score for Ulcerative Colitis Activity: Preliminary Experience , 2020, Diagnostics.

[8]  W. Tremaine,et al.  Coated oral 5-aminosalicylic acid therapy for mildly to moderately active ulcerative colitis. A randomized study. , 1987, The New England journal of medicine.

[9]  Eibe Frank,et al.  A Simple Approach to Ordinal Classification , 2001, ECML.

[10]  Axel Saalbach,et al.  Localization of Critical Findings in Chest X-Ray Without Local Annotations Using Multi-Instance Learning , 2020, 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI).

[11]  A. Griffiths,et al.  STRIDE-II: An Update on the Selecting Therapeutic Targets in Inflammatory Bowel Disease (STRIDE) Initiative of the International Organization for the Study of IBD (IOIBD): Determining Therapeutic Goals for Treat-to-Target strategies in IBD. , 2020, Gastroenterology.

[12]  S. Targan,et al.  Ustekinumab as Induction and Maintenance Therapy for Ulcerative Colitis. , 2019, The New England journal of medicine.

[13]  K. Najarian,et al.  Fully automated endoscopic disease activity assessment in ulcerative colitis. , 2020, Gastrointestinal endoscopy.

[14]  F. Rizzello,et al.  Inter-observer agreement in endoscopic scoring systems: preliminary report of an ongoing study from the Italian Group for Inflammatory Bowel Disease (IG-IBD). , 2014, Digestive and liver disease : official journal of the Italian Society of Gastroenterology and the Italian Association for the Study of the Liver.

[15]  Marius Pedersen,et al.  PS-DeVCEM: Pathology-sensitive deep learning model for video capsule endoscopy based on weakly labeled data , 2020, Comput. Vis. Image Underst..

[16]  Ryan W. Stidham,et al.  Performance of a Deep Learning Model vs Human Reviewers in Grading Endoscopic Disease Severity of Patients With Ulcerative Colitis , 2019, JAMA network open.