Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists

Background: Machine learning, especially deep learning, is becoming more and more relevant in research and development in the medical domain. For all of the supervised deep learning applications, data is the most critical factor in securing successful implementation and sustaining the progress of the machine learning model. Especially gastroenterological data, which often involves endoscopic videos, are cumbersome to annotate. Domain experts are needed to interpret and annotate the videos. To support those domain experts, we generated a framework. With this framework, instead of annotating every frame in the video sequence, experts are just performing key annotations at the beginning and the end of sequences with pathologies, e.g. visible polyps. Subsequently, non-expert annotators supported by machine learning add the missing annotations for the frames in-between. Results: Using this framework we were able to reduce work load of domain experts on average by a factor of 20. This is primarily due to the structure of the framework, which is designed to minimize the workload of the domain expert. Pairing this framework with a state-of-the-art semi-automated pre-annotation model enhances the annotation speed further. Through a study with 10 participants we show that semi-automated annotation using our tool doubles the annotation speed of non-expert annotators compared to a well-known state-of-the-art annotation tool. Conclusion: In summary, we introduce a framework for fast expert annotation for gastroenterologists, which reduces the workload of the domain expert considerably while maintaining a very high annotation quality. The framework incorporates a semi-automated annotation system utilizing trained object detection models. The software and framework are open-source.

[1]  Sharib Ali,et al.  Real-Time Polyp Detection, Localisation and Segmentation in Colonoscopy Using Deep Learning , 2020, ArXiv.

[2]  Bradley J. Erickson,et al.  RIL-Contour: a Medical Imaging Dataset Annotation Tool for and with Deep Learning , 2019, Journal of Digital Imaging.

[3]  Niall O' Mahony,et al.  Deep Learning vs. Traditional Computer Vision , 2019, CVC.

[4]  Abhishek Dutta,et al.  The VIA Annotation Software for Images, Audio and Video , 2019, ACM Multimedia.

[5]  Guido Gerig,et al.  ITK-SNAP: An interactive tool for semi-automatic segmentation of multi-modality biomedical images , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  Sarah Webb Deep learning for biology , 2018, Nature.

[7]  Prakash Choudhary,et al.  Image annotation: Then and now , 2018, Image Vis. Comput..

[8]  Fei Wang,et al.  Deep Learning in Medicine-Promise, Progress, and Challenges. , 2019, JAMA internal medicine.

[9]  Ilangko Balasingham,et al.  Improving Automatic Polyp Detection Using CNN by Exploiting Temporal Dependency in Colonoscopy Video , 2020, IEEE Journal of Biomedical and Health Informatics.

[10]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[11]  Yu Cao,et al.  Colorectal Polyp Detection in Real-world Scenario: Design and Experiment Study , 2020, 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI).

[12]  Klaus Schöffmann,et al.  Endometriosis Annotation in Endoscopic Videos , 2017, 2017 IEEE International Symposium on Multimedia (ISM).

[13]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, CVPR.

[14]  Matjaž Kukar,et al.  Application of machine learning for hematological diagnosis , 2017 .

[15]  V. Shackleton Boredom and Repetitive Work: A Review , 1981 .

[16]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[17]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[18]  I. Tagkopoulos,et al.  Application of machine learning in rheumatic disease research , 2019, The Korean journal of internal medicine.

[19]  Bogdan J. Matuszewski,et al.  GIANA Polyp Segmentation with Fully Convolutional Dilation Neural Networks , 2019, VISIGRAPP.

[20]  Ece Kamar,et al.  Revolt: Collaborative Crowdsourcing for Labeling Machine Learning Datasets , 2017, CHI.

[21]  Tao Song,et al.  Discriminative Correlation Filter for Long-Time Tracking , 2020, Comput. J..

[22]  Aymeric Histace,et al.  Comparative Validation of Polyp Detection Methods in Video Colonoscopy: Results From the MICCAI 2015 Endoscopic Vision Challenge , 2017, IEEE Transactions on Medical Imaging.

[23]  David G. Stork,et al.  Character and document research in the Open Mind Initiative , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[24]  Horst Bischof,et al.  Real-Time Tracking via On-line Boosting , 2006, BMVC.

[25]  Bin Li,et al.  Applications of machine learning in drug discovery and development , 2019, Nature Reviews Drug Discovery.

[26]  Huilong Duan,et al.  Real-time gastric polyp detection using convolutional neural networks , 2019, PloS one.

[27]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[28]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[29]  Mohammad Motiur Rahman,et al.  Gastrointestinal polyp detection through a fusion of contourlet transform and Neural features , 2020, J. King Saud Univ. Comput. Inf. Sci..

[30]  Fei Wang,et al.  Deep learning for healthcare: review, opportunities and challenges , 2018, Briefings Bioinform..

[31]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[32]  Jiri Matas,et al.  Forward-Backward Error: Automatic Detection of Tracking Failures , 2010, 2010 20th International Conference on Pattern Recognition.

[33]  Jiri Matas,et al.  Discriminative Correlation Filter with Channel and Spatial Reliability , 2017, CVPR.

[34]  B. Erickson,et al.  Machine Learning for Medical Imaging. , 2017, Radiographics : a review publication of the Radiological Society of North America, Inc.

[35]  N. Halama Machine learning for tissue diagnostics in oncology: brave new world , 2019, British Journal of Cancer.