Fast tissue detection in volumetric laser endomicroscopy using convolutional neural networks: an object-detection approach

Barrett's Esophagus is a precursor of esophageal adenocarcinoma, one of the most lethal forms of cancer. Volumetric laser endomicroscopy (VLE) is a relatively new technology used for early detection of abnormal cells in BE by imaging the inner tissue layers of the esophagus. Computer-Aided Detection (CAD) shows great promise in analyzing the VLE frames due to the advances in deep learning. However, a full VLE scan produces 1,200 scans of 4,096 x 2,048 pixels, making automated pre-processing for the tissue of interest extraction necessary. This paper explores an object detection for tissue detection in VLE scans. We show that this can be achieved in real time with very low inference time, using single-stage object detection like YOLO. Our best performing model achieves a value of 98.23% for the mean average precision of bounding boxes correctly predicting the tissue of interest. Additionally, we have found that the tiny YOLO with Partial Residual Networks architecture further reduces the inference speed with a factor of 10, while only sacrificing less than 1% of accuracy. This proposed method does not only segment the tissue of interest in real time without any latency, but it can also achieve this efficiently using limited GPU resources, rendering it attractive for embedded applications. Our paper is the first to introduce object detection as a new approach for VLE-data tissue segmentation and paves the way for real-time VLE-based detection of early cancer in BE.

[1]  Jun-Wei Hsieh,et al.  Enriching Variety of Layer-Wise Learning Information by Gradient Combination , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[2]  Shuyuan Yang,et al.  A Survey of Deep Learning-Based Object Detection , 2019, IEEE Access.

[3]  Anne-Fré Swager,et al.  Automated segmentation and characterization of esophageal wall in vivo by tethered capsule optical coherence tomography endomicroscopy. , 2016, Biomedical optics express.

[4]  Alexey Shvets,et al.  TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation , 2018, Computer-Aided Analysis of Gastrointestinal Videos.

[5]  Shu Liu,et al.  Path Aggregation Network for Instance Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  E. Schoon,et al.  Ensemble of Deep Convolutional Neural Networks for Classification of Early Barrett’s Neoplasia Using Volumetric Laser Endomicroscopy , 2019, Applied Sciences.

[7]  Kim L. Boyer,et al.  Retinal thickness measurements from optical coherence tomography using a Markov boundary model , 2001, IEEE Transactions on Medical Imaging.

[8]  Joel H Rubenstein,et al.  Epidemiology, Diagnosis, and Management of Esophageal Adenocarcinoma. , 2015, Gastroenterology.

[9]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[10]  L. Gerson,et al.  Barrett’s esophagus: Incidence, etiology, pathophysiology, prevention and treatment , 2007, Therapeutics and clinical risk management.

[11]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[12]  Joost van der Putten,et al.  Tissue segmentation in volumetric laser endomicroscopy data using FusionNet and a domain-specific loss function , 2019, Medical Imaging: Image Processing.

[13]  Hiroshi Ishikawa,et al.  Macular segmentation with optical coherence tomography. , 2005, Investigative ophthalmology & visual science.

[14]  Won-Ki Jeong,et al.  FusionNet: A Deep Fully Residual Convolutional Neural Network for Image Segmentation in Connectomics , 2016, Frontiers in Computer Science.

[15]  Cong Wang,et al.  Fast esophageal layer segmentation in OCT images of guinea pigs based on sparse Bayesian classification and graph search. , 2019, Biomedical optics express.