Automatic content understanding with cascaded spatial-temporal deep framework for capsule endoscopy videos

Capsule endoscopy (CE) is the first-line diagnostic tool for inspecting gastrointestinal (GI) tract diseases. It is a tremendous task on examining and managing the CE videos by endoscopists. Therefore, a computer-aided diagnosis system is desired and urgent. In this paper, a general cascaded spatial-temporal deep framework is proposed to understand the most commonly seen contents of whole GI tract videos. First, the noisy contents such as feces, bile, bubble, and low power images are detected and removed by a Convolutional Neural Network (CNN) model. The clear images are then classified into entrance, stomach, small intestine, and colon by the second CNN. Finally, the topographic segmentation of the whole video is performed with a global temporal integration strategy by Hidden Markov Model (HMM). Compared to existing methods, the proposed framework performs noise content detection and topographic segmentation at the same time, which significantly reduces the number of images to be checked by endoscopists and segments images of different organs more accurately. Experiments on a dataset with 630K images from 14 patients demonstrate that the proposed approach achieves a promising performance in terms of effectiveness and efficiency.

[1]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[2]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[3]  Fernando Vilariño,et al.  Intestinal Motility Assessment With Video Capsule Endoscopy: Automatic Annotation of Phasic Intestinal Contractions , 2010, IEEE Transactions on Medical Imaging.

[4]  George A. Tsihrintzis,et al.  The Class Imbalance Problem , 2017 .

[5]  Ronald M. Summers,et al.  Improving Computer-Aided Detection Using Convolutional Neural Networks and Random View Aggregation , 2015, IEEE Transactions on Medical Imaging.

[6]  李抱朴,et al.  Wireless capsule endoscopy images enhancement via adaptive contrast diffusion , 2012 .

[7]  Yaozong Gao,et al.  Deformable MR Prostate Segmentation via Deep Feature Learning and Sparse Patch Matching , 2017, Deep Learning for Medical Image Analysis.

[8]  Özgür Ulusoy,et al.  Bilvideo-7: an MPEG-7- compatible video indexing and retrieval system , 2010 .

[9]  Bill P. Buckles,et al.  Wireless Capsule Endoscopy Video Segmentation Using an Unsupervised Learning Approach Based on Probabilistic Latent Semantic Analysis With Scale Invariant Features , 2012, IEEE Transactions on Information Technology in Biomedicine.

[10]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Wei Zhang,et al.  Computer-Aided Bleeding Detection in WCE Video , 2014, IEEE Journal of Biomedical and Health Informatics.

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[14]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[15]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  D. Iakovidis,et al.  Software for enhanced video capsule endoscopy: challenges for essential progress , 2015, Nature Reviews Gastroenterology &Hepatology.

[17]  Gregory D. Hager,et al.  A Meta Method for Image Matching , 2011, IEEE Transactions on Medical Imaging.

[18]  Max Q.-H. Meng,et al.  A general framework for wireless capsule endoscopy study synopsis , 2015, Comput. Medical Imaging Graph..

[19]  Nitesh V. Chawla,et al.  SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[20]  Fons van der Sommen,et al.  Supportive automatic annotation of early esophageal cancer using local gabor and color features , 2014, Neurocomputing.

[21]  Taghi M. Khoshgoftaar,et al.  RUSBoost: A Hybrid Approach to Alleviating Class Imbalance , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[22]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[23]  Alexandros Karargyris,et al.  Optimizing lesion detection in small-bowel capsule endoscopy: from present problems to future solutions , 2015, Expert review of gastroenterology & hepatology.

[24]  Max Q.-H. Meng,et al.  Bleeding Frame and Region Detection in the Wireless Capsule Endoscopy Video , 2016, IEEE Journal of Biomedical and Health Informatics.

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[27]  P. Swain,et al.  Wireless capsule endoscopy. , 2002, The Israel Medical Association journal : IMAJ.

[28]  John N. Lygouras,et al.  A neuro-fuzzy-based system for detecting abnormal patterns in wireless-capsule endoscopic images , 2007, Neurocomputing.

[29]  Leontios J. Hadjileontiadis,et al.  Capsule endoscopy image analysis using texture information from various colour models , 2012, Comput. Methods Programs Biomed..

[30]  Khan A. Wahid,et al.  Automated Bleeding Detection in Capsule Endoscopy Videos Using Statistical Features and Region Growing , 2014, Journal of Medical Systems.

[31]  Zvi Fireman Capsule endoscopy: Future horizons. , 2010, World journal of gastrointestinal endoscopy.

[32]  Kuk-Jin Yoon,et al.  Polyp Detection via Imbalanced Learning and Discriminative Feature Learning , 2015, IEEE Transactions on Medical Imaging.

[33]  KlepaczkoArtur,et al.  Texture and color based image segmentation and pathology detection in capsule endoscopy videos , 2014 .

[34]  Max Q.-H. Meng,et al.  Tumor Recognition in Wireless Capsule Endoscopy Images Using Textural Features and SVM-Based Feature Selection , 2012, IEEE Transactions on Information Technology in Biomedicine.

[35]  Gregory D. Hager,et al.  Assessment of Crohn’s Disease Lesions in Wireless Capsule Endoscopy Images , 2012, IEEE Transactions on Biomedical Engineering.

[36]  Fernando Vilariño,et al.  Categorization and Segmentation of Intestinal Content Frames for Wireless Capsule Endoscopy , 2012, IEEE Transactions on Information Technology in Biomedicine.

[37]  李抱朴,et al.  Automatic polyp detection for wireless capsule endoscopy images , 2012 .

[38]  Dinggang Shen,et al.  A Hybrid of Deep Network and Hidden Markov Model for MCI Identification with Resting-State fMRI , 2015, MICCAI.

[39]  Ronald M. Summers,et al.  DeepOrgan: Multi-level Deep Convolutional Networks for Automated Pancreas Segmentation , 2015, MICCAI.

[40]  Anant Madabhushi,et al.  A Deep Convolutional Neural Network for segmenting and classifying epithelial and stromal regions in histopathological images , 2016, Neurocomputing.

[41]  Isabel N. Figueiredo,et al.  Automated Polyp Detection in Colon Capsule Endoscopy , 2013, IEEE Transactions on Medical Imaging.

[42]  Jung-Hwan Oh,et al.  Abnormal image detection in endoscopy videos using a filter bank and local binary patterns , 2014, Neurocomputing.

[43]  Xiao Liu,et al.  Stacked deep polynomial network based representation learning for tumor classification with small ultrasound image dataset , 2016, Neurocomputing.

[44]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[45]  Nikolaos G. Bourbakis,et al.  Detection of Small Bowel Polyps and Ulcers in Wireless Capsule Endoscopy Videos , 2011, IEEE Transactions on Biomedical Engineering.

[46]  Chong-Wah Ngo,et al.  Automatic Hookworm Detection in Wireless Capsule Endoscopy Images , 2016, IEEE Transactions on Medical Imaging.

[47]  Michal Mackiewicz,et al.  Wireless Capsule Endoscopy Color Video Segmentation , 2008, IEEE Transactions on Medical Imaging.