Abnormality tracking during video capsule endoscopy using an affine triangular constraint based on surrounding features

The precise tracking of an abnormality in the gastrointestinal tract is useful for medical training purposes. However, the gastrointestinal wall deforms continuously in an unpredictable manner, while abnormalities lack distinctive features, making them difficult to track over continuous frames. To address this problem, we propose a tracking method for capsule endoscopy using the surrounding features of abnormalities. By applying triangular constraints using an affine transformation, we are able to track abnormalities that do not have distinctive features over consecutive image frames. We demonstrate the efficacy of our approach using eight common types of gastrointestinal abnormalities.

[1]  Suvidha Sawant,et al.  Tumor Recognition in Wireless Capsule Endoscopy Images , 2014 .

[2]  W. S. L. Jebarani,et al.  Assessment of Crohn's disease lesions in Wireless Capsule Endoscopy images using SVM based classification , 2013, 2013 International Conference on Signal Processing , Image Processing & Pattern Recognition.

[3]  Michael Felsberg,et al.  Convolutional Features for Correlation Filter Based Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[4]  H. Sorenson Least-squares estimation: from Gauss to Kalman , 1970, IEEE Spectrum.

[5]  Yasushi Yagi,et al.  Tracking abnormalities in video capsule endoscopy using surrounding features with a triangular constraint , 2012, 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI).

[6]  Artur Klepaczko,et al.  Texture and color based image segmentation and pathology detection in capsule endoscopy videos , 2014, Comput. Methods Programs Biomed..

[7]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Robert T. Collins,et al.  Mean-shift blob tracking through scale space , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Yasushi Kanazawa,et al.  Wide Baseline Matching using Triplet Vector Descriptor , 2006, BMVC.

[10]  Yasushi Yagi,et al.  Image-Enhanced Capsule Endoscopy Preserving the Original Color Tones , 2011, Abdominal Imaging.

[11]  Max Q.-H. Meng,et al.  Computer-Aided Detection of Bleeding Regions for Capsule Endoscopy Images , 2009, IEEE Transactions on Biomedical Engineering.

[12]  M. Liedlgruber,et al.  A Summary of Research Targeted at Computer-Aided Decision Support in Endoscopy of the Gastrointestinal Tract , 2011 .

[13]  Michal Mackiewicz,et al.  Wireless Capsule Endoscopy Color Video Segmentation , 2008, IEEE Transactions on Medical Imaging.

[14]  Michael Felsberg,et al.  The Visual Object Tracking VOT2015 Challenge Results , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[15]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[16]  C Tomasi,et al.  Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[17]  김재호,et al.  SURF(Speeded Up Robust Features)와 Kalman Filter를 이용한 컬러 객체 추적 방법의 제안 , 2012 .

[18]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[19]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[20]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[21]  Chia-Hsiang Wu,et al.  Three-Dimensional Modeling From Endoscopic Video Using Geometric Constraints Via Feature Positioning , 2007, IEEE Transactions on Biomedical Engineering.

[22]  Max Q.-H. Meng,et al.  Texture analysis for ulcer detection in capsule endoscopy images , 2009, Image Vis. Comput..

[23]  Philippe C. Cattin,et al.  Tracking the invisible: Learning where the object might be , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.