Bag-of-Visual-Words Approach to Abnormal Image Detection in Wireless Capsule Endoscopy Videos

One of the main goals of Wireless Capsule Endoscopy (WCE) is to detect mucosal abnormalities such as blood, ulcers, and polyps in the gastrointestinal tract. Typically, fewer than 5% of the 55,000 total frames of a WCE video contain abnormalities, so it is critical to develop a technique that automatically discriminates abnormal findings from normal ones. We introduce a "Bag-of-Visual-Words" method, an approach that has been used successfully for image classification in non-medical domains. First, the training image patches are represented by color and texture features, and the vocabulary of visual words is constructed from them with the K-means clustering algorithm. Next, each image (the "document" in bag-of-words terms) is represented as a histogram of visual words, which serves as the feature vector of the image. Finally, an SVM classifier is trained on these feature vectors to distinguish images with abnormal regions from images without them. Experimental results on our current data set show that the proposed method achieves promising performance.
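The vocabulary and histogram steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`kmeans`, `bow_histogram`), the three-dimensional toy feature vectors, the vocabulary size of 2, and the random initialization of K-means are all assumptions made for illustration. Real patch descriptors would come from the color and texture features mentioned in the abstract, and the final SVM training step is omitted here.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(vec, centers):
    """Index of the visual word (cluster center) closest to vec."""
    return min(range(len(centers)), key=lambda k: sq_dist(vec, centers[k]))

def kmeans(vectors, k, iters=20, seed=0):
    """Plain K-means; the returned centers act as the visual vocabulary."""
    rng = random.Random(seed)
    centers = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in vectors:
            groups[nearest(v, centers)].append(v)
        for j, members in enumerate(groups):
            if members:  # keep the old center if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers

def bow_histogram(patch_features, centers):
    """Normalized histogram of visual-word occurrences for one image."""
    counts = [0] * len(centers)
    for f in patch_features:
        counts[nearest(f, centers)] += 1
    total = sum(counts) or 1
    return [c / total for c in counts]

# Toy patch descriptors (hypothetical values, not real WCE features).
training_patches = [
    [0.9, 0.1, 0.2], [0.8, 0.2, 0.1],  # one kind of patch
    [0.2, 0.7, 0.9], [0.1, 0.8, 0.8],  # another kind of patch
]
vocabulary = kmeans(training_patches, k=2)

# Patches extracted from a single image to be classified.
image_patches = [[0.85, 0.15, 0.15], [0.15, 0.75, 0.85], [0.9, 0.1, 0.1]]
histogram = bow_histogram(image_patches, vocabulary)
```

In a full pipeline, one such histogram per image, paired with its normal/abnormal label, would form the training set for the SVM classifier.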
