Bag-Of-Visual-Words Approach based on SURF Features to Polyp Detection in Wireless Capsule Endoscopy Videos

Wireless Capsule Endoscopy (WCE) is a relatively new technology (FDA approved in 2002) allowing doctors to view most of the small intestine. One of the most important goals of WCE is the early detection of colorectal polyps. We introduce “Bag-of-Visual-Words” method which has been successfully used in particular for image classification in non-medical domains. Initially the training image patches are sampled and represented by speeded up robust features (SURF) descriptor, and then the bag of words model is constructed by K-means clustering algorithm. Subsequently the document is represented as the histogram of the visual words which is the feature vector of the image. Finally, a SVM classifier is trained using these feature vectors to distinguish images with polyp regions from ones without them. Our preliminary experiments on our current data set demonstrate that the proposed method achieves promising performances.

[1]  Wu-Jun Li,et al.  Localized content-based image retrieval through evidence region identification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Kristen Grauman,et al.  Watch, Listen & Learn: Co-training on Captioned Images and Videos , 2008, ECML/PKDD.

[3]  M. Bertoni,et al.  Video capsule endoscopy for evaluating obscure gastrointestinal bleeding and suspected small-bowel pathology , 2004, Journal of Gastroenterology.

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Fahad Shahbaz Khan,et al.  The Impact of Color on Bag-of-Words Based Object Recognition , 2010, 2010 20th International Conference on Pattern Recognition.

[7]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[8]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[9]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .