BoVW Model for Animal Recognition: An Evaluation on SIFT Feature Strategies

Classifying images into categories has attracted considerable interest in both research and practice. Content-Based Image Retrieval (CBIR) has not succeeded in closing the semantic gap. The Bag of Visual Words (BoVW) model was therefore introduced to quantize visual features into visual words. The SIFT detector is invariant to translation, rotation, and scaling, and partially invariant to affine distortion and illumination changes. This paper investigates the potential of the BoVW model for animal recognition and identifies which SIFT feature extraction method works better for animal images. A performance evaluation of several SIFT feature strategies indicates that MSDSIFT feature extraction gives better results.
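
The quantization step that the BoVW model relies on can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `quantize` and `bovw_histogram`, the three-word codebook, and the 2-D toy descriptors (stand-ins for 128-D SIFT descriptors) are all assumptions made for illustration. In practice the codebook would typically be learned by clustering descriptors from training images.

```python
from collections import Counter
from math import dist


def quantize(descriptors, codebook):
    """Map each descriptor to the index of its nearest visual word."""
    return [
        min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        for d in descriptors
    ]


def bovw_histogram(descriptors, codebook):
    """Return the L1-normalized visual-word histogram for one image."""
    counts = Counter(quantize(descriptors, codebook))
    total = len(descriptors)
    return [counts[k] / total if total else 0.0 for k in range(len(codebook))]


# Hypothetical codebook of three visual words (toy 2-D values).
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
# Hypothetical local descriptors extracted from a single image.
descriptors = [(0.1, 0.2), (0.9, 1.1), (1.2, 0.8), (4.8, 5.1)]

hist = bovw_histogram(descriptors, codebook)
print(hist)
```

The resulting fixed-length histogram is what a classifier would consume, regardless of how many descriptors each image produced.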
