An improved BOW approach using fuzzy feature encoding and visual-word weighting

The bag-of-words (BOW) has become a popular image representation model with successful implementations in visual analysis. Although the original model has been improved in several ways, the utilization of the Fuzzy Set Theory in BOW has not been investigated thoroughly. This paper presents a fuzzy feature encoding approach to address the problems associated with the hard and soft assignments of image features to the visual-words. Our encoding method assigns each image feature to only the first and second closest words in the codebook to overcome the word-uncertainty problem. Moreover, we introduce a new word-weighting scheme for image categories based on image histograms. Experiments conducted on some image datasets show that both methods increase the BOW performance in content based image retrieval.

[1]  Mathias Lux,et al.  Bag of visual words revisited: an exploratory study on robust image retrieval exploiting fuzzy codebooks , 2010, MDMKDD '10.

[2]  Pierre Tirilly,et al.  Distances and weighting schemes for bag of visual words image retrieval , 2010, MIR '10.

[3]  Lei Wang,et al.  In defense of soft-assignment coding , 2011, 2011 International Conference on Computer Vision.

[4]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Hai Jin,et al.  Weighting scheme for image retrieval based on bag-of-visual-words , 2014, IET Image Process..

[7]  Cor J. Veenman,et al.  Visual Word Ambiguity , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Xin Chen,et al.  Spatial Weighting for Bag-of-Visual-Words and Its Application in Content-Based Image Retrieval , 2009, PAKDD.

[9]  Adnan Yazici,et al.  Towards Effective Image Classification Using Class-Specific Codebooks and Distinctive Local Features , 2015, IEEE Transactions on Multimedia.

[10]  C. V. Jawahar,et al.  Bag of Visual Words: A Soft Clustering Based Exposition , 2011, 2011 Third National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics.

[11]  Qingming Huang,et al.  Weighted visual vocabulary to balance the descriptive ability on general dataset , 2013, Neurocomputing.

[12]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[13]  Andrew Zisserman,et al.  The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.

[14]  Chong-Wah Ngo,et al.  Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.

[15]  Xuelong Li,et al.  Fuzzy bag of words for social image description , 2014, Multimedia Tools and Applications.

[16]  Wassim Bouachir,et al.  Fuzzy indexing for Bag of Features scene categorization , 2010, 2010 5th International Symposium On I/V Communications and Mobile Network.

[17]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[18]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[19]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.