Improving Bag of Visual Words Image Retrieval: A Fuzzy Weighting Scheme for Efficient Indexation

Recent works on Content Based Image Retrieval rely on Bag of Visual Words to index images. Analogically to the Bag of Words approach used in text retrieval, this model allows describing an image as a bag of elementary local features called visual words. As a result, an image is represented by a vector of weights, where each weight corresponds to the importance of a visual word in the image. The choice of local features and the weighting scheme are very important to perform image retrieval. The existing weighting schemes are mostly migrated from text retrieval domain and don’t take into account fundamental differences between textual words and visual words. In this paper, a novel approach based on Scale Invariant Features Transform (SIFT) features and a new weighting scheme is proposed. The proposed scheme uses a fuzzy representation to index images with a more robust signature. Experimental results with the Coil-100 image database demonstrate that the proposed method produces better performance than known term weighting representations.

[1]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  Shawn D. Newsam,et al.  Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery , 2008, 2008 15th IEEE International Conference on Image Processing.

[5]  Shawn D. Newsam,et al.  Comparing global and interest point descriptors for similarity retrieval in remote sensed imagery , 2007, GIS.

[6]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[7]  Wen-Tzu Chen,et al.  Computer simulation of the human-body effects on a circular-loop-wire antenna for radio-pager communications at 152, 280, and 400 MHz , 1997 .

[8]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[9]  Jens-Rainer Ohm,et al.  A Visual Search Engine for Distributed Image and Video Database Retrieval Applications , 1999, VISUAL.

[10]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Raphaël Marée,et al.  Random subwindows for robust image classification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Bernt Schiele,et al.  On Performance Characterization and Optimization for Image Retrieval , 2002, ECCV.

[13]  Chong-Wah Ngo,et al.  Evaluating bag-of-visual-words representations in scene classification , 2007, MIR '07.

[14]  William Scanlon,et al.  Numerical analysis of bodyworn UHF antenna systems , 2001 .

[15]  Yang Hao,et al.  Wireless body sensor networks for health-monitoring applications , 2008, Physiological measurement.

[16]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[17]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[18]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[19]  Shih-Fu Chang,et al.  Single color extraction and image query , 1995, Proceedings., International Conference on Image Processing.

[20]  Y. Rahmat-Samii,et al.  Dual-band wearable textile antenna , 2004, IEEE Antennas and Propagation Society Symposium, 2004..

[21]  Yang Hao,et al.  Statistical Analysis and Performance Evaluation for On-Body Radio Propagation With Microstrip Patch Antennas , 2007, IEEE Transactions on Antennas and Propagation.

[22]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[23]  Chong-Wah Ngo,et al.  Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help? , 2006, CIVR.

[24]  Shimon Ullman,et al.  Object recognition with informative features and linear classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[25]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[26]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.