Image Auto-Annotation and Retrieval Using Saliency Region Detecting and Segmentation Algorithm

Automatically assigning one or more relevant keywords to image has important significance. It is easier for people to retrieve and understand large collections of image data. Recent years much research has focused upon this field. In this paper, we introduce a salient region detection and segmentation algorithm used for image retrieval and keywords auto-annotation. We investigate the properties of a bin-cross bin metric between two feather-vectors called the Earth Mover's Distance (EMD), to enhance the precision and recall performance. The EMD is based on a solution to the transportation problem from linear optimization. It is more robust than histogram matching techniques. In this paper we only focus on applications about color-feathers, and we compare the performances about image auto-annotation and retrieval between EMD and other histogram matching distances. The results indicate that our methods are more flexible and reliable.

[1]  Youbin Chen,et al.  Evaluation on color spaces and distance measures for color histogram-based image retrieval , 2011, International Conference on Graphic and Image Processing.

[2]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jiebo Luo,et al.  Large-scale multimodal semantic concept detection for consumer video , 2007, MIR '07.

[4]  Nicu Sebe,et al.  Exploiting the entire feature space with sparsity for automatic image annotation , 2011, ACM Multimedia.

[5]  Shamik Sural,et al.  A comparative analysis of two distance measures in color image databases , 2002, Proceedings. International Conference on Image Processing.

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  Jing Liu,et al.  Image annotation via graph learning , 2009, Pattern Recognit..

[8]  James Ze Wang,et al.  Real-Time Computerized Annotation of Pictures , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Feichao Wang A Survey on Automatic Image Annotation and Trends of the New Age , 2011 .

[10]  Simone Frintrop,et al.  A Real-time Visual Attention System Using Integral Images , 2007, ICVS 2007.

[11]  Z WangJames,et al.  Real-Time Computerized Annotation of Pictures , 2008 .

[12]  Xi Chen,et al.  Learning feature distance measures for image correspondences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[14]  Dongjian He,et al.  Ensemble of multiple descriptors for automatic image annotation , 2010, 2010 3rd International Congress on Image and Signal Processing.

[15]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[16]  Jacob D. Furst,et al.  Effect of Image Linearization on Normalized Compression Distance , 2009, FGIT-SIP.

[17]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[18]  Md Jan Nordin,et al.  Review on statistical approaches for automatic image annotation , 2009, 2009 International Conference on Electrical Engineering and Informatics.

[19]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[22]  Sabine Süsstrunk,et al.  Salient Region Detection and Segmentation , 2008, ICVS.

[23]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[25]  Samy Bengio,et al.  A Discriminative Kernel-Based Approach to Rank Images from Text Queries , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Bernt Schiele,et al.  International Journal of Computer Vision manuscript No. (will be inserted by the editor) Semantic Modeling of Natural Scenes for Content-Based Image Retrieval , 2022 .

[27]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[28]  Yang Yu,et al.  Automatic image annotation using group sparsity , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Cordelia Schmid,et al.  TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[31]  Chong-Wah Ngo,et al.  Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.