Visual attention mechanism and support vector machine based automatic image annotation

Automatic image annotation not only has the efficiency of text-based image retrieval but also achieves the accuracy of content-based image retrieval. Users of annotated images can locate images they want to search by providing keywords. Currently most automatic image annotation algorithms do not consider the relative importance of each region in the image, and some algorithms extract the image features as a whole. This makes it difficult for annotation words to reflect salient versus non-salient areas of the image. Users searching for images are usually only interested in the salient areas. We propose an algorithm that integrates a visual attention mechanism with image annotation. A preprocessing step divides the image into two parts, the salient regions and everything else, and the annotation step places a greater weight on the salient region. When the image is annotated, words relating to the salient region are given first. The support vector machine uses particle swarm optimization to annotate the images automatically. Experimental results show the effectiveness of the proposed algorithm.

[1]  Samet Hicsonmez,et al.  Creating image tags for text based image retrieval using additional corpora , 2016, 2016 24th Signal Processing and Communication Application Conference (SIU).

[2]  Pankoo Kim,et al.  Automatic Image Annotation Using Semantic Text Analysis , 2012, CD-ARES.

[3]  Jianping Fan,et al.  Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers , 2006, MM '06.

[4]  Qiang Ji,et al.  Multi-label learning with missing labels for image annotation and facial action unit recognition , 2015, Pattern Recognit..

[5]  Cong Jin,et al.  A multi-label image annotation scheme based on improved SVM multiple kernel learning , 2017, International Conference on Graphic and Image Processing.

[6]  Jason Weston,et al.  WSABIE: Scaling Up to Large Vocabulary Image Annotation , 2011, IJCAI.

[7]  Cong Jin,et al.  Image distance metric learning based on neighborhood sets for automatic image annotation , 2016, J. Vis. Commun. Image Represent..

[8]  Hongwei Ge,et al.  Automatic Image Annotation Based on Particle Swarm Optimization and Support Vector Clustering , 2017 .

[9]  George Loizou,et al.  Computer vision and pattern recognition , 2007, Int. J. Comput. Math..

[10]  Ronald M. Summers,et al.  Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Lamberto Ballan,et al.  Love Thy Neighbors: Image Annotation by Exploiting Image Metadata , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Nicu Sebe,et al.  Content-based image retrieval using wavelet-based salient points , 2000, IS&T/SPIE Electronic Imaging.

[13]  Zhihua Xia,et al.  A Privacy-Preserving and Copy-Deterrence Content-Based Image Retrieval Scheme in Cloud Computing , 2016, IEEE Transactions on Information Forensics and Security.

[14]  Kin-Man Lam,et al.  Scene cut detection using the colored pattern appearance model , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[15]  Dapeng Tao,et al.  Manifold regularized kernel logistic regression for web image annotation , 2013, Neurocomputing.

[16]  Vladimir Pavlovic,et al.  A New Baseline for Image Annotation , 2008, ECCV.

[17]  Raimondo Schettini,et al.  Image annotation using SVM , 2003, IS&T/SPIE Electronic Imaging.

[18]  Qi Tian,et al.  Adaptive Discriminant Projection for Content-based Image Retrieval , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[19]  Weifeng Liu,et al.  Multiview Hessian Regularization for Image Annotation , 2013, IEEE Transactions on Image Processing.

[20]  C. V. Jawahar,et al.  Exploring SVM for Image Annotation in Presence of Confusing Labels , 2013, BMVC.

[21]  Jafar Majidpour,et al.  Interactive tool to improve the automatic image annotation using MPEG-7 and multi-class SVM , 2015, 2015 7th Conference on Information and Knowledge Technology (IKT).

[22]  Barbara Caputo,et al.  CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach , 2008, CLEF.

[23]  Alberto Del Bimbo,et al.  Automatic image annotation via label transfer in the semantic space , 2016, Pattern Recognit..

[24]  Anu Bala,et al.  Local texton XOR patterns: A new feature descriptor for content-based image retrieval , 2016 .

[25]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[26]  Hassan Foroosh,et al.  Feature-independent context estimation for automatic image annotation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  David Zhang,et al.  Multi-Label Dictionary Learning for Image Annotation , 2016, IEEE Transactions on Image Processing.

[28]  Mansour Jamzad,et al.  Image annotation using multi-view non-negative matrix factorization with different number of basis vectors , 2017, J. Vis. Commun. Image Represent..