A Survey on Automatic Image Annotation and Trends of the New Age

Abstract With the rapid development of digital cameras, we have witnessed great interest and promise in automatic image annotation as a hot research field. Automatic image annotation could help to retrieval images in a large scale image database more rapidly and precisely. In this paper, different approaches of automatic annotation are reviewed:1) generative model based image annotation, 2) discriminative model based image annotation, 3) Graph model based image annotation. Several key theoretical and empirical contributions in the current decade related to automatic image annotation are discussed. Based on the analysis of what have been achieved in recent years, we believe that automatic image annotation will be paid more and more attentions in the near future.

[1]  R. Manmatha,et al.  Multiple Bernoulli relevance models for image and video annotation , 2004, CVPR 2004.

[2]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[3]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[4]  Jing Liu,et al.  Image Annotation Based on Graph Learning: Image Annotation Based on Graph Learning , 2009 .

[5]  Christos Faloutsos,et al.  Automatic multimedia cross-modal correlation discovery , 2004, KDD.

[6]  Lu Jing and Ma Shaoping Region-Based Image Annotation Using Heuristic Support Vector Machine in Multiple-Instance Learning , 2009 .

[7]  Bin Wang,et al.  Dual cross-media relevance model for image annotation , 2007, ACM Multimedia.

[8]  Allan Hanbury,et al.  A survey of methods for image annotation , 2008, J. Vis. Lang. Comput..

[9]  Anil K. Jain,et al.  On image classification: city vs. landscape , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[10]  Martin Szummer,et al.  Indoor-outdoor image classification , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[11]  Tom Minka,et al.  Principled Hybrids of Generative and Discriminative Models , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Jianping Fan,et al.  Hierarchical classification for automatic image annotation , 2007, SIGIR.

[13]  Horst Bischof,et al.  Eigenboosting: Combining Discriminative and Generative Information , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jing Hua,et al.  Region-based Image Annotation using Asymmetrical Support Vector Machine-based Multiple-Instance Learning , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Hagai Attias,et al.  Topic regression multi-modal Latent Dirichlet Allocation for image annotation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[17]  Wei-Ying Ma,et al.  Bipartite graph reinforcement model for web image annotation , 2007, ACM Multimedia.

[18]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  R. Manmatha,et al.  A Model for Learning the Semantics of Pictures , 2003, NIPS.

[20]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[21]  Daniel Gatica-Perez,et al.  PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.

[22]  Anil K. Jain,et al.  On image classification: city images vs. landscapes , 1998, Pattern Recognit..

[23]  Jing Liu,et al.  Image annotation via graph learning , 2009, Pattern Recognit..

[24]  Lu Han Image Annotation Based on Graph Learning , 2008 .

[25]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.