A combined hierarchical model for automatic image annotation and retrieval

Automatic image annotation is the process by which a computer system automatically assigns metadata in the form of captioning or keywords to a digital image. This application of computer vision technique is used in image retrieval system to organize and locate images of interest from a database. Many techniques have been proposed for image annotation in the last decade that gives reasonable performance on standard datasets. In this work, we introduce an innovative hybrid model for image annotation that treats annotation as a retrieval problem. The proposed technique utilizes low level image features and a simple combination of basic distances using JEC to find the nearest neighbors of a given image; the keywords are then assigned using SVM approach which aims to explore the combination of three different methods. First, the initial annotation of the data using two known methods, and that takes the hierarchy into consideration by classifying consecutively its instances; finally, we make use of pair wise majority voting between methods by simply summing strings in order to produce a final annotation. The proposed technique results show that this method outperforms the current state of art methods on the standard datasets.

[1]  R. Manmatha,et al.  Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[2]  Axel Pinz,et al.  Computer Vision – ECCV 2006 , 2006, Lecture Notes in Computer Science.

[3]  Antonio Torralba,et al.  Statistics of natural image categories , 2003, Network.

[4]  R. Manmatha,et al.  A Model for Learning the Semantics of Pictures , 2003, NIPS.

[5]  Jaime S. Cardoso,et al.  Hierarchical medical image annotation using SVM-based approaches , 2010, Proceedings of the 10th IEEE International Conference on Information Technology and Applications in Biomedicine.

[6]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[7]  Md Jan Nordin,et al.  Review on statistical approaches for automatic image annotation , 2009, 2009 International Conference on Electrical Engineering and Informatics.

[8]  R. Manmatha,et al.  Multiple Bernoulli relevance models for image and video annotation , 2004, CVPR 2004.

[9]  Chihli Hung and Chih-Fong Tsai,et al.  Automatically Annotating Images with Keywords: A Review of Image Annotation Systems , 2008 .

[10]  George Loizou,et al.  Computer vision and pattern recognition , 2007, Int. J. Comput. Math..

[11]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[12]  Daniel Gatica-Perez,et al.  PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Vladimir Pavlovic,et al.  A New Baseline for Image Annotation , 2008, ECCV.

[15]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[16]  Jiayu Tang Automatic image annotation and object detection , 2008 .

[17]  Y. Mori,et al.  Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[18]  Gang Wang,et al.  An empirical study of automatic image annotation through Multi-Instance Multi-Label Learning , 2010, 2010 IEEE Youth Conference on Information, Computing and Telecommunications.

[19]  Maozhen Li,et al.  Evaluating Machine Learning Techniques for Automatic Image Annotations , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[20]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[21]  Jianping Fan,et al.  Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers , 2006, MM '06.

[22]  Hervé Glotin,et al.  Efficient image concept indexing by harmonic & arithmetic profiles entropy , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).