Automatic Image Annotation by Mining the Web

Automatic image annotation has been becoming an attractive research subject. Most current image annotation methods are based on training techniques. The major weaknesses of such solutions include limited annotation vocabulary and labor-intensive involvement. However, Web images possess a lot of texts, and rich annotation of samples is provided. Therefore, this report provides a novel image annotation method by mining the Web that term-image correlation is obtained from the Web not by learning. Without question, there are many noises in that relation, and some cleaning works are necessary. In the system, entropy weighting and image clustering technique are employed. Our experiment results show that our solution can achieve a satisfactory performance.

[1]  Edward Y. Chang,et al.  CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  Beng Chin Ooi,et al.  Giving meanings to WWW images , 2000, MM 2000.

[3]  SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28 - August 1, 2003, Toronto, Canada , 2003, SIGIR.

[4]  Y. Mori,et al.  Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[5]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[6]  Pu-Jen Cheng,et al.  Effective image annotation for searches using multilevel semantics , 2004, International Journal on Digital Libraries.

[7]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[8]  Zhiguo Gong,et al.  An Implementation of Web Image Search Engines , 2004, ICADL.

[9]  Susan T. Dumais,et al.  Improving the retrieval of information from external sources , 1991 .

[10]  R. Manmatha,et al.  A Model for Learning the Semantics of Pictures , 2003, NIPS.

[11]  Tat-Seng Chua,et al.  A Novel Approach to Auto Image Annotation Based on Pairwise Constrained Clustering and Semi-Naïve Bayesian Model , 2005, 11th International Multimedia Modelling Conference.

[12]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[13]  Wan-Chi Siu,et al.  Multimedia Information Retrieval and Management , 2003 .

[14]  David A. Forsyth,et al.  Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[15]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[16]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[17]  James Ze Wang,et al.  Learning-based linguistic indexing of pictures with 2--d MHMMs , 2002, MULTIMEDIA '02.

[18]  Tat-Seng Chua,et al.  A bootstrapping approach to annotating large image collection , 2003, MIR '03.