论文信息 - Content-Based Image Annotation Refinement

Content-Based Image Annotation Refinement

Automatic image annotation has been an active research topic due to its great importance in image retrieval and management. However, results of the state-of-the-art image annotation methods are often unsatisfactory. Despite continuous efforts in inventing new annotation algorithms, it would be advantageous to develop a dedicated approach that could refine imprecise annotations. In this paper, a novel approach to automatically refining the original annotations of images is proposed. For a query image, an existing image annotation method is first employed to obtain a set of candidate annotations. Then, the candidate annotations are re-ranked and only the top ones are reserved as the final annotations. By formulating the annotation refinement process as a Markov process and defining the candidate annotations as the states of a Markov chain, a content-based image annotation refinement (CIAR) algorithm is proposed to re-rank the candidate annotations. It leverages both corpus information and the content feature of a query image. Experimental results on a typical Corel dataset show not only the validity of the refinement, but also the superiority of the proposed algorithm over existing ones.

[1] Jianping Fan,et al. Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers , 2006, MM '06.

[2] Gustavo Carneiro,et al. Formulating semantic image annotation as a supervised learning problem , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.

[4] Changhu Wang,et al. Scalable search-based image annotation of personal images , 2006, MIR '06.

[5] Gustavo Carneiro,et al. A database centric view of semantic image annotation and retrieval , 2005, SIGIR '05.

[6] R. Manmatha,et al. Multiple Bernoulli relevance models for image and video annotation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[7] Latifur Khan,et al. Image annotations by combining multiple evidence & wordNet , 2005, ACM Multimedia.

[8] L. Breuer. Introduction to Stochastic Processes , 2022, Statistical Methods for Climate Scientists.

[9] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[10] W. Bruce Croft,et al. Relevance-Based Language Models , 2001, SIGIR '01.

[11] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[12] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.

[13] Jitendra Malik,et al. Normalized Cut and Image Segmentation , 1997 .

[14] Wei-Ying Ma,et al. Image annotation by large-scale content-based image retrieval , 2006, MM '06.

[15] Changhu Wang,et al. Image annotation refinement using random walk with restarts , 2006, MM '06.

[16] Jing Hua,et al. Region-based Image Annotation using Asymmetrical Support Vector Machine-based Multiple-Instance Learning , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17] Edward Y. Chang,et al. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines , 2003, IEEE Trans. Circuits Syst. Video Technol..

[18] Raimondo Schettini,et al. Image annotation using SVM , 2003, IS&T/SPIE Electronic Imaging.

[19] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[20] J. Jeon,et al. Automatic Image Annotation of News Images with Large Vocabularies and Low Quality Training Data , 2004 .

[21] Y. Mori,et al. Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[22] R. Manmatha,et al. A Model for Learning the Semantics of Pictures , 2003, NIPS.

[23] Wei-Ying Ma,et al. AnnoSearch: Image Auto-Annotation by Search , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24] Gene H. Golub,et al. Matrix computations , 1983 .