Currently text-based retrieval approaches, which utilize web textual information to index and access images, are still widely used by many modern prevalent search engines due to the nature of simplicity and effectiveness. However, page documents often include texts irrelevant to image contents, becoming an obstacle for high-quality image retrieval. In this paper we propose a novel model to improve traditional text-based image retrieval by integrating weighted image annotation keywords and web texts seamlessly. Different from traditional text-based image retrieval models, the proposed model retrieves and ranks images depending on not only texts of web document but also image annotations. To verify the proposed model, some term-based queries are performed on three models, and results have shown that our model performs best.
[1]
Latifur Khan,et al.
Image annotations by combining multiple evidence & wordNet
,
2005,
ACM Multimedia.
[2]
David W. Conrath,et al.
Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy
,
1997,
ROCLING/IJCLCLP.
[3]
Christiane Fellbaum,et al.
Book Reviews: WordNet: An Electronic Lexical Database
,
1999,
CL.
[4]
Sanjeev Khudanpur,et al.
Hidden Markov models for automatic annotation and content-based retrieval of images and video
,
2005,
SIGIR '05.
[5]
Berthier A. Ribeiro-Neto,et al.
Image retrieval using multiple evidence ranking
,
2004,
IEEE Transactions on Knowledge and Data Engineering.