Majority Based Ranking Approach in Web Image Retrieval

In this paper, we address a ranking problem in web image retrieval. Due to the growing availability of web images, comprehensive retrieval of web images has been expected. Conventional systems for web image retrieval are based on keyword- based retrieval. However, we often find undesirable retrieval results from the keyword based web image retrieval system since the system uses the limited and inaccurate text information of web images ; a typical system uses text information such as surrounding texts and/or image filenames, etc. To alleviate this situation, we propose a new ranking approach which is the integration of results of text and image content via analyzing the retrieved results. We define four ranking methods based on the image contents analysis of the retrieved images; (1) majority-first method, (2) centroid-of-all method, (3) centroid-of-top K method, and (4) centroid-of-largest-cluster method. We evaluate the retrieval performance of our methods and conventional one using precision and recall graphs. The experimental results show that the proposed methods are more effective than conventional keyword-based retrieval methods.

[1]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[2]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[3]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[4]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Proceedings of International Conference on Image Processing.

[6]  Robert Villa,et al.  The effectiveness of query-specific hierarchic clustering in information retrieval , 2002, Inf. Process. Manag..

[7]  Ellen M. Voorhees The Cluster Hypothesis Revisited , 1985, SIGIR.

[8]  A.W.M. Smeulders,et al.  PicToSeek: A Content-based Image Search Engine for the WWW , 1997 .

[9]  Venkata Subramaniam,et al.  Information Retrieval: Data Structures & Algorithms , 1992 .

[10]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[11]  Marco La Cascia,et al.  Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web , 1999, Comput. Vis. Image Underst..

[12]  Jitendra Malik,et al.  Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Ellen M. Vdorhees The cluster hypothesis revisited , 1985, SIGIR 1985.

[15]  Heung-Kyu Lee,et al.  A Ranking Algorithm Using Dynamic Clustering for Content-Based Image Retrieval , 2002, CIVR.

[16]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[17]  Christos Faloutsos,et al.  Efficient and effective Querying by Image Content , 1994, Journal of Intelligent Information Systems.