Can Social Tagging Improve Web Image Search?

Conventional Web image search engines can return reasonably accurate results for queries containing concrete terms, but the results are less accurate for queries containing only abstract terms, such as "spring" or "peace." To improve the recall ratio without drastically degrading the precision ratio, we developed a method that replaces an abstract query term given by a user with a set of concrete terms and that uses these terms in queries input into Web image search engines. Concrete terms are found for a given abstract term by making use of social tagging information extracted from a social photo sharing system, such as Flickr. This information is rich in user impressions about the objects in the images. The extraction and replacement are done by (1) collecting social tags that include the abstract term, (2) clustering the tags in accordance with the term co-occurrence of images, (3) selecting concrete terms from the clusters by using WordNet, and (4) identifying sets of concrete terms that are associated with the target abstract term by using a technique for association rule mining. Experimental results show that our method improves the recall ratio of Web image searches.

[1]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[2]  Marcel Worring,et al.  Classification of user image descriptions , 2004, Int. J. Hum. Comput. Stud..

[3]  Katsumi Tanaka,et al.  RelaxImage: a cross-media meta-search engine for searching images from Web based on query relaxation , 2005, 21st International Conference on Data Engineering (ICDE'05).

[4]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.

[5]  Kristina Lerman,et al.  Social Browsing on Flickr , 2006, ICWSM.

[6]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[7]  R. Manmatha,et al.  A Model for Learning the Semantics of Pictures , 2003, NIPS.

[8]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[9]  Remco C. Veltkamp,et al.  Content-based image retrieval systems: A survey , 2000 .

[10]  Satoshi Nakamura,et al.  Can social bookmarking enhance search in the web? , 2007, JCDL '07.

[11]  Mads Nielsen,et al.  Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.

[12]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[13]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[14]  Stefan M. Rüger,et al.  High-dimensional visual vocabularies for image retrieval , 2007, SIGIR.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  Mor Naaman,et al.  Towards automatic extraction of event and place semantics from flickr tags , 2007, SIGIR.