A Content-Based Method to Enhance Tag Recommendation

Tagging has become a primary tool for users to organize and share digital content on many social media sites. In addition, tag information has been shown to enhance capabilities of existing search engines. However, many resources on the web still lack tag information. This paper proposes a content-based approach to tag recommendation which can be applied to webpages with or without prior tag information. While social bookmarking service such as Delicious enables users to share annotated bookmarks, tag recommendation is available only for pages with tags specified by other users. Our proposed approach is motivated by the observation that similar webpages tend to have the same tags. Each webpage can therefore share the tags they own with similar webpages. The propagation of a tag depends on its weight in the originating webpage and the similarity between the sending and receiving webpages. The similarity metric between two webpages is defined as a linear combination of four cosine similarities, taking into account both tag information and page content. Experiments using data crawled from Delicious show that the proposed method is effective in populating untagged webpages with the correct tags.

[1]  Dinan Gunawardena,et al.  Social tags: meaning and suggestions , 2008, CIKM '08.

[2]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[3]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[4]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[5]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[7]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[8]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[9]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[10]  Aditya Ghose,et al.  The Optimizing WEB , 2011 .

[11]  AgrawalRakesh,et al.  Mining association rules between sets of items in large databases , 1993 .

[12]  Hector Garcia-Molina,et al.  Social tag prediction , 2008, SIGIR '08.

[13]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .

[14]  Andreas Hotho,et al.  Tag Recommendations in Folksonomies , 2007, LWA.

[15]  Wolfgang Nejdl,et al.  Can all tags be used for search? , 2008, CIKM '08.

[16]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.