Actively Mining Search Logs for Diverse Tags

Social tagging has become a very important mechanism for organizing information on the Web. Usually, people tag a web page manually, just as what they do on a social bookmarking web site. In this paper, we will demonstrate a brand-new perspective - tagging web pages automatically by mining search logs. In order to keep diversity, we first classify web queries into different categories and then extract tags from queries to depict each category. Thereafter we describe a web page with all queries which are related to this page, and finally we get the recommended tags for each web page after mapping the related queries into corresponding diverse tags. The experiments conducted on a real search log show that our method can dig out accurate and meaningful diverse tags for web pages more effectively.

[1]  Gisele L. Pappa,et al.  Exploiting co-occurrence and information quality metrics to recommend tags in web 2.0 applications , 2010, CIKM.

[2]  Amanda Spink,et al.  Determining the informational, navigational, and transactional intent of Web queries , 2008, Inf. Process. Manag..

[3]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[4]  Yan Zhang,et al.  Ontology enhancement and concept granularity learning: keeping yourself current and adaptive , 2011, KDD.

[5]  Aristides Gionis,et al.  Query similarity by projecting the query-flow graph , 2010, SIGIR.

[6]  Rui Li,et al.  Survey on social tagging techniques , 2010, SKDD.

[7]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[8]  Aaron D. Scriver Semantic Distance in WordNet: A Simplified and Improved Measure of Semantic Relatedness , 2006 .

[9]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .

[10]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.

[11]  Lee-Feng Chien,et al.  PAT-tree-based keyword extraction for Chinese information retrieval , 1997, SIGIR '97.

[12]  Qiang Yang,et al.  Query enrichment for web-query classification , 2006, TOIS.

[13]  Yan Zhang,et al.  Learning ontology resolution for document representation and its applications in text mining , 2010, CIKM '10.

[14]  Hongyuan Zha,et al.  Exploring social annotations for information retrieval , 2008, WWW.

[15]  Ryen W. White,et al.  Studying the use of popular destinations to enhance web search interaction , 2007, SIGIR.

[16]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[17]  Gang Wang,et al.  Understanding user's query intent with wikipedia , 2009, WWW '09.

[18]  Fabio Crestani,et al.  A statistical comparison of tag and query logs , 2009, SIGIR.