Topic-based ranking in Folksonomy via probabilistic model

Social tagging is an increasingly popular way to describe and classify documents on the web. Because tags are authored freely, however, their quality varies considerably, and rating them becomes an important problem. Most social tagging systems simply list tags in input order, which conveys little about their importance or relevance. This limits tag-based applications such as information search and tag recommendation. In this paper, we focus on computing the authority score of tags over the whole tag space, conditional on topics, and propose a topic-sensitive tag ranking (TSTR) approach that ranks tags automatically according to their topic relevance. We first extract topics from the folksonomy using a probabilistic model and then construct a transition probability graph. Finally, we perform a topic-level random walk on the graph to obtain topic rank scores for the tags. Experimental results show that the proposed tag ranking method is both effective and efficient. We also apply tag ranking to tag recommendation, which demonstrates that the proposed approach improves the performance of social-tagging applications.
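To make the topic-level random walk concrete, the following is a minimal sketch in the style of topic-sensitive PageRank. It is not the TSTR formulation itself, which the abstract does not spell out. The assumptions are ours: the transition graph is built from a toy tag co-occurrence matrix, the per-topic tag distributions (`topic_priors`) are written by hand rather than learned by a probabilistic topic model, and the names `normalize_rows`, `topic_random_walk`, and the damping value 0.85 are hypothetical choices for illustration.

```python
def normalize_rows(weights):
    """Turn a non-negative weight matrix into a row-stochastic transition matrix."""
    n = len(weights)
    matrix = []
    for row in weights:
        total = sum(row)
        # A tag with no outgoing weight jumps uniformly to every tag.
        matrix.append([w / total for w in row] if total > 0 else [1.0 / n] * n)
    return matrix


def topic_random_walk(transition, topic_prior, damping=0.85, tol=1e-10, max_iter=1000):
    """Power iteration for a random walk that teleports according to topic_prior.

    transition  : row-stochastic matrix, transition[i][j] = P(tag i -> tag j)
    topic_prior : distribution over tags for one topic (sums to 1)
    """
    n = len(transition)
    scores = [1.0 / n] * n
    for _ in range(max_iter):
        updated = [
            damping * sum(scores[i] * transition[i][j] for i in range(n))
            + (1.0 - damping) * topic_prior[j]
            for j in range(n)
        ]
        delta = sum(abs(a - b) for a, b in zip(updated, scores))
        scores = updated
        if delta < tol:
            break
    return scores


# Toy data (assumed, not from the paper): symmetric tag co-occurrence counts.
tags = ["python", "code", "recipe", "cooking"]
cooccurrence = [
    [0, 3, 0, 0],
    [3, 0, 1, 0],
    [0, 1, 0, 3],
    [0, 0, 3, 0],
]
# Hand-written stand-ins for the tag distributions a topic model would produce.
topic_priors = {
    "programming": [0.45, 0.45, 0.05, 0.05],
    "food": [0.05, 0.05, 0.45, 0.45],
}

transition = normalize_rows(cooccurrence)
topic_scores = {
    topic: topic_random_walk(transition, prior)
    for topic, prior in topic_priors.items()
}
# Tags sorted by descending score, one ranking per topic.
ranked = {
    topic: [tag for _, tag in sorted(zip(scores, tags), reverse=True)]
    for topic, scores in topic_scores.items()
}

for topic, order in ranked.items():
    print(topic, order)
```

Running one walk per topic yields a separate score vector for each topic, so the same tag can rank high under one topic and low under another. The teleport term is what ties the walk to a topic; with a uniform prior the sketch reduces to an ordinary PageRank-style walk over the tag graph.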
