Personalization in Folksonomies Based on Tag Clustering

Collaborative tagging systems, sometimes referred to as “folksonomies,” enable Internet users to annotate or search for resources using custom labels instead of being restricted by pre-defined navigational or conceptual hierarchies. However, the flexibility of tagging brings with it certain costs. Because users are free to apply any tag to any resource, tagging systems contain large numbers of redundant, ambiguous, and idiosyncratic tags which can render resource discovery difficult. Data mining techniques such as clustering can be used to ameliorate this problem by reducing noise in the data and identifying trends. In particular, discovered patterns can be used to tailor the system’s output to a user based on the user’s tagging behavior. In this paper, we propose a method to personalize a user’s experience within a folksonomy using clustering. A personalized view can overcome ambiguity and idiosyncratic tag assignment, presenting users with tags and resources that correspond more closely to their intent. Specifically, we examine unsupervised clustering methods for extracting commonalities between tags, and use the discovered clusters as intermediaries between a user’s profile and resources in order to tailor the results of search to the user’s interests. We validate this approach through extensive evaluation of proposed personalization algorithm and the underlying clustering techniques using data from a real collaborative tagging Web site.

[1]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .

[2]  Susan T. Dumais,et al.  Bringing order to the Web: automatically categorizing search results , 2000, CHI.

[3]  Andrew K. Lui,et al.  Web Information Retrieval in Collaborative Tagging Systems , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[4]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[5]  David R. Millen,et al.  Dogear: Social bookmarking in the enterprise , 2006, CHI.

[6]  Hector Garcia-Molina,et al.  Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems , 2006 .

[7]  Tony Hammond,et al.  Social Bookmarking Tools (I): A General Overview , 2005, D Lib Mag..

[8]  Shinichi Honiden,et al.  Web Page Recommender System based on Folksonomy Mining for ITNG ’06 Submissions , 2006, Third International Conference on Information Technology: New Generations (ITNG'06).

[9]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[10]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[11]  Jack Minker,et al.  An Analysis of Some Graph Theoretical Cluster Techniques , 1970, JACM.

[12]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[13]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[14]  Susan T. Dumais,et al.  Characterizing the value of personalizing search , 2007, SIGIR.

[15]  Grigory Begelman,et al.  Automated Tag Clustering: Improving search and exploration in the tag space , 2006 .

[16]  J. Gower,et al.  Minimum Spanning Trees and Single Linkage Cluster Analysis , 1969 .

[17]  J. Hopcroft,et al.  Proceedings of the fifth annual ACM symposium on Theory of computing , 1977 .

[18]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.

[19]  Paolo Avesani,et al.  Using Tags and Clustering to Identify Topic-Relevant Blogs , 2007, ICWSM.

[20]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[21]  Rong Yan,et al.  An efficient manual image annotation approach based on tagging and browsing , 2007, MS '07.

[22]  Yong Yu,et al.  Exploring social annotations for the semantic web , 2006, WWW '06.

[23]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .