On Kernel Information Propagation for Tag Clustering in Social Annotation Systems

In social annotation systems, users label digital resources by using tags which are freely chosen textual descriptors. Tags are used to index, annotate and retrieve resource as an additional metadata of resource. Poor retrieval performance remains a major challenge of most social annotation systems resulting from the severe problems of ambiguity, redundancy and less semantic nature of tags. Clustering method is a useful approach to handle these problems in the social annotation systems. In this paper, we propose a novel clustering algorithm named kernel information propagation for tag clustering. This approach makes use of the kernel density estimation of the KNN neighbor directed graph as a start to reveal the prestige rank of tags in tagging data. The random walk with restart algorithm is then employed to determine the center points of tag clusters. The main strength of the proposed approach is the capability of partitioning tags from the perspective of tag prestige rank rather than the intuitive similarity calculation itself. Experimental studies on three real world datasets demonstrate the effectiveness and superiority of the proposed method.

[1]  Andreas Hotho,et al.  Tag Recommendations in Folksonomies , 2007, LWA.

[2]  Chun Chen,et al.  Document recommendation in social tagging services , 2010, WWW '10.

[3]  Flavius Frasincar,et al.  Searching and Browsing Tag Spaces Using the Semantic Tag Clustering Search Framework , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[4]  Bamshad Mobasher,et al.  Personalized recommendation in social tagging systems using hierarchical clustering , 2008, RecSys '08.

[5]  Larry A. Wasserman,et al.  Sparse Nonparametric Density Estimation in High Dimensions Using the Rodeo , 2007, AISTATS.

[6]  Frederico Araújo Durão,et al.  Extending a hybrid tag-based recommender system with personalization , 2010, SAC '10.

[7]  Paolo Avesani,et al.  Using Tags and Clustering to Identify Topic-Relevant Blogs , 2007, ICWSM.

[8]  Jimeng Sun,et al.  Neighborhood formation and anomaly detection in bipartite graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[9]  Susan T. Dumais,et al.  Bringing order to the Web: automatically categorizing search results , 2000, CHI.

[10]  Lars Schmidt-Thieme,et al.  Tag-aware recommender systems by fusion of collaborative filtering algorithms , 2008, SAC '08.

[11]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[12]  Chun Chen,et al.  Personalized tag recommendation using graph-based ranking on multi-type interrelated objects , 2009, SIGIR.

[13]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[14]  Christoph Meinel,et al.  Web Search Personalization Via Social Bookmarking and Tagging , 2007, ISWC/ASWC.

[15]  Joost N. Kok,et al.  Knowledge Discovery in Databases: PKDD 2007, 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw, Poland, September 17-21, 2007, Proceedings , 2007, PKDD.

[16]  Sebastian Risi,et al.  Visualization and Clustering of Tagged Music Data , 2007, GfKl.