Tag Sense Disambiguation for Clarifying the Vocabulary of Social Tags

Tagging is one of the most popular services in Web 2.0. Social tagging, a special form of tagging, is performed collaboratively by many users and gives rise to a so-called folksonomy. As tagging has become widespread on the Web, the tag vocabulary has become informal, uncontrolled, and personalized. As a result, many tags are unfamiliar or ambiguous, and users often fail to understand what a tag means. In this paper, we propose a tag sense disambiguation method, called Tag Sense Disambiguation (TSD), that works in the social tagging environment. TSD can be applied to the vocabulary of social tags, enabling users to look up the meaning of each tag through Wikipedia. To find the correct mappings from del.icio.us tags to Wikipedia articles, we define, based on tag co-occurrences, the Local Neighbor tags, the Global Neighbor tags, and finally the Neighbor tags, which serve as keywords for disambiguating the sense of each tag. The automatically built mappings are reasonable in most cases, and the experiment shows that TSD finds the correct mappings with high accuracy.
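The abstract does not spell out how the Local, Global, and combined Neighbor tags are computed, so the following is only a minimal sketch of the underlying idea of tag co-occurrence, not the TSD method itself. It assumes each bookmark is represented as a list of tags; the function names `cooccurrence_counts` and `top_cooccurring` and the toy data are hypothetical and introduced purely for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(posts):
    """Count how often each unordered pair of tags appears in the same post."""
    counts = Counter()
    for tags in posts:
        # De-duplicate and sort so each pair has one canonical (a, b) order.
        for a, b in combinations(sorted(set(tags)), 2):
            counts[(a, b)] += 1
    return counts


def top_cooccurring(tag, counts, k=3):
    """Return up to k tags that co-occur most often with `tag`.

    This is an illustrative stand-in for "neighbor" tags, not the
    definition used by TSD.
    """
    scores = Counter()
    for (a, b), n in counts.items():
        if a == tag:
            scores[b] += n
        elif b == tag:
            scores[a] += n
    # Sort by descending count, breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [t for t, _ in ranked[:k]]


# Toy data (hypothetical): each inner list is the tag set of one bookmark.
posts = [
    ["apple", "mac", "osx"],
    ["apple", "fruit", "recipe"],
    ["apple", "mac", "iphone"],
    ["python", "programming"],
]

counts = cooccurrence_counts(posts)
neighbors = top_cooccurring("apple", counts, k=2)
print(neighbors)
```

In a sketch like this, the co-occurring tags of an ambiguous tag such as "apple" could then be compared against the text of candidate Wikipedia articles to pick the most plausible sense; how TSD actually performs that comparison is not described in the abstract.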
