Analysis of tag within online social networks

In recent years, tagging systems have been paid increasing attentions from both research communities and system designers. Most popular online social networking sites harness tag for managing and locating contents, for organizing and connecting users, and for recommending and sharing resources. We believe that tag acts like bridge between people and resources. Research on tag and tagging behavior will provide us insight about resource space and user activities on the Internet. In this paper, we present a two-level analysis of the tagging system of Del.icio.us. The results from both two levels confirm each other. In network level, we connect tags by users collaborative tagging to form a social network of tags. By investigating its network feature, we find phenomena of small world and scale-free network. We also discover that the links within this network have relatively strong semantic relatedness. In individual level, users' tagging behaviors and patterns are observed by visualizing their bookmarking history on Del.icio.us. Besides, we study the linked users by their tags and find that users within a subscription network share more common interests than random pairs of users. During the analysis, we also discuss the implications of the findings for the design of tag-based system.

[1]  Ravi Kumar,et al.  Visualizing tags over time , 2006, WWW '06.

[2]  Siegfried Handschuh,et al.  P-TAG: large scale automatic generation of personalized annotation tags for the web , 2007, WWW '07.

[3]  Yong Yu,et al.  Exploring social annotations for the semantic web , 2006, WWW '06.

[4]  Steffen Staab,et al.  Towards the self-annotating web , 2004, WWW '04.

[5]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[6]  Rui Li,et al.  Towards effective browsing of large scale social annotations , 2007, WWW '07.

[7]  Stan Szpakowicz,et al.  Roget's thesaurus and semantic similarity , 2012, RANLP.

[8]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[9]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[10]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[11]  Bradley Malin,et al.  Email alias detection using social network analysis , 2005, LinkKDD '05.

[12]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[13]  Thomas Ertl,et al.  Explanatory and illustrative visualization of special and general relativity , 2006, IEEE Transactions on Visualization and Computer Graphics.

[14]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[15]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[16]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[17]  Anthony H. Dekker,et al.  Visualisation of Social Networks Using CAVALIER , 2001, InVis.au.

[18]  Gerhard Weiß,et al.  Social Annotation of Semantically Heterogeneous Knowledge , 2004, SemAnnot@ISWC.

[19]  Paul M. B. Vitányi,et al.  Similarity of Objects and the Meaning of Words , 2006, TAMC.

[20]  Terrell Russell,et al.  cloudalicious: folksonomy over time , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[21]  Dongwon Lee,et al.  On six degrees of separation in DBLP-DB and more , 2005, SGMD.

[22]  Krishna P. Gummadi,et al.  Exploiting Social Networks for Internet Search , 2006, HotNets.

[23]  Ramanathan V. Guha,et al.  SemTag and seeker: bootstrapping the semantic web via automated semantic annotation , 2003, WWW '03.

[24]  Myke Gluck,et al.  Visual Explanations: Images and Quantities, Evidence and Narrative , 1997, Inf. Process. Manag..

[25]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[26]  Tad Hogg,et al.  Enhancing reputation mechanisms via online social networks , 2004, EC '04.

[27]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[28]  Filippo Menczer,et al.  Algorithmic detection of semantic similarity , 2005, WWW '05.

[29]  Kathy J. Lee What goes around comes around: an analysis of del.icio.us as social space , 2006, CSCW '06.

[30]  Robert L. Grossman,et al.  Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , 2005, KDD 2005.

[31]  Lars Schmidt-Thieme,et al.  Tag-aware recommender systems by fusion of collaborative filtering algorithms , 2008, SAC '08.

[32]  Vittorio Loreto,et al.  Folksonomies, the semantic web, and movie recommendation , 2007 .

[33]  Hawoong Jeong,et al.  Statistical properties of sampled networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[35]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[36]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[37]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[38]  Michael Kaminsky,et al.  SybilGuard: Defending Against Sybil Attacks via Social Networks , 2008, IEEE/ACM Transactions on Networking.

[39]  Martin Wattenberg,et al.  Visualizing Activity on Wikipedia with Chromograms , 2007, INTERACT.

[40]  Barry Wellman,et al.  For a social network analysis of computer networks: a sociological perspective on collaborative work and virtual community , 1996, SIGCPR '96.

[41]  Tony Hammond,et al.  Social Bookmarking Tools (I): A General Overview , 2005, D Lib Mag..