The Wisdom of the Few? "Supertaggers" in Collaborative Tagging Systems

A folksonomy is ostensibly an information structure built up by the "wisdom of the crowd", but is the "crowd" really doing the work? Tagging is in fact a sharply skewed process in which a small minority of "supertagger" users generate an overwhelming majority of the annotations. Using data from three large-scale social tagging platforms, we explore (a) how to best quantify the imbalance in tagging behavior and formally define a supertagger, (b) how supertaggers differ from other users in their tagging patterns, and (c) if effects of motivation and expertise inform our understanding of what makes a supertagger. Our results indicate that such prolific users not only tag more than their counterparts, but in quantifiably different ways. Specifically, we find that supertaggers are more likely to label content in the long tail of less popular items, that they show differences in patterns of content tagged and terms utilized, and are measurably different with respect to tagging expertise and motivation. These findings suggest we should question the extent to which folksonomies achieve crowdsourced classification via the "wisdom of the crowd", especially for broad folksonomies like Last.fm as opposed to narrow folksonomies like Flickr.

[1]  C. Bauckhage,et al.  Analyzing Social Bookmarking Systems : A del . icio . us Cookbook , 2008 .

[2]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[3]  Mathias Niepert,et al.  A dynamic ontology for a dynamic reference work , 2007, JCDL '07.

[4]  Scott P. Robertson,et al.  Proceedings of the SIGCHI Conference on Human Factors in Computing Systems , 1991 .

[5]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[6]  Markus Strohmaier,et al.  Of categorizers and describers: an evaluation of quantitative measures for tagging motivation , 2010, HT '10.

[7]  Peter M. Todd,et al.  "Supertagger" behavior in building folksonomies , 2014, WebSci '14.

[8]  Steffen Staab,et al.  PINTS: peer-to-peer infrastructure for tagging systems , 2008, IPTPS.

[9]  Christoph Meinel,et al.  On Measuring Expertise in Collaborative Tagging Systems , 2009 .

[10]  Oded Nov,et al.  What drives content tagging: the case of photos on Flickr , 2008, CHI.

[11]  Peter M. Todd,et al.  Can simple social copying heuristics explain tag popularity in a collaborative tagging system? , 2013, WebSci.

[12]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[13]  Christian Wolff,et al.  Personal Information Management vs. Resource Sharing: Towards a Model of Information Behavior in Social Tagging Systems , 2009, ICWSM.

[14]  Mor Naaman,et al.  Why we tag: motivations for annotation in mobile and online media , 2007, CHI.

[15]  Vittorio Loreto,et al.  Vocabulary growth in collaborative tagging systems , 2007, ArXiv.

[16]  Christoph Meinel,et al.  SPEAR: SPAMMING‐RESISTANT EXPERTISE ANALYSIS AND RANKING IN COLLABORATIVE TAGGING SYSTEMS , 2011, Comput. Intell..

[17]  J. Tanaka,et al.  Object categories and expertise: Is the basic level in the eye of the beholder? , 1991, Cognitive Psychology.

[18]  Andreas Hotho,et al.  BibSonomy: a social bookmark and publication sharing system , 2006 .

[19]  Valentin Robu,et al.  Emergence of consensus and shared vocabularies in collaborative tagging systems , 2009, TWEB.

[20]  E. Andrade Contemporary Physics , 1945, Nature.

[21]  Mario Kubek,et al.  Automatic Taxonomy Extraction through Mining Social Networks , 2010 .

[22]  Wei Dong,et al.  Facilitating Knowledge Exploration in Folksonomies: Expertise Ranking by Link and Semantic Structures , 2010, 2010 IEEE Second International Conference on Social Computing.

[23]  T. Rogers,et al.  Object categorization: reversals and explanations of the basic-level advantage. , 2007, Journal of experimental psychology. General.

[24]  Christian Wolff,et al.  Personal Information Management vs. Resource Sharing: Towards a Model of Information Behaviour in Social Tagging Systems , 2009, ICWSM 2009.

[25]  John Riedl,et al.  tagging, communities, vocabulary, evolution , 2006, CSCW '06.

[26]  YeChen,et al.  Why do people tag , 2010 .

[27]  Arkaitz Zubiaga,et al.  Tags vs shelves: from social tagging to social classification , 2011, HT '11.

[28]  Dominik Benz,et al.  Stop thinking, start tagging: tag semantics emerge from collaborative verbosity , 2010, WWW '10.

[29]  Markus Strohmaier,et al.  Why do Users Tag? Detecting Users' Motivation for Tagging in Social Tagging Systems , 2010, ICWSM.

[30]  Flavio Figueiredo,et al.  Assessing the quality of textual features in social media , 2013, Inf. Process. Manag..

[31]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.