SPEAR: SPAMMING‐RESISTANT EXPERTISE ANALYSIS AND RANKING IN COLLABORATIVE TAGGING SYSTEMS

In this article, we discuss the notions of experts and expertise in resource discovery in the context of collaborative tagging systems. We propose that the level of expertise of a user with respect to a particular topic is mainly determined by two factors. First, an expert should possess a high‐quality collection of resources, while the quality of a Web resource in turn depends on the expertise of the users who have assigned tags to it, forming a mutual reinforcement relationship. Second, an expert should be one who tends to identify interesting or useful resources before other users discover them, thus bringing these resources to the attention of the community of users. We propose a graph‐based algorithm, SPEAR (spamming‐resistant expertise analysis and ranking), which implements the above ideas for ranking users in a folksonomy. Our experiments show that our assumptions on expertise in resource discovery, and SPEAR as an implementation of these ideas, allow us to promote experts and demote spammers at the same time, with performance significantly better than the original hypertext‐induced topic search algorithm and simple statistical measures currently used in most collaborative tagging systems.

[1]  Craig MacDonald,et al.  High Quality Expertise Evidence for Expert Search , 2008, ECIR.

[2]  Christoph Meinel,et al.  Exploring social annotations for web document classification , 2008, SAC '08.

[3]  Michael Kaminsky,et al.  SybilGuard: Defending Against Sybil Attacks via Social Networks , 2008, IEEE/ACM Transactions on Networking.

[4]  Marcel Ausloos,et al.  Contextualising tags in collaborative tagging systems , 2009, HT '09.

[5]  Michael J. Prietula,et al.  Studies of Expertise from Psychological Perspectives , 2006 .

[6]  C. Bauckhage,et al.  Analyzing Social Bookmarking Systems : A del . icio . us Cookbook , 2008 .

[7]  Ayman Farahat,et al.  Authority Rankings from HITS, PageRank, and SALSA: Existence, Uniqueness, and Effect of Initialization , 2005, SIAM J. Sci. Comput..

[8]  J. Rice Mathematical Statistics and Data Analysis , 1988 .

[9]  Christoph Meinel,et al.  Authors vs. readers: a comparative study of document metadata and content in the www , 2007, DocEng '07.

[10]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[11]  R. Guha,et al.  Open Rating Systems , 2002 .

[12]  Ajita John,et al.  Collaborative Tagging and Expertise in the Enterprise , 2006 .

[13]  Peter Mika,et al.  Ontologies are us: A unified model of social networks and semantics , 2005, J. Web Semant..

[14]  Tyler Moore,et al.  Evaluating the Wisdom of Crowds in Assessing Phishing Websites , 2008, Financial Cryptography.

[15]  M. Chi Two Approaches to the Study of Experts' Characteristics , 2006 .

[16]  Ciro Cattuto,et al.  Social spam detection , 2009, AIRWeb '09.

[17]  Mor Naaman,et al.  Why we tag: motivations for annotation in mobile and online media , 2007, CHI.

[18]  Ling Chen,et al.  Using Co-occurence of Tags and Resources to Identify Spammers , 2008 .

[19]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[20]  Georgia Koutrika,et al.  Combating spam in tagging systems , 2007, AIRWeb '07.

[21]  Yi Zhang,et al.  Graph-based ranking algorithms for e-mail expertise analysis , 2003, DMKD '03.

[22]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[23]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[24]  Ling Liu,et al.  Socialtrust: tamper-resilient trust establishment in online communities , 2008, JCDL '08.

[25]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[26]  Andreas Hotho,et al.  The anti-social tagger: detecting spam in social bookmarking systems , 2008, AIRWeb '08.

[27]  F. Gobet The Cambridge handbook of expertise and expert performance , 2006 .

[28]  Georgia Koutrika,et al.  Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges , 2007, IEEE Internet Computing.

[29]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[30]  Michael Kaminsky,et al.  SybilGuard: defending against sybil attacks via social networks , 2006, SIGCOMM.

[31]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.