Followee recommendation based on text analysis of micro-blogging activity

Nowadays, more and more users keep up with news through information streams coming from real-time micro-blogging activity offered by services such as Twitter. In these sites, information is shared via a followers/followees social network structure in which a follower receives all the micro-blogs from his/her followees. Recent research efforts on understanding micro-blogging as a novel form of communication and news spreading medium have identified three different categories of users in these systems: information sources, information seekers and friends. As social networks grow in the number of registered users, finding relevant and reliable users to receive interesting information becomes essential. In this paper we propose a followee recommender system based on both the analysis of the content of micro-blogs to detect users' interests and in the exploration of the topology of the network to find candidate users for recommendation. Experimental evaluation was conducted in order to determine the impact of different profiling strategies based on the text analysis of micro-blogs as well as several factors that allows the identification of users acting as good information sources. We found that user-generated content available in the network is a rich source of information for profiling users and finding like-minded people.

[1]  Balachander Krishnamurthy,et al.  A few chirps about twitter , 2008, WOSN '08.

[2]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[3]  Vasudeva Varma,et al.  User context as a source of topic retrieval in Twitter , 2011 .

[4]  Daniel Dajun Zeng,et al.  A Novel Recommendation Framework for Micro-Blogging Based on Information Diffusion , 2009, WITS 2009.

[5]  Krishna P. Gummadi,et al.  Cognos: crowdsourcing search for topic experts in microblogs , 2012, SIGIR '12.

[6]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[7]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[8]  Krishna P. Gummadi,et al.  Inferring who-is-who in the Twitter social network , 2012, WOSN '12.

[9]  Virgílio A. F. Almeida,et al.  Finding trendsetters in information networks , 2012, KDD.

[10]  Karen Rose,et al.  What is Twitter , 2009 .

[11]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.

[12]  Hiroyuki Kitagawa,et al.  TURank: Twitter User Ranking Based on User-Tweet Graph Analysis , 2010, WISE.

[13]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[14]  Ido Guy,et al.  Do you know?: recommending people to invite into your social network , 2009, IUI.

[15]  K. Selçuk Candan,et al.  How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? , 2010, ICWSM.

[16]  Analía Amandi,et al.  Topology-Based Recommendation of Users in Micro-Blogging Communities , 2012, Journal of Computer Science and Technology.

[17]  Raleigh North Haewoon, Kwak, Changhyun, Lee, Park, Hosung, and Moon, Sue. . What is Twitter, a Social Network or a News Media?. 19th International World Wide Web (WWW) Conference.April. , 2010 .

[18]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[19]  Haewoon Kwak,et al.  Finding influentials based on the temporal order of information adoption in twitter , 2010, WWW '10.

[20]  Danushka Bollegala,et al.  Measuring semantic similarity between words using web search engines , 2007, WWW '07.

[21]  Michael S. Bernstein,et al.  Short and tweet: experiments on recommending content from information streams , 2010, CHI.

[22]  Barry Smyth,et al.  On the real-time web as a source of recommendation knowledge , 2010, RecSys '10.

[23]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[24]  Scott Counts,et al.  Identifying topical authorities in microblogs , 2011, WSDM '11.

[25]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[26]  John Hannon,et al.  Recommending twitter users to follow using content and collaborative filtering approaches , 2010, RecSys '10.

[27]  X. Amatriain,et al.  Weighted Content Based Methods for Recommending Connections in Online Social Networks , 2010 .

[28]  Rizal Setya Perdana What is Twitter , 2013 .

[29]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[30]  Barry Smyth,et al.  Using twitter to recommend real-time topical news , 2009, RecSys '09.

[31]  Michael J. Muller,et al.  Make new friends, but keep the old: recommending people on social networking sites , 2009, CHI.

[32]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[33]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[34]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[35]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[36]  Nan Sun,et al.  Exploiting internal and external semantics for the clustering of short texts using world knowledge , 2009, CIKM.

[37]  Shuchuan Lo,et al.  WMR--A Graph-Based Algorithm for Friend Recommendation , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[38]  Paolo Rosso,et al.  Improving the Clustering of Blogosphere with a Self-term Enriching Technique , 2009, TSD.

[39]  Paolo Rosso,et al.  On the difficulty of clustering company tweets , 2010, SMUC '10.

[40]  Jody Wheeler,et al.  Make new friends , 2002 .

[41]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[42]  Daniel M. Romero,et al.  Influence and Passivity in Social Media , 2011, ECML/PKDD.