Profiling the followers of the most influential and verified users on Sina Weibo

The new social media such as Twitter and Sina Weibo has become an increasingly popular channel for spreading influence, challenging traditional media such as TVs and newspapers. The most influential and verified users, also called big-V accounts on Sina Weibo often attract million of followers and fans, creating massive “celebrity-centric” social networks on the social media, which play a key role in disseminating breaking news, latest events, and controversial opinions on social issues. Given the importance of these accounts, it is very crucial to understand social networks and user influence of these accounts and profile their followers' behaviors. Towards this end, this paper monitors a selected group of influential users on Sina Weibo and collects their tweet streams as well as retweeting and commenting activities on these tweets from their followers. Our analysis on tweet data streams from Sina Weibo reveals when and what the followers comment on the tweets of these influential users, and discovers different temporal patterns and word diversity in the comments. Based on the insight gained from follower characteristics, we further develop simple and intuitive algorithms for classifying the followers into spammers and normal fans. Our experimental results demonstrate that the proposed algorithms are able to achieve an average accuracy of 95.20% in detecting spammers from the followers who have commented on the tweets of these influential accounts.

[1]  James She,et al.  An Analysis of Verifications in Microblogging Social Networks -- Sina Weibo , 2012, 2012 32nd International Conference on Distributed Computing Systems Workshops.

[2]  Qian Zhang,et al.  Analyzing the influential people in Sina Weibo dataset , 2013, 2013 IEEE Global Communications Conference (GLOBECOM).

[3]  Kai Zhang,et al.  Characterizing Tweeting Behaviors of Sina Weibo Users via Public Data Streaming , 2014, WAIM.

[4]  Kai Zhang,et al.  Understanding Sina Weibo online social network: A community approach , 2013, 2013 IEEE Global Communications Conference (GLOBECOM).

[5]  Pang-Ning Tan,et al.  1 INFORMATION ASSURANCE : DETECTION OF WEB SPAM ATTACKS IN SOCIAL MEDIA , 2010 .

[6]  Kai Zhang,et al.  Extracting unknown words from Sina Weibo via data clustering , 2015, 2015 IEEE International Conference on Communications (ICC).

[7]  Xiaohua Jia,et al.  Characterizing Information Diffusion in Online Social Networks with Linear Diffusive Model , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems.

[8]  Kyumin Lee,et al.  The social honeypot project: protecting online communities from spammers , 2010, WWW '10.

[9]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[10]  Kai Zhang,et al.  Hot topic analysis and content mining in social media , 2014, 2014 IEEE 33rd International Performance Computing and Communications Conference (IPCCC).

[11]  Xiaokang Yang,et al.  Analysis and identification of spamming behaviors in Sina Weibo microblog , 2013, SNAKDD '13.

[12]  Michael Sirivianos,et al.  Aiding the Detection of Fake Accounts in Large Scale Social Online Services , 2012, NSDI.

[13]  Christos Faloutsos,et al.  Detecting Fraudulent Personalities in Networks of Online Auctioneers , 2006, PKDD.

[14]  Georgia Koutrika,et al.  Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges , 2007, IEEE Internet Computing.

[15]  Juan Enrique Ramos,et al.  Using TF-IDF to Determine Word Relevance in Document Queries , 2003 .

[16]  Pang-Ning Tan,et al.  A co-classification framework for detecting web spam and spammers in social media web sites , 2009, CIKM.

[17]  Michael Trusov,et al.  Determining Influential Users in Internet Social Networks , 2010 .

[18]  Jiebo Luo,et al.  SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks , 2011, Proc. VLDB Endow..

[19]  Marc Lemercier,et al.  SPOT 1.0: Scoring Suspicious Profiles on Twitter , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[20]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[21]  Lin Liu,et al.  Detecting Spam in Chinese Microblogs - A Study on Sina Weibo , 2012, 2012 Eighth International Conference on Computational Intelligence and Security.

[22]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[23]  Long Li,et al.  Characterizing User Behavior in Weibo , 2012, 2012 Third FTRA International Conference on Mobile, Ubiquitous, and Intelligent Computing.

[24]  Xiaokang Yang,et al.  Feature Analysis of Spammers in Social Networks with Active Honeypots: A Case Study of Chinese Microblogging Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[25]  Huan Liu,et al.  Social Spammer Detection in Microblogging , 2013, IJCAI.