On Profiling Bots in Social Media

The popularity of social media platforms such as Twitter has led to the proliferation of automated bots, creating both opportunities and challenges in information dissemination, user engagements, and quality of services. Past works on profiling bots had been focused largely on malicious bots, with the assumption that these bots should be removed. In this work, however, we find many bots that are benign, and propose a new, broader categorization of bots based on their behaviors. This includes broadcast, consumption, and spam bots. To facilitate comprehensive analyses of bots and how they compare to human accounts, we develop a systematic profiling framework that includes a rich set of features and classifier bank. We conduct extensive experiments to evaluate the performances of different classifiers under varying time windows, identify the key features of bots, and infer about bots in a larger Twitter population. Our analysis encompasses more than 159K bot and human (non-bot) accounts in Twitter. The results provide interesting insights on the behavioral traits of both benign and malicious bots.

[1]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[2]  V. S. Subrahmanian,et al.  Using sentiment to detect bots on Twitter: Are humans more opinionated than bots? , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[3]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[4]  Fabrício Benevenuto,et al.  Reverse engineering socialbot infiltration strategies in Twitter , 2014, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[5]  David W. McDonald,et al.  Dissecting a Social Botnet: Growth, Content and Influence in Twitter , 2015, CSCW.

[6]  Gianluca Stringhini,et al.  Detecting spammers on social networks , 2010, ACSAC '10.

[7]  Alex Hai Wang,et al.  Detecting Spam Bots in Online Social Networking Sites: A Machine Learning Approach , 2010, DBSec.

[8]  Max Nanis,et al.  Socialbots: voices from the fronts , 2012, INTR.

[9]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[10]  Amos Azaria,et al.  The DARPA Twitter Bot Challenge , 2016, Computer.

[11]  Kyumin Lee,et al.  Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter , 2011, ICWSM.

[12]  Markus Strohmaier,et al.  When Social Bots Attack: Modeling Susceptibility of Users in Online Social Networks , 2012, #MSM.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  A. Faisal,et al.  Scaling-Laws of Human Broadcast Communication Enable Distinction between Human, Corporate and Robot Twitter Users , 2013, PloS one.

[15]  Konstantin Beznosov,et al.  Design and analysis of a social botnet , 2013, Comput. Networks.

[16]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[17]  Filippo Menczer,et al.  The rise of social bots , 2014, Commun. ACM.

[18]  Huan Liu,et al.  Social Spammer Detection in Microblogging , 2013, IJCAI.

[19]  Krishna P. Gummadi,et al.  Understanding and combating link farming in the twitter social network , 2012, WWW.

[20]  Sushil Jajodia,et al.  Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg? , 2012, IEEE Transactions on Dependable and Secure Computing.

[21]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[22]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[23]  Markus Strohmaier,et al.  A categorization scheme for socialbot attacks in online social networks , 2014, ArXiv.