Twitter Bot Detection with Reduced Feature Set

Online social networks provide a novel channel to allow interaction between human beings. Its success has attracted interest in attacking and exploiting them through a wide range of unethical activities, such as malicious actions to manipulate users. One of the methods to carry out these abuses is the use of bots on Twitter. Recent examples of bots influencing public opinion in the election process demonstrate their potential harm to the democratic world. Such malicious behavior needs to be checked and its effects should be diminished. Recently, machine learning (ML) classifiers to distinguish between real and bot accounts have proven advances. Thus, in this work four ML algorithms were tested using a public dataset and a few expressive features based on simple user profile counters for the classification of bots on Twitter. We measured their performance compared to one state–of–the–art bot detection work. The classifier accuracy was considered homogeneous with a mean of 0.8549 and 0.1889 of standard deviation. Besides, all multiclass classifiers obtained AUCs greater than 0.9 indicating a practical benefit for bot detection on Twitter.

[1]  Filippo Menczer,et al.  Online Human-Bot Interactions: Detection, Estimation, and Characterization , 2017, ICWSM.

[2]  Suzan Üsküdarli,et al.  Organized Behavior Classification of Tweet Sets using Supervised Learning Methods , 2017, WIMS.

[3]  Erhan Guven,et al.  A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection , 2016, IEEE Communications Surveys & Tutorials.

[4]  Roberto Di Pietro,et al.  DNA-Inspired Online Behavioral Modeling and Its Application to Spambot Detection , 2016, IEEE Intell. Syst..

[5]  William L. Simon,et al.  The Art of Deception: Controlling the Human Element of Security , 2001 .

[6]  Maurizio Tesconi,et al.  RTbust: Exploiting Temporal Patterns for Botnet Detection on Twitter , 2019, WebSci.

[7]  Philip N. Howard,et al.  Bots, #StrongerIn, and #Brexit: Computational Propaganda during the UK-EU Referendum , 2016, ArXiv.

[8]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[9]  Roberto Di Pietro,et al.  Fame for sale: Efficient detection of fake Twitter followers , 2015, Decis. Support Syst..

[10]  A. Faisal,et al.  Scaling-Laws of Human Broadcast Communication Enable Distinction between Human, Corporate and Robot Twitter Users , 2013, PloS one.

[11]  Bruce Schneier,et al.  Secrets and Lies: Digital Security in a Networked World , 2000 .

[12]  Wei Hu,et al.  Twitter spammer detection using data stream clustering , 2014, Inf. Sci..

[13]  Ralph Schroeder,et al.  Political Bots and the Swedish General Election , 2018, 2018 IEEE International Conference on Intelligence and Security Informatics (ISI).

[14]  Jürgen Knauth,et al.  Language-Agnostic Twitter-Bot Detection , 2019, RANLP.

[15]  Alex Hai Wang,et al.  Detecting Spam Bots in Online Social Networking Sites: A Machine Learning Approach , 2010, DBSec.

[16]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2013, IEEE Trans. Inf. Forensics Secur..

[17]  Hamideh Afsarmanesh,et al.  Phishing through social bots on Twitter , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[18]  Majd Latah,et al.  Detection of malicious social bots: A survey and a refined taxonomy , 2020, Expert Syst. Appl..

[19]  Kyumin Lee,et al.  Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter , 2011, ICWSM.

[20]  Filippo Menczer,et al.  BotOrNot: A System to Evaluate Social Bots , 2016, WWW.

[21]  Jorge Henrique Cabral Fernandes,et al.  Bot Development for Social Engineering Attacks on Twitter , 2020, ArXiv.

[22]  Roberto Di Pietro,et al.  The Paradigm-Shift of Social Spambots: Evidence, Theories, and Tools for the Arms Race , 2017, WWW.

[23]  Sushil Jajodia,et al.  Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg? , 2012, IEEE Transactions on Dependable and Secure Computing.

[24]  Muhammad Abulaish,et al.  A generic statistical approach for spam detection in Online Social Networks , 2013, Comput. Commun..

[25]  Fabrício Benevenuto,et al.  Reverse engineering socialbot infiltration strategies in Twitter , 2014, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[26]  Gianluca Stringhini,et al.  Detecting spammers on social networks , 2010, ACSAC '10.

[27]  Jon Crowcroft,et al.  Of Bots and Humans (on Twitter) , 2017, ASONAM.

[28]  Raúl Monroy,et al.  A one-class classification approach for bot detection on Twitter , 2020, Comput. Secur..