Detecting Malicious Twitter Bots Using Machine Learning

Cybercrime and phishing scams have increased manyfold over the past few years. Hackers keep devising new techniques to compromise accounts and obtain sensitive information about people and organisations, and social networking sites such as Twitter are one such tool. Because of their large audience, hackers use these sites to reach many people at once, circulating malicious URLs, phishing mails, and similar content that serve as the entry point into a target system. The introduction of Twitter bots has made this work even easier. Twitter bots can send tweets at fixed, regular intervals without any human intervention, and they tweet far more frequently than humans do, so hackers often use them to spread malicious URLs. Because of the large number of active members, these malicious URLs reach more people, which increases phishing scams and fraud. This paper proposes a model that uses different machine learning algorithms, first to detect Twitter bots and then to determine which of them are posting malicious URLs. The proposed model suggests features that distinguish a Twitter bot account from a benign account, and the model is trained on those statistical features. The model will help filter out the malicious bots that are harmful to legitimate users.
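The abstract does not list the statistical features themselves. As a minimal sketch only, the two behaviours it does mention (posting at regular intervals and at a high frequency) could be turned into numeric features as follows. The function names, the feature names, and the URL-share feature are assumptions made for illustration, not the paper's feature set.

```python
from statistics import mean, pstdev


def interval_features(timestamps):
    """Timing features for one account (hypothetical helper).

    timestamps: posting times in seconds, sorted in increasing order.
    """
    if len(timestamps) < 3 or timestamps[-1] <= timestamps[0]:
        raise ValueError("need at least three tweets spread over time")
    # Gaps between consecutive tweets.
    gaps = [later - earlier for earlier, later in zip(timestamps, timestamps[1:])]
    span_hours = (timestamps[-1] - timestamps[0]) / 3600
    return {
        # Higher for accounts that tweet often.
        "tweets_per_hour": len(gaps) / span_hours,
        # Coefficient of variation of the gaps: 0 means perfectly regular posting.
        "gap_cv": pstdev(gaps) / mean(gaps),
    }


def url_ratio(tweets):
    """Share of tweets that contain a link (assumed feature for the URL stage)."""
    if not tweets:
        raise ValueError("need at least one tweet")
    with_url = sum("http://" in text or "https://" in text for text in tweets)
    return with_url / len(tweets)


# Made-up timelines: one posts every 10 minutes, the other posts irregularly.
scheduled = interval_features([0, 600, 1200, 1800, 2400])
irregular = interval_features([0, 500, 9000, 9400, 40000])
```

Feature vectors of this kind, computed for labelled accounts, would then be passed to whichever classifier is chosen. Under this sketch, an account posting on a fixed schedule gets a `gap_cv` near zero, while irregular human-like posting gives a larger value.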
