Detection of fake Twitter followers using graph centrality measures

With the exponential rise in the number of Internet users, Social Networking platforms have become one of the major means of communication all over the globe. Many major players in this field exist including the likes of Facebook, Twitter, Google+ etc. Impressed by the number of users an individual can reach using the existing Social Networking platforms, most organizations and celebrities make use of them to keep in touch with their fans and followers continuously. Social Networking platforms also allow the organizations or celebrities to publicize any event and to update information regarding their business just to keep themselves active in the market. Most Social Networking platforms provide some form of metric which can be used to define the popularity of an user such as the number of followers on Twitter, number of likes on Facebook etc. However, in recent years, it has been observed that many users attempt to manipulate their popularity metric with the help of fake accounts to look more popular. In this paper, we have devised a method which can be used to detect all the fake followers within a social graph network based on features related to the centrality of all the nodes in the graph and training a classifier based on a subset of the data. Using only graph based centrality measures, the proposed method yielded very high accuracy on fake follower detection. The proposed method is generic in nature and can be used irrespective of the social network platform under consideration.

[1]  Jianjun Yu,et al.  Automatic Fake Followers Detection in Chinese Micro-blogging System , 2014, PAKDD.

[2]  Sajid Yousuf Bhat,et al.  Using communities against deception in online social networks , 2014 .

[3]  Yi Zhang,et al.  Discover millions of fake followers in Weibo , 2016, Social Network Analysis and Mining.

[4]  Roberto Di Pietro,et al.  Fame for sale: Efficient detection of fake Twitter followers , 2015, Decis. Support Syst..

[5]  Emiliano De Cristofaro,et al.  Paying for Likes?: Understanding Facebook Like Fraud Using Honeypots , 2014, Internet Measurement Conference.

[6]  Sushil Jajodia,et al.  Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg? , 2012, IEEE Transactions on Dependable and Secure Computing.

[7]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[8]  Gérard Biau,et al.  Analysis of a Random Forests Model , 2010, J. Mach. Learn. Res..

[9]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[10]  Mahmoud A. Mahmoud,et al.  Fake Account Detection in Twitter Based on Minimum Weighted Feature set , 2015 .

[11]  V. S. Subrahmanian,et al.  Using sentiment to detect bots on Twitter: Are humans more opinionated than bots? , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[12]  Gang Wang,et al.  Follow the green: growth and dynamics in twitter follower markets , 2013, Internet Measurement Conference.