Rumour veracity detection on twitter using particle swarm optimized shallow classifiers

Information overload on Web has been a well-identified challenge which has amplified with the advent of social web. Good, bad, true, false, useful, useless all kinds of information disseminates through the social web platforms. It becomes exceedingly imperative to pro-actively resolve rumours and inhibit them from spreading among the Internet users as it can jeopardize the well-being of the citizens. The task for rumour analysis intends to identify & classify a rumour either as true (factual), false (nonfactual) or unresolved. Determining the accuracy of a rumourous story, a.k.a. rumour veracity is hard owing to the noisy, ambiguous and heterogeneous use of natural language. This necessitates automation of the predictive task which classifies the questionable veracity of rumour accurately. The research presented in this paper, is an empirical study to put forward an optimized learning model which classifies real-time tweets on the basis of truth value, facilitating rumour analysis. The study is conducted on a collection of nearly 14 k tweets pertaining to the recent mob lynching fuelled by rumours on suspected child-lifters in the Indian sub-continent (#moblynching) and run on five classical shallow classifiers to categorize tweets into true, false and unspecified using 13 attributes (features). Subsequently, the use of an optimal feature selection method, particle swarm algorithm is proposed to improve the classifier’s performance. The empirical analysis validates that the proposed implementation of particle swarm optimization (PSO) for feature subset selection in rumour veracity classification outperforms the baseline supervised learning algorithms. An average 11.28% improvement in accuracy and approximately 31% average reduction in features are demonstrated using PSO. The highest accuracy with optimization of 96.15% is achieved by decision tree.

[1]  Jason R. C. Nurse,et al.  Determining the Veracity of Rumours on Twitter , 2016, SocInfo.

[2]  Arkaitz Zubiaga,et al.  SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours , 2017, *SEMEVAL.

[3]  Yiannis Kompatsiaris,et al.  Verifying information with multimedia content on twitter , 2017, Multimedia Tools and Applications.

[4]  Quan Z. Sheng,et al.  Extreme User and Political Rumor Detection on Twitter , 2016, ADMA.

[5]  Soroush Vosoughi,et al.  Automatic detection and verification of rumors on Twitter , 2015 .

[6]  Wei Gao,et al.  Detect Rumors Using Time Series of Social Context Information on Microblogging Websites , 2015, CIKM.

[7]  Arunima Jaiswal,et al.  Empirical Study of Twitter and Tumblr for Sentiment Analysis using Soft Computing Techniques , 2022 .

[8]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[9]  Xiaomo Liu,et al.  Real-time Rumor Debunking on Twitter , 2015, CIKM.

[10]  Samhaa R. El-Beltagy,et al.  NileTMRG at SemEval-2017 Task 8: Determining Rumour and Veracity Support for Rumours on Twitter. , 2017, *SEMEVAL.

[11]  Kyomin Jung,et al.  Prominent Features of Rumor Propagation in Online Social Media , 2013, 2013 IEEE 13th International Conference on Data Mining.

[12]  A. Kumar,et al.  Information Virality Prediction using Emotion Quotient of Tweets , 2018 .

[13]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[14]  Zili Zhang,et al.  Predictors of the authenticity of Internet health rumours. , 2015, Health information and libraries journal.

[15]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[16]  Shamus Khan,et al.  Sexual assault incidents among college undergraduates: Prevalence and factors associated with risk , 2017, PloS one.

[17]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[18]  Takao Terano,et al.  Detecting rumor patterns in streaming social media , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[19]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[20]  Ponnurangam Kumaraguru,et al.  The follower count fallacy: detecting Twitter users with manipulated follower count , 2018, SAC.

[21]  Akshi Kumar,et al.  A Survey on Sentiment Analysis using Swarm Intelligence , 2016 .

[22]  Dazhen Lin,et al.  Content Representation for Microblog Rumor Detection , 2016, UKCI.

[23]  Akshi Kumar,et al.  Emotion analysis of Twitter using opinion mining , 2015, 2015 Eighth International Conference on Contemporary Computing (IC3).

[24]  Akshi Kumar,et al.  Rumor Detection Using Machine Learning Techniques on Social Media , 2018, International Conference on Innovative Computing and Communications.

[25]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[26]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[27]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[28]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[29]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[30]  Akshi Kumar,et al.  Systematic literature review of sentiment analysis on Twitter using soft computing techniques , 2019, Concurr. Comput. Pract. Exp..

[31]  Mehdi Hosseinzadeh Aghdam,et al.  Feature Selection Using Particle Swarm Optimization in Text Categorization , 2015, J. Artif. Intell. Soft Comput. Res..

[32]  Dejing Dou,et al.  A Temporal Attentional Model for Rumor Stance Classification , 2017, CIKM.

[33]  Akshi Kumar,et al.  Machine Learning from Theory to Algorithms: An Overview , 2018, Journal of Physics: Conference Series.

[34]  M. Cha,et al.  Rumor Detection over Varying Time Windows , 2017, PloS one.

[35]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[36]  Carlos Angel Iglesias,et al.  A Survey of Twitter Rumor Spreading Simulations , 2015, ICCCI.

[37]  Snehasish Banerjee,et al.  Linguistic Predictors of Rumor Veracity on the Internet , 2016 .

[38]  Mohd Shahizan Othman,et al.  Review of feature selection for solving classification problems , 2013 .

[39]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[40]  Arkaitz Zubiaga,et al.  SemEval-2019 Task 7: RumourEval, Determining Rumour Veracity and Support for Rumours , 2019, *SEMEVAL.