Fake news, rumor, information pollution in social media and web: A contemporary survey of state-of-the-arts, challenges and opportunities

Abstract Internet and social media have become a widespread, large scale and easy to use platform for real-time information dissemination. It has become an open stage for discussion, ideology expression, knowledge dissemination, emotions and sentiment sharing. This platform is gaining tremendous attraction and a huge user base from all sections and age groups of society. The matter of concern is that up to what extent the contents that are circulating among all these platforms every second changing the mindset, perceptions and lives of billions of people are verified, authenticated and up to the standards. This paper puts forward a holistic view of how the information is being weaponized to fulfil the malicious motives and forcefully making a biased user perception about a person, event or firm. Further, a taxonomy is provided for the classification of malicious information content at different stages and prevalent technologies to cope up with this issue form origin, propagation, detection and containment stages. We also put forward a research gap and possible future research directions so that the web information content could be more reliable and safer to use for decision making as well as for knowledge sharing.

[1]  R Santhya,et al.  A Modern approach to identify the fake news using machine learning , 2018 .

[2]  H. Russell Bernard,et al.  Studying Fake News via Network Analysis: Detection and Mitigation , 2018, Lecture Notes in Social Networks.

[3]  Chengkai Li,et al.  Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster , 2017, KDD.

[4]  Sebastian Tschiatschek,et al.  Fake News Detection in Social Networks via Crowd Signals , 2017, WWW.

[5]  Abdelouahed Gherbi,et al.  An empirical study on detecting fake reviews using machine learning techniques , 2017, 2017 Seventh International Conference on Innovative Computing Technology (INTECH).

[6]  Filippo Menczer,et al.  Anatomy of an online misinformation network , 2018, PloS one.

[7]  Edward Tjörnhammar,et al.  Mama Edha at SemEval-2017 Task 8: Stance Classification with CNN and Rules , 2017, *SEMEVAL.

[8]  Mong-Li Lee,et al.  Predicting Stances in Twitter Conversations for Detecting Veracity of Rumors: A Neural Approach , 2018, 2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI).

[9]  Mohammad Taher Pilehvar,et al.  Towards Automatic Fake News Detection: Cross-Level Stance Detection in News Articles , 2018 .

[10]  S. van der Linden,et al.  The fake news game: actively inoculating against the risk of misinformation , 2019 .

[11]  Putu Wuri Handayani,et al.  Information credibility factors on information sharing activites in social media , 2017, 2017 Second International Conference on Informatics and Computing (ICIC).

[12]  Zili Zhang,et al.  A bio-inspired method for locating the diffusion source with limited observers , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[13]  Joydip Dhar,et al.  A mathematical model of news propagation on online social network and a control strategy for rumor spreading , 2016, Social Network Analysis and Mining.

[14]  V. P. Sahana,et al.  Automatic detection of rumoured tweets and finding its origin , 2015, 2015 International Conference on Computing and Network Communications (CoCoNet).

[15]  Yale Song,et al.  #FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media , 2014, IEEE Transactions on Visualization and Computer Graphics.

[16]  Michal Jankowski-Lorek,et al.  Automated Credibility Assessment on Twitter , 2015, Comput. Sci..

[17]  Huan Liu,et al.  Beyond News Contents: The Role of Social Context for Fake News Detection , 2017, WSDM.

[18]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[19]  G. Geethakumari,et al.  Detecting misinformation in online social networks using cognitive psychology , 2014, Human-centric Computing and Information Sciences.

[20]  Huan Liu,et al.  Tracing Fake-News Footprints: Characterizing Social Media Messages by How They Propagate , 2018, WSDM.

[21]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[22]  Prabhas Chongstitvatana,et al.  Detecting Fake News with Machine Learning Method , 2018, 2018 15th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON).

[23]  Eslam Amer,et al.  Deep Learning Algorithms for Detecting Fake News in Online Text , 2018, 2018 13th International Conference on Computer Engineering and Systems (ICCES).

[24]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[25]  Wei Huang,et al.  Rumor spreading model with consideration of forgetting mechanism: A case of online blogging LiveJournal , 2011 .

[26]  Alireza Louni,et al.  Who Spread That Rumor: Finding the Source of Information in Large Online Social Networks With Probabilistically Varying Internode Relationship Strengths , 2018, IEEE Transactions on Computational Social Systems.

[27]  Yi Zhang,et al.  iOLAP: A Framework for Analyzing the Internet, Social Networks, and Other Networked Data , 2009, IEEE Transactions on Multimedia.

[28]  Monther Aldwairi,et al.  Detecting Fake News in Social Media Networks , 2018, EUSPN/ICTH.

[29]  Reza Zafarani,et al.  Fake News: Fundamental Theories, Detection Strategies and Challenges , 2019, WSDM.

[30]  Sabu M. Thampi,et al.  A nature - inspired approach based on Forest Fire model for modeling rumor propagation in social networks , 2019, J. Netw. Comput. Appl..

[31]  Wasi Haider Butt,et al.  Multi-Label Fake News Detection using Multi-layered Supervised Learning , 2019, ICCAE 2019.

[32]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[33]  Devavrat Shah,et al.  Rumors in a Network: Who's the Culprit? , 2009, IEEE Transactions on Information Theory.

[34]  Jaeyoung Choi,et al.  Rumor source detection under querying with untruthful answers , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[35]  Yubo Zhang,et al.  Identification of efficient observers for locating spreading source in complex networks , 2016 .

[36]  Wei Gao,et al.  Detect Rumors Using Time Series of Social Context Information on Microblogging Websites , 2015, CIKM.

[37]  Fang Wang,et al.  Structure of peer-to-peer social networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Lina Yao,et al.  Opinion Fraud Detection via Neural Autoencoder Decision Forest , 2018, Pattern Recognit. Lett..

[39]  Yoshiteru Ishida,et al.  Fake News and its Credibility Evaluation by Dynamic Relational Networks: A Bottom up Approach , 2018, KES.

[40]  Eric Gilbert,et al.  CREDBANK: A Large-Scale Social Media Corpus With Associated Credibility Annotations , 2015, ICWSM.

[41]  Gabriella Pasi,et al.  Credibility in social media: opinions, news, and health information—a survey , 2017, WIREs Data Mining Knowl. Discov..

[42]  Nam P. Nguyen,et al.  Containment of misinformation spread in online social networks , 2012, WebSci '12.

[43]  Gábor Csányi,et al.  Structure of a large social network. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[44]  Kai Shu,et al.  FakeNewsTracker: a tool for fake news collection, detection, and visualization , 2018, Computational and Mathematical Organization Theory.

[45]  Xiang Ao,et al.  Energy model for rumor propagation on social networks , 2014 .

[46]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[47]  Christos Faloutsos,et al.  R-MAT: A Recursive Model for Graph Mining , 2004, SDM.

[48]  Xiaomo Liu,et al.  Real-time Rumor Debunking on Twitter , 2015, CIKM.

[49]  Soroush Vosoughi,et al.  Rumor Gauge , 2017, ACM Trans. Knowl. Discov. Data.

[50]  Jiao Gu,et al.  The effect of the forget-remember mechanism on spreading , 2008, 1011.3684.

[51]  Yamir Moreno,et al.  Theory of Rumour Spreading in Complex Social Networks , 2007, ArXiv.

[52]  Olga Simek,et al.  Collective Classification for Social Media Credibility Estimation , 2019, HICSS.

[53]  Mor Naaman,et al.  Diamonds in the rough: Social media visual analytics for journalistic inquiry , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[54]  Shrisha Rao,et al.  3HAN: A Deep Neural Network for Fake News Detection , 2017, ICONIP.

[55]  Suyalatu Dong,et al.  SEIR Model of Rumor Spreading in Online Social Network with Varying Total Population Size , 2017 .

[56]  Pável Calado,et al.  Combining Similarity Features and Deep Representation Learning for Stance Detection in the Context of Checking Fake News , 2018, ACM J. Data Inf. Qual..

[57]  Lei Ying,et al.  Information source detection in the SIR model: A sample path based approach , 2012, 2013 Information Theory and Applications Workshop (ITA).

[58]  Shuo Yang,et al.  Unsupervised Fake News Detection on Social Media: A Generative Approach , 2019, AAAI.

[59]  Kate Starbird,et al.  Rumors, False Flags, and Digital Vigilantes: Misinformation on Twitter after the 2013 Boston Marathon Bombing , 2014 .

[60]  Yuriy A. Belov,et al.  Generation of a Social Network Graph by Using Apache Spark , 2017, Automatic Control and Computer Sciences.

[61]  Vahida Attar,et al.  Source detection of rumor in social network - A review , 2019, Online Soc. Networks Media.

[62]  Yun Liu,et al.  An epidemic model of rumor diffusion in online social networks , 2013 .

[63]  Xiaoru Yuan,et al.  Social Media Visual Analytics , 2017, Comput. Graph. Forum.

[64]  Cody Buntain,et al.  Automatically Identifying Fake News in Popular Twitter Threads , 2017, 2017 IEEE International Conference on Smart Cloud (SmartCloud).

[65]  Wei Gao,et al.  Rumor Detection on Twitter with Tree-structured Recursive Neural Networks , 2018, ACL.

[66]  Jaeyoung Choi,et al.  Estimating the rumor source with anti-rumor in social networks , 2016, 2016 IEEE 24th International Conference on Network Protocols (ICNP).

[67]  Xiaohua Sun,et al.  Whisper: Tracing the Spatiotemporal Process of Information Diffusion in Real Time , 2012, IEEE Transactions on Visualization and Computer Graphics.

[68]  Kyomin Jung,et al.  Prominent Features of Rumor Propagation in Online Social Media , 2013, 2013 IEEE 13th International Conference on Data Mining.

[69]  Yan Liu,et al.  Neural User Response Generator: Fake News Detection with Collective User Intelligence , 2018, IJCAI.

[70]  Indrajit Bhattacharya,et al.  Analysis and Early Detection of Rumors in a Post Disaster Scenario , 2018, Information Systems Frontiers.

[71]  Eiji Aramaki,et al.  How do rumors spread during a crisis?: Analysis of rumor expansion and disaffirmation on Twitter after 3.11 in Japan , 2014, Int. J. Web Inf. Syst..

[72]  Yanyan Li,et al.  Social Analytics Framework to Boost Recommendation in Online Learning Communities , 2015, 2015 IEEE 15th International Conference on Advanced Learning Technologies.

[73]  Ian Reilly,et al.  F for Fake: Propaganda! Hoaxing! Hacking! Partisanship! and Activism! in the Fake News Ecology , 2018 .

[74]  Ahmet Aker,et al.  The Fake News Challenge: Stance Detection using Traditional Machine Learning Approaches. , 2018 .

[75]  Muhammad Al-Qurishi,et al.  Selecting the best open source tools for collecting and visualzing social media content , 2015, 2015 2nd World Symposium on Web Applications and Networking (WSWAN).

[76]  Joao Correia,et al.  Improving trust using online credibility sources and social network quality in P2P marketplaces , 2013, 2013 8th Iberian Conference on Information Systems and Technologies (CISTI).

[77]  Muhammad Al-Qurishi,et al.  CredFinder: A real-time tweets credibility assessing system , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[78]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[79]  Jingjing Cheng,et al.  SIHR rumor spreading model in social networks , 2012 .

[80]  Nicolas Turenne The rumour spectrum , 2018, PloS one.

[81]  Mei-Ling Shyu,et al.  High Dimensional Latent Space Variational AutoEncoders for Fake News Detection , 2019, 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[82]  Dinesh Kumar Vishwakarma,et al.  Detection and veracity analysis of fake news via scrapping and authenticating the web search , 2019, Cognitive Systems Research.

[83]  Hernán A. Makse,et al.  CUNY Academic Works , 2022 .

[84]  Filippo Menczer,et al.  Hoaxy: A Platform for Tracking Online Misinformation , 2016, WWW.

[85]  Fan Zhou,et al.  Information Diffusion Prediction via Recurrent Cascades Convolution , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[86]  Sungyong Seo,et al.  CSI: A Hybrid Deep Model for Fake News Detection , 2017, CIKM.

[87]  Peng Yang,et al.  Deceptive Review Spam Detection via Exploiting Task Relatedness and Unlabeled Data , 2016, EMNLP.

[88]  Yongdong Zhang,et al.  Novel Visual and Statistical Image Features for Microblogs News Verification , 2017, IEEE Transactions on Multimedia.

[89]  Simrat Ahluwalia,et al.  Fake News Detection: A Deep Learning Approach , 2018 .

[90]  Sibel Adali,et al.  This Just In: Fake News Packs a Lot in Title, Uses Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News , 2017, Proceedings of the International AAAI Conference on Web and Social Media.

[91]  Gabriella Pasi,et al.  An ensemble method for the credibility assessment of user-generated content , 2017, WI.

[92]  Volker Markl,et al.  Myriad: Scalable and Expressive Data Generation , 2012, Proc. VLDB Endow..

[93]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[94]  He Jiang,et al.  Combating Fake News , 2019, ACM Trans. Intell. Syst. Technol..

[95]  Siegfried Depner,et al.  Towards a property graph generator for benchmarking , 2017, GRADES@SIGMOD/PODS.

[96]  Athena Vakali,et al.  Behind the cues: A benchmarking study for fake news detection , 2019, Expert Syst. Appl..

[97]  Filippo Menczer,et al.  Early detection of promoted campaigns on social media , 2017, EPJ Data Science.

[98]  Walid Maalej,et al.  Towards understanding and detecting fake reviews in app stores , 2019, Empirical Software Engineering.

[99]  David G. Rand,et al.  Belief in Fake News Is Associated with Delusionality, Dogmatism, Religious Fundamentalism, and Reduced Analytic Thinking , 2018, Journal of Applied Research in Memory and Cognition.

[100]  Jianhong Chen,et al.  Agent-Based Simulation of Rumor Propagation on Social Network Based on Active Immune Mechanism , 2017 .

[101]  Huan Liu,et al.  Deep Headline Generation for Clickbait Detection , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[102]  Wei Gao,et al.  Detect Rumor and Stance Jointly by Neural Multi-task Learning , 2018, WWW.

[103]  Wanlei Zhou,et al.  To Shut Them Up or to Clarify: Restraining the Spread of Rumors in Online Social Networks , 2014, IEEE Transactions on Parallel and Distributed Systems.

[104]  Muhammad Al-Qurishi,et al.  A multistage credibility analysis model for microblogs , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[105]  Wuqiong Luo,et al.  Identifying Infection Sources and Regions in Large Networks , 2012, IEEE Transactions on Signal Processing.

[106]  Zili Zhang,et al.  Predictors of the authenticity of Internet health rumours. , 2015, Health information and libraries journal.

[107]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[108]  Yang Liu,et al.  Early Detection of Fake News on Social Media Through Propagation Path Classification with Recurrent and Convolutional Networks , 2018, AAAI.

[109]  Mona T. Diab,et al.  Rumor Detection and Classification for Twitter Data , 2015, ArXiv.

[110]  George H. L. Fletcher,et al.  gMark: Schema-Driven Generation of Graphs and Queries , 2015, IEEE Transactions on Knowledge and Data Engineering.

[111]  Huan Liu,et al.  Understanding User Profiles on Social Media for Fake News Detection , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[112]  Manuel Penschuck,et al.  I/O-Efficient Generation of Massive Graphs Following the LFR Benchmark , 2016, ALENEX.

[113]  Kate Starbird,et al.  Examining the Alternative Media Ecosystem Through the Production of Alternative Narratives of Mass Shooting Events on Twitter , 2017, ICWSM.

[114]  Arkaitz Zubiaga,et al.  Hawkes Processes for Continuous Time Sequence Classification: an Application to Rumour Stance Classification in Twitter , 2016, ACL.

[115]  Jiliang Tang,et al.  Multi-Source Multi-Class Fake News Detection , 2018, COLING.

[116]  Hung-Yu Kao,et al.  IKM at SemEval-2017 Task 8: Convolutional Neural Networks for stance detection and rumor verification , 2017, *SEMEVAL.

[117]  Tamara G. Kolda,et al.  A Scalable Generative Graph Model with Community Structure , 2013, SIAM J. Sci. Comput..

[118]  Jake Ryland Williams,et al.  BuzzFace: A News Veracity Dataset with Facebook User Commentary and Egos , 2018, ICWSM.

[119]  Ponnurangam Kumaraguru,et al.  TweetCred: Real-Time Credibility Assessment of Content on Twitter , 2014, SocInfo.

[120]  Yongdong Zhang,et al.  Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs , 2017, ACM Multimedia.

[121]  Wei Gao,et al.  From Retweet to Believability: Utilizing Trust to Identify Rumor Spreaders on Twitter , 2017, ASONAM.

[122]  Issa Traore,et al.  Detecting opinion spams and fake news using text classification , 2018, Secur. Priv..

[123]  Qiang Zhang,et al.  Reply-Aided Detection of Misinformation via Bayesian Deep Learning , 2019, WWW.

[124]  Fang Miao,et al.  Rumor spreading and monitoring deployment in online social networks , 2017, 2017 IEEE 17th International Conference on Communication Technology (ICCT).

[125]  Gianluca Stringhini,et al.  Towards Detecting Compromised Accounts on Social Networks , 2015, IEEE Transactions on Dependable and Secure Computing.

[126]  Antonio Scala,et al.  Polarization and Fake News , 2018, ACM Trans. Web.

[127]  Wei Gao,et al.  Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning , 2017, ACL.

[128]  Deepayan Bhowmik,et al.  Fake News Identification on Twitter with Hybrid CNN and RNN Models , 2018, SMSociety.

[129]  Muhammad Al-Qurishi,et al.  A Credibility Analysis System for Assessing Information on Twitter , 2018, IEEE Transactions on Dependable and Secure Computing.

[130]  Vasudeva Varma,et al.  MVAE: Multimodal Variational Autoencoder for Fake News Detection , 2019, WWW.

[131]  Tian Wang,et al.  Using Mobile Nodes to Control Rumors in Big Data Based on a New Rumor Propagation Model in Vehicular Social Networks , 2018, IEEE Access.

[132]  Shlok Gilda,et al.  Evaluating machine learning algorithms for fake news detection , 2017, 2017 IEEE 15th Student Conference on Research and Development (SCOReD).