Fake News Early Detection

Massive dissemination of fake news and its potential to erode democracy has increased the demand for accurate fake news detection. Recent advancements in this area have proposed novel techniques that aim to detect fake news by exploring how it propagates on social networks. Nevertheless, to detect fake news at an early stage, i.e., when it is published on a news outlet but not yet spread on social media, one cannot rely on news propagation information as it does not exist. Hence, there is a strong need to develop approaches that can detect fake news by focusing on news content. In this article, a theory-driven model is proposed for fake news detection. The method investigates news content at various levels: lexicon-level, syntax-level, semantic-level, and discourse-level. We represent news at each level, relying on well-established theories in social and forensic psychology. Fake news detection is then conducted within a supervised machine learning framework. As an interdisciplinary research, our work explores potential fake news patterns, enhances the interpretability in fake news feature engineering, and studies the relationships among fake news, deception/disinformation, and clickbaits. Experiments conducted on two real-world datasets indicate the proposed method can outperform the state-of-the-art and enable fake news early detection when there is limited content information.

[1]  A. Roets,et al.  ‘Fake news’: Incorrect, but hard to correct. The role of cognitive ability on the impact of false information on social impressions , 2017 .

[2]  Yimin Chen,et al.  Deception detection for news: Three types of fakes , 2015, ASIST.

[3]  Johan Bollen,et al.  Computational Fact Checking from Knowledge Networks , 2015, PloS one.

[4]  Amol Agrawal,et al.  Clickbait detection using deep learning , 2016, 2016 2nd International Conference on Next Generation Computing Technologies (NGCT).

[5]  Huan Liu,et al.  Beyond News Contents: The Role of Social Context for Fake News Detection , 2017, WSDM.

[6]  Sanjeev Arora,et al.  A Simple but Tough-to-Beat Baseline for Sentence Embeddings , 2017, ICLR.

[7]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[8]  Niloy Ganguly,et al.  Tabloids in the Era of Social Media? , 2017, Proc. ACM Hum. Comput. Interact..

[9]  L RubinVictoria,et al.  Truth and deception at the rhetorical structure level , 2015 .

[10]  D. Pisarevskaya RhetoRical StRuctuRe theoRy aS a FeatuRe FoR Deception Detection in newS RepoRtS in the RuSSian language , 2015 .

[11]  Niloy Ganguly,et al.  Stop Clickbait: Detecting and preventing clickbaits in online news media , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[12]  Davide Eynard,et al.  Fake News Detection on Social Media using Geometric Deep Learning , 2019, ArXiv.

[13]  Reza Zafarani,et al.  Social Media Mining: An Introduction , 2014 .

[14]  Suhang Wang,et al.  Fake News Detection on Social Media: A Data Mining Perspective , 2017, SKDD.

[15]  Victoria L. Rubin On deception and deception detection: Content analysis of computer-mediated stated beliefs , 2010, ASIST.

[16]  Eric D. Ragan,et al.  Open Issues in Combating Fake News: Interpretability as an Opportunity , 2019, ArXiv.

[17]  C. MacLeod,et al.  Attentional bias in emotional disorders. , 1986, Journal of abnormal psychology.

[18]  Benno Stein,et al.  A Stylometric Inquiry into Hyperpartisan and Fake News , 2017, ACL.

[19]  G. Bálint,et al.  [The Semmelweis-reflex]. , 2009, Orvosi hetilap.

[20]  Jing Qian,et al.  A Survey on Natural Language Processing for Fake News Detection , 2018, LREC.

[21]  Tanya Goyal,et al.  Predicting Email and Article Clickthroughs with Domain-adaptive Language Models , 2018, WebSci.

[22]  Reza Zafarani,et al.  Fake News: Fundamental Theories, Detection Strategies and Challenges , 2019, WSDM.

[23]  Yejin Choi,et al.  Syntactic Stylometry for Deception Detection , 2012, ACL.

[24]  Yongdong Zhang,et al.  News Verification by Exploiting Conflicting Social Viewpoints in Microblogs , 2016, AAAI.

[25]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[26]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[27]  William Yang Wang “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection , 2017, ACL.

[28]  Svitlana Volkova,et al.  Separating Facts from Fiction: Linguistic Models to Classify Suspicious and Trusted News Posts on Twitter , 2017, ACL.

[29]  Jiawei Han,et al.  Evaluating Event Credibility on Twitter , 2012, SDM.

[30]  Fenglong Ma,et al.  EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection , 2018, KDD.

[31]  Xia Hu,et al.  Techniques for interpretable machine learning , 2018, Commun. ACM.

[32]  Reza Zafarani,et al.  Fake News: A Survey of Research, Detection Methods, and Opportunities , 2018, ArXiv.

[33]  Sungyong Seo,et al.  CSI: A Hybrid Deep Model for Fake News Detection , 2017, CIKM.

[34]  Yanjie Fu,et al.  Fake News Detection with Deep Diffusive Network Model , 2018, ArXiv.

[35]  R. Nickerson Confirmation Bias: A Ubiquitous Phenomenon in Many Guises , 1998 .

[36]  M. Zuckerman Verbal and nonverbal communication of deception , 1981 .

[37]  David R. Karger,et al.  A Structured Response to Misinformation: Defining and Annotating Credibility Indicators in News Articles , 2018, WWW.

[38]  Daniel Jurafsky,et al.  Linguistic Models for Analyzing and Detecting Biased Language , 2013, ACL.

[39]  Huan Liu,et al.  FakeNewsNet: A Data Repository with News Content, Social Context and Dynamic Information for Studying Fake News on Social Media , 2018, ArXiv.

[40]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[41]  Supavich Pengnate Measuring Emotional Arousal in Clickbait: Eye-Tracking Approach , 2016, AMCIS.

[42]  Yimin Chen,et al.  Misleading Online Content: Recognizing Clickbait as "False News" , 2015, WMDD@ICMI.

[43]  Evgeniy Gabrilovich,et al.  A Review of Relational Machine Learning for Knowledge Graphs , 2015, Proceedings of the IEEE.

[44]  Marcia K. Johnson,et al.  Reality Monitoring , 2005 .

[45]  Georg Rehm,et al.  From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles , 2017, NLPmJ@EMNLP.

[46]  Jacob Eisenstein,et al.  Representation Learning for Text-level Discourse Parsing , 2014, ACL.

[47]  Reza Zafarani,et al.  Credibility-based Fake News Detection , 2019, Lecture Notes in Social Networks.

[48]  Ryan L. Boyd,et al.  The Development and Psychometric Properties of LIWC2015 , 2015 .

[49]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[50]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[51]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[52]  Amy M. Wisner,et al.  Information Manipulation Theory 2 , 2014 .

[53]  Reza Zafarani,et al.  Fake News Detection: An Interdisciplinary Research , 2019, WWW.

[54]  Reza Zafarani,et al.  Network-based Fake News Detection: A Pattern-driven Approach , 2019, SKDD.

[55]  Victoria L. Rubin,et al.  Truth and deception at the rhetorical structure level , 2015, J. Assoc. Inf. Sci. Technol..

[56]  G. Loewenstein The psychology of curiosity: A review and reinterpretation. , 1994 .

[57]  Sandya Mannarswamy,et al.  CIMTDetect: A Community Infused Matrix-Tensor Coupled Factorization Based Method for Fake News Detection , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[58]  Fan Yang,et al.  XFake: Explainable Fake News Detector with Visualizations , 2019, WWW.

[59]  Tim Weninger,et al.  Discriminative predicate path mining for fact checking in knowledge graphs , 2015, Knowl. Based Syst..

[60]  Verónica Pérez-Rosas,et al.  Automatic Detection of Fake News , 2017, COLING.

[61]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[62]  L. Boehm,et al.  The Validity Effect: A Search for Mediating Variables , 1994 .

[63]  Matthias Hagen,et al.  Clickbait Detection , 2016, ECIR.

[64]  Yongdong Zhang,et al.  News Credibility Evaluation on Microblog with a Hierarchical Propagation Model , 2014, 2014 IEEE International Conference on Data Mining.