Fake News Detection by means of Uncertainty Weighted Causal Graphs

Society is experiencing changes in information consumption, as new information channels such as social networks let people share news that is not necessarily trustworthy. Sometimes these sources produce fake news deliberately, with doubtful purposes, and consumers of that information share it with other users believing it to be accurate. This transmission of information is a problem for our society, as it can negatively influence people's opinion of certain figures, groups or ideas. Hence, it is desirable to design a system that is able to detect and classify information as fake and to categorize a source of information as trustworthy or not. Current systems have difficulty performing this task, as it is complicated to design an automatic procedure that can classify this information independently of the context. In this work, we propose a mechanism to detect fake news through a classifier based on weighted causal graphs. These graphs are specific hybrid models that are built from causal relations retrieved from texts and that take the uncertainty of those causal relations into account. We take advantage of this representation to use the probability distributions of the graph and build a fake news classifier based on the entropy and KL divergence of learned and new information. We believe that this model tackles the problem of fake news accurately because of its hybrid nature, combining a symbolic and a quantitative methodology. We describe the methodology of this classifier and provide empirical evidence of the usefulness of the proposed approach in the form of synthetic experiments and a real experiment involving lung cancer.
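
To make the idea of comparing learned and new information concrete, the following is a minimal sketch, not the implementation described in this work. It assumes that each causal relation (edge) of the graph carries a discrete probability distribution over three causal-strength levels, and that new information is flagged when its KL divergence from the learned distribution exceeds a threshold. The names `learned_graph` and `score_claim`, the three-level discretization, the direction of the divergence and the threshold value are all illustrative assumptions.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same support."""
    return sum(
        pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0
    )


# Hypothetical learned graph: each edge (cause, effect) maps to a distribution
# over assumed causal-strength levels (weak, moderate, strong).
learned_graph = {
    ("smoking", "lung cancer"): [0.05, 0.15, 0.80],
    ("exercise", "lung cancer"): [0.70, 0.20, 0.10],
}


def score_claim(edge, claimed, graph, threshold=1.0):
    """Compare a claimed distribution for an edge with the learned one.

    Returns None when the graph holds no evidence about the edge.
    The threshold is an arbitrary illustrative value.
    """
    if edge not in graph:
        return None
    divergence = kl_divergence(claimed, graph[edge])
    return {
        "kl": divergence,
        "learned_entropy": entropy(graph[edge]),
        "suspicious": divergence > threshold,
    }


# A claim close to what was learned, and one that contradicts it.
consistent = score_claim(("smoking", "lung cancer"), [0.10, 0.20, 0.70], learned_graph)
contradicting = score_claim(("smoking", "lung cancer"), [0.80, 0.15, 0.05], learned_graph)
```

In this sketch the entropy of the learned distribution serves only as an indication of how certain the graph is about a relation; how entropy and divergence are actually combined into a classifier is left to the methodology of the paper.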
