Detecting Rumor and Disinformation by Web Mining

A method for determining whether given text is a rumor or disinformation is proposed, based on web mining and linguistic technology comparing two paragraphs of text. We hypothesize about a family of content generation algorithms which are capable of producing disinformation from a portion of genuine, original text. We then propose a disinformation detection algorithm which finds a candidate source of text on the web and compares it with the given text, applying parse thicket technology. Parse thicket is graph combined from a sequence of parse trees augmented with inter-sentence relations for anaphora and rhetoric structures. We evaluate our algorithm in the domain of customer reviews, considering a product review as an instance of possible disinformation. It is confirmed as a plausible way to detect rumor and disinformation in a web document. Linguistic approach presented here complements social network structure-based described on a corpus of research on disinformation detection.

[1]  Boris A. Galitsky,et al.  A Web Mining Tool for Assistance with Creative Writing , 2013, ECIR.

[2]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[3]  Johanna Weiss,et al.  Conquest In Cyberspace National Security And Information Warfare , 2016 .

[4]  Jacob Ratkiewicz,et al.  Detecting and Tracking the Spread of Astroturf Memes in Microblog Streams , 2010, ArXiv.

[5]  Boris A. Galitsky Transfer learning of syntactic structures for building taxonomies for search engines , 2013, Eng. Appl. Artif. Intell..

[6]  Josep Lluís de la Rosa i Esteva,et al.  Inferring the semantic properties of sentences by mining syntactic parse trees , 2012, Data Knowl. Eng..

[7]  Eunsoo Seo,et al.  Identifying rumors and their sources in social networks , 2012, Defense + Commercial Sensing.

[8]  Wei Xiaohu Competitive Intelligence Gathering in Casting Exhibition , 2014 .

[9]  Bernd Carsten Stahl,et al.  On the Difference or Equality of Information, Misinformation, and Disinformation: A Critical Research Perspective , 2006, Informing Sci. Int. J. an Emerg. Transdiscipl..

[10]  Kevin Robert Canini,et al.  Finding Credible Information Sources in Social Networks Based on Content and Social Structure , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[11]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[12]  G. Geethakumari,et al.  Detecting misinformation in online social networks using cognitive psychology , 2014, Human-centric Computing and Information Sciences.

[13]  Boris A. Galitsky Machine learning of syntactic parse trees for search and classification of text , 2013, Eng. Appl. Artif. Intell..