The Truth is Out There: Investigating Conspiracy Theories in Text Generation

With the growing adoption of text generation models in today’s society, users are increasingly exposed to machine-generated text. This in turn can leave users vulnerable to the generation of harmful information such as conspiracy theories. While the propagation of conspiracy theories through social media has been studied, previous work has not evaluated their diffusion through text generation. In this work, we investigate the propensity for language models to generate conspiracy theory text. Our study focuses on testing these models for the elicitation of conspiracy theories and comparing these generations to humanwritten theories from Reddit. We also introduce a new dataset consisting of conspiracy theory topics, machine-generated conspiracy theories, and human-written conspiracy theories. Our experiments show that many wellknown conspiracy theory topics are deeply rooted in the pre-trained language models, and can become more prevalent through different model settings.

[1]  Thomas Wolf,et al.  DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter , 2019, ArXiv.

[2]  Michelle I. Seelig,et al.  Why do people believe COVID-19 conspiracy theories? , 2020, Harvard Kennedy School Misinformation Review.

[3]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[4]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[5]  J. Frith,et al.  COVID-19, 5G conspiracies and infrastructural futures , 2020, Media International Australia.

[6]  Karen M. Douglas,et al.  Climate change: Why the conspiracy theories are dangerous , 2015 .

[7]  Karen M. Douglas,et al.  Belief in conspiracy theories: Basic principles of an emerging research domain , 2018, European journal of social psychology.

[8]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[9]  Nanyun Peng,et al.  The Woman Worked as a Babysitter: On Biases in Language Generation , 2019, EMNLP.

[10]  Sander van der Linden,et al.  The conspiracy-effect: Exposure to conspiracy theories (about global warming) decreases pro-social behavior and science acceptance , 2015 .

[11]  Michael S. Bernstein,et al.  Empath: Understanding Topic Signals in Large-Scale Text , 2016, CHI.

[12]  Wasim Ahmed,et al.  COVID-19 and the 5G Conspiracy Theory: Social Network Analysis of Twitter Data , 2020, Journal of Medical Internet Research.

[13]  Thomas J. Wood,et al.  Conspiracy Theories and the Paranoid Style(s) of Mass Opinion , 2014 .

[14]  Guido Caldarelli,et al.  Science vs Conspiracy: Collective Narratives in the Age of Misinformation , 2014, PloS one.

[15]  Moreno Mancosu,et al.  Believing in Conspiracy Theories: Evidence from an Exploratory Analysis of Italian Survey Data , 2017 .

[16]  Lysandre Debut,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[17]  Guido Caldarelli,et al.  Emotional Dynamics in the Age of Misinformation , 2015, PloS one.