The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic

Fact checking by professionals is viewed as a vital defense in the fight against misinformation.While fact checking is important and its impact has been significant, fact checks could have limited visibility and may not reach the intended audience, such as those deeply embedded in polarized communities. Concerned citizens (i.e., the crowd), who are users of the platforms where misinformation appears, can play a crucial role in disseminating fact-checking information and in countering the spread of misinformation. To explore if this is the case, we conduct a data-driven study of misinformation on the Twitter platform, focusing on tweets related to the COVID-19 pandemic, analyzing the spread of misinformation, professional fact checks, and the crowd response to popular misleading claims about COVID-19. In this work, we curate a dataset of false claims and statements that seek to challenge or refute them. We train a classifier to create a novel dataset of 155,468 COVID-19-related tweets, containing 33,237 false claims and 33,413 refuting arguments.Our findings show that professional fact-checking tweets have limited volume and reach. In contrast, we observe that the surge in misinformation tweets results in a quick response and a corresponding increase in tweets that refute such misinformation. More importantly, we find contrasting differences in the way the crowd refutes tweets, some tweets appear to be opinions, while others contain concrete evidence, such as a link to a reputed source. Our work provides insights into how misinformation is organically countered in social platforms by some of their users and the role they play in amplifying professional fact checks.These insights could lead to development of tools and mechanisms that can empower concerned citizens in combating misinformation. The code and data can be found in this http URL.

[1]  Christof Schuster,et al.  A Note on the Interpretation of Weighted Kappa and its Relations to Other Rater Agreement Statistics for Metric Scales , 2004 .

[2]  Filippo Menczer,et al.  Anatomy of an online misinformation network , 2018, PloS one.

[3]  Gretchen G. Moisen,et al.  A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and Kappa , 2008 .

[4]  Cristian Danescu-Niculescu-Mizil,et al.  ConvoKit: A Toolkit for the Analysis of Conversations , 2020, SIGDIAL.

[5]  Md Momen Investigating “Who” in the Crowdsourcing of News Credibility , 2020 .

[6]  Abrar Ahmad Chughtai,et al.  COVID-19–Related Infodemic and Its Impact on Public Health: A Global Social Media Analysis , 2020, The American journal of tropical medicine and hygiene.

[7]  Preslav Nakov,et al.  Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society , 2020, EMNLP.

[8]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[9]  Y. Ortiz-Martínez,et al.  Yellow fever outbreaks and Twitter: Rumors and misinformation. , 2017, American journal of infection control.

[10]  Emily K. Vraga,et al.  Using Expert Sources to Correct Health Misinformation in Social Media , 2017 .

[11]  Kate Starbird,et al.  Rumors, False Flags, and Digital Vigilantes: Misinformation on Twitter after the 2013 Boston Marathon Bombing , 2014 .

[12]  Vincent A. Knight,et al.  Tweeting the terror: modelling the social media reaction to the Woolwich terrorist attack , 2014, Social Network Analysis and Mining.

[13]  Kathleen M. Carley,et al.  Characterizing COVID-19 Misinformation Communities Using a Novel Twitter Dataset , 2020, CIKM.

[14]  Fenglong Ma,et al.  EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection , 2018, KDD.

[15]  Gianluca Demartini,et al.  The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively? , 2020, CIKM.

[16]  Sameer Patil,et al.  Exposure to Social Engagement Metrics Increases Vulnerability to Misinformation , 2020, Harvard Kennedy School Misinformation Review.

[17]  Sameer Singh,et al.  Detecting COVID-19 Misinformation on Social Media , 2020 .

[18]  Arkaitz Zubiaga,et al.  Towards Detecting Rumours in Social Media , 2015, AAAI Workshop: AI for Cities.

[19]  Oluwaseun Ajao,et al.  Sentiment Aware Fake News Detection on Online Social Networks , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[21]  Jure Leskovec,et al.  Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes , 2016, WWW.

[22]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[23]  Sungyong Seo,et al.  COVID-19 on Social Media: Analyzing Misinformation in Twitter Conversations , 2020 .

[24]  Neil Shah,et al.  False Information on Web and Social Media: A Survey , 2018, ArXiv.

[25]  Miriam J. Metzger,et al.  The science of fake news , 2018, Science.

[26]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[27]  Amit P. Sheth,et al.  What Are People Tweeting About Zika? An Exploratory Study Concerning Its Symptoms, Treatment, Transmission, and Prevention , 2017, JMIR public health and surveillance.

[28]  Yan Jin,et al.  Seeking Formula for Misinformation Treatment in Public Health Crises: The Effects of Corrective Information Type and Source , 2019, Health communication.

[29]  Vana Kalogeraki,et al.  A Model for Identifying Misinformation in Online Social Networks , 2015, OTM Conferences.

[30]  Penelope Brown,et al.  Politeness: Some Universals in Language Usage , 1989 .

[31]  Ismini Lourentzou,et al.  Drink bleach or do what now? Covid-HeRA: A dataset for risk-informed health decision making in the presence of COVID19 misinformation , 2020, ArXiv.

[32]  Kristina Lerman,et al.  Tracking Social Media Discourse About the COVID-19 Pandemic: Development of a Public Coronavirus Twitter Data Set , 2020, JMIR public health and surveillance.

[33]  Emily K. Vraga,et al.  Do the right thing: Tone may not affect correction of misinformation on social media , 2020, Harvard Kennedy School Misinformation Review.

[34]  Fenglong Ma,et al.  Weak Supervision for Fake News Detection via Reinforcement Learning , 2019, AAAI.

[35]  Christo Wilson,et al.  Linguistic Signals under Misinformation and Fact-Checking , 2018, Proc. ACM Hum. Comput. Interact..

[36]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[37]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[38]  Marlyna Maros,et al.  Politeness Strategies in Twitter Updates of Female English Language Studies Malaysian Undergraduates , 2017 .

[39]  Tim A. Majchrzak,et al.  An Exploratory Study of COVID-19 Misinformation on Twitter , 2020, ArXiv.

[40]  Amanda Wintersieck,et al.  Debating the Truth , 2017 .

[41]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[42]  Matteo Cinelli,et al.  The COVID-19 social media infodemic , 2020, Scientific reports.

[43]  Kathleen M. Carley,et al.  Disinformation and Misinformation on Twitter during the Novel Coronavirus Outbreak , 2020, ArXiv.

[44]  Wen Chen,et al.  Neutral Bots Reveal Political Bias on Social Media , 2020, ArXiv.

[45]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[46]  Jabra Zarka,et al.  Coronavirus Goes Viral: Quantifying the COVID-19 Misinformation Epidemic on Twitter , 2020, Cureus.

[47]  Dimitrios Gunopulos,et al.  Efficient and timely misinformation blocking under varying cost constraints , 2017, Online Soc. Networks Media.

[48]  Elia Gabarron,et al.  Ebola, Twitter, and misinformation: a dangerous combination? , 2014, BMJ : British Medical Journal.

[49]  Chengkai Li,et al.  Introduction to the Special Issue on Combating Digital Misinformation and Disinformation , 2019, ACM J. Data Inf. Qual..