CoAID: COVID-19 Healthcare Misinformation Dataset

As the COVID-19 virus quickly spreads around the world, unfortunately, misinformation related to COVID-19 also gets created and spreads like wild fire. Such misinformation has caused confusion among people, disruptions in society, and even deadly consequences in health problems. To be able to understand, detect, and mitigate such COVID-19 misinformation, therefore, has not only deep intellectual values but also huge societal impacts. To help researchers combat COVID-19 health misinformation, therefore, we present CoAID (Covid-19 heAlthcare mIsinformation Dataset), with diverse COVID-19 healthcare misinformation, including fake news on websites and social platforms, along with users' social engagement about such news. CoAID includes 4,251 news, 296,000 related user engagements, 926 social platform posts about COVID-19, and ground truth labels. The dataset is available at: this https URL.

[1]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[2]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[3]  Hannah R. Meredith,et al.  The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application , 2020, Annals of Internal Medicine.

[4]  Huan Liu,et al.  FakeNewsNet: A Data Repository with News Content, Social Context and Dynamic Information for Studying Fake News on Social Media , 2018, ArXiv.

[5]  Huan Liu,et al.  dEFEND: Explainable Fake News Detection , 2019, KDD.

[6]  S. Shyam Sundar,et al.  “Fake News” Is Not Simply False Information: A Concept Explication and Taxonomy of Online Content , 2019, American Behavioral Scientist.

[7]  Suhang Wang,et al.  SAME: Sentiment-Aware Multi-Modal Embedding for Detecting Fake News , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[8]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[9]  Yelena Mejova,et al.  Fake Cures , 2018, Proc. ACM Hum. Comput. Interact..

[10]  Sungyong Seo,et al.  CSI: A Hybrid Deep Model for Fake News Detection , 2017, CIKM.

[11]  Brendan Nyhan,et al.  The effects of corrective information about disease epidemics and outbreaks: Evidence from Zika and yellow fever in Brazil , 2020, Science Advances.

[12]  P. Bordia,et al.  Rumor Psychology: Social and Organizational Approaches , 2006 .

[13]  Axel Gelfert Fake News: A Definition , 2018 .

[14]  Muhammad Ashad Kabir,et al.  Differences in Health News from Reliable and Unreliable Media , 2019, WWW.

[15]  Fatima K. Abu Salem,et al.  FA-KES: A Fake News Dataset around the Syrian War , 2019, ICWSM.

[16]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[17]  Suhang Wang,et al.  Ginger Cannot Cure Cancer: Battling Fake Health News with a Comprehensive Data Repository , 2020, ICWSM.

[18]  Filippo Menczer,et al.  Hoaxy: A Platform for Tracking Online Misinformation , 2016, WWW.

[19]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[20]  William Yang Wang “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection , 2017, ACL.

[21]  Fenglong Ma,et al.  EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection , 2018, KDD.