Hate Speech and Topic Shift in the Covid-19 Public Discourse on Social Media in Italy

The availability of large annotated corpora from social media and the development of powerful classification approaches have contributed in an unprecedented way to tackle the challenge of monitoring users’ opinions and sentiments in online social platforms across time but also arose the challenge of temporal robustness of such detection and monitoring systems. We used as case study a dataset of tweets in Italian related to the COVID-19 induced lockdown in Italy to measure how quickly the most debated topic online shifted in time. We concluded that it is a promising approach but dedicated corpora and fine tuning of algorithms are crucial for more insightful results.

[1]  Michael Wiegand,et al.  A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[4]  Viviana Patti,et al.  Time of Your Hate: The Challenge of Time in Hate Speech Detection on Social Media , 2020, Applied Sciences.

[5]  Viviana Patti,et al.  Resources and benchmark corpora for hate speech detection: a systematic review , 2020, Language Resources and Evaluation.

[6]  Giovanni Semeraro,et al.  Computational Linguistics Against Hate: Hate Speech Detection and Visualization on Social Media in the "Contro L'Odio" Project , 2019, CLiC-it.

[7]  Valerio Basile,et al.  Long-term Social Media Data Collection at the University of Turin , 2018, CLiC-it.

[8]  “Contro L’Odio”: A Platform for Detecting, Monitoring and Visualizing Hate Speech against Immigrants in Italian Social Media , 2020, Italian Journal of Computational Linguistics.

[9]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[10]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[11]  Sérgio Nunes,et al.  A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[12]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[13]  Valerio Basile,et al.  We Need to Consider Disagreement in Evaluation , 2021, BPPF.

[14]  Viviana Patti,et al.  Hurtlex: A Multilingual Lexicon of Words to Hurt , 2018, CLiC-it.

[15]  AlBERTo: Modeling Italian Social Media Language with BERT , 2019, Italian Journal of Computational Linguistics.

[16]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[17]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[18]  Leon Derczynski,et al.  Directions in abusive language training data, a systematic review: Garbage in, garbage out , 2020, PloS one.