An empirical evaluation of text representation schemes to filter the social media stream

Modeling text in a numerical representation is a prime task for any Natural Language Processing downstream task such as text classification. This paper attempts to study the effectiveness of text r...

[1]  Thomas Mandl Tolerant Information Retrieval with Backpropagation Networks , 2000, Neural Computing & Applications.

[2]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[3]  Prasenjit Majumder,et al.  Tracking Hate in Social Media: Evaluation, Challenges and Approaches , 2020, SN Computer Science.

[4]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[5]  Prasenjit Majumder,et al.  Detecting and visualizing hate speech in social media: A cyber Watchdog for surveillance , 2020, Expert Syst. Appl..

[6]  Shervin Malmasi,et al.  Detecting Hate Speech in Social Media , 2017, RANLP.

[7]  Prasenjit Majumder,et al.  Overview of the HASOC track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages , 2019, FIRE.

[8]  Vasudeva Varma,et al.  Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[9]  Rudresh Panchal,et al.  Online hatred of women in the Incels.me forum , 2019, Journal of Language Aggression and Conflict.

[10]  Monojit Choudhury,et al.  Overview of the FIRE 2018 track: Information Retrieval from Microblogs during Disasters (IRMiDis) , 2018, FIRE.

[11]  Naganna Chetty,et al.  Hate speech review in the context of online social networks , 2018 .

[12]  Bing Xue,et al.  In Data We Trust: A Critical Analysis of Hate Speech Detection Datasets , 2020, ALW.

[13]  Gerhard Weikum,et al.  Graph-based text classification: learn from your neighbors , 2006, SIGIR.

[14]  Anil K. Jain,et al.  Classification of text documents , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[15]  S. T. Dumais,et al.  Using latent semantic analysis to improve access to textual information , 1988, CHI '88.

[16]  Diego R. Amancio,et al.  Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks , 2016, PloS one.

[17]  Paolo Rosso,et al.  The Battle Against Online Harmful Information: The Cases of Fake News and Hate Speech , 2020, CIKM.

[18]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[19]  Walter Daelemans,et al.  Automatic detection of cyberbullying in social media text , 2018, PloS one.