论文信息 - How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets? - 字舞流文

How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets?

Paula Fortuna | Leo Wanner | Juan Soler | L. Wanner | Paula Fortuna | Juan Soler

[1] Vasudeva Varma,et al. Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[2] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3] Bernard J. Jansen,et al. Detecting Toxicity Triggers in Online Discussions , 2019, HT.

[4] Noel Crespi,et al. A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media , 2019, COMPLEX NETWORKS.

[5] Preslav Nakov,et al. SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval) , 2019, *SEMEVAL.

[6] Ralf Krestel,et al. Challenges for Toxic Comment Classification: An In-Depth Error Analysis , 2018, ALW.

[7] Ritesh Kumar,et al. Benchmarking Aggression Identification in Social Media , 2018, TRAC@COLING 2018.

[8] Lei Gao,et al. Detecting Online Hate Speech Using Context Aware Models , 2017, RANLP.

[9] Bernard J. Jansen,et al. Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media , 2018, ICWSM.

[10] Paolo Rosso,et al. SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter , 2019, *SEMEVAL.

[11] Viviana Patti,et al. Misogyny Detection in Twitter: a Multilingual and Cross-Domain Study , 2020, Inf. Process. Manag..

[12] Michael Wiegand,et al. A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.

[13] Joachim Bingel,et al. Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection , 2018 .

[14] Yulan He,et al. Approaches to Automated Detection of Cyberbullying: A Survey , 2020, IEEE Transactions on Affective Computing.

[15] Gianluca Stringhini,et al. Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior , 2018, ICWSM.

[16] Fabrizio Sebastiani,et al. Machine learning in automated text categorization , 2001, CSUR.

[17] Dirk Hovy,et al. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.

[18] Endang Wahyu Pamungkas,et al. Cross-domain and Cross-lingual Abusive Language Detection: A Hybrid Approach with Deep Learning and a Multilingual Lexicon , 2019, ACL.

[19] Paolo Rosso,et al. Overview of the Task on Automatic Misogyny Identification at IberEval 2018 , 2018, IberEval@SEPLN.

[20] Yangqiu Song,et al. Multilingual and Multi-Aspect Hate Speech Analysis , 2019, EMNLP.

[21] Scott A. Hale,et al. Challenges and frontiers in abusive content detection , 2019, Proceedings of the Third Workshop on Abusive Language Online.

[22] Barbara Poblete,et al. Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation , 2019, SIGIR.

[23] Zeerak Waseem,et al. Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter , 2016, NLP+CSS@EMNLP.

[24] Leon Derczynski,et al. Directions in abusive language training data, a systematic review: Garbage in, garbage out , 2020, PloS one.

[25] Jan Snajder,et al. Cross-Domain Detection of Abusive Language Online , 2018, ALW.

[26] Irina Illina,et al. BERT and fastText Embeddings for Automatic Detection of Toxic Speech , 2020, 2020 International Multi-Conference on: “Organization of Knowledge and Advanced Technologies” (OCTA).

[27] Sérgio Nunes,et al. Stop PropagHate at SemEval-2019 Tasks 5 and 6: Are abusive language classification results reproducible? , 2019, *SEMEVAL.

[28] Maite Taboada,et al. The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments , 2019, Corpus pragmatics : international journal of corpus linguistics and pragmatics.

[29] Sérgio Nunes,et al. A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[30] Mauro Conti,et al. All You Need is "Love": Evading Hate Speech Detection , 2018, ArXiv.

[31] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[32] Björn Gambäck,et al. Studying Generalisability across Abusive Language Detection Datasets , 2019, CoNLL.

[33] Kevin Gimpel,et al. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[34] Tomas Mikolov,et al. Advances in Pre-Training Distributed Word Representations , 2017, LREC.

[35] Jorge Pérez,et al. Hate speech detection is not as easy as you may think: A closer look at model validation (extended version) , 2020, Inf. Syst..

[36] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[37] Jenq-Haur Wang,et al. Vulnerable community identification using hate speech detection on social media , 2020, Inf. Process. Manag..

[38] Lucas Dixon,et al. Ex Machina: Personal Attacks Seen at Scale , 2016, WWW.

[39] Cody Buntain,et al. A Large Labeled Corpus for Online Harassment Research , 2017, WebSci.

[40] Bernard J. Jansen,et al. Developing an online hate classifier for multiple social media platforms , 2020, Human-centric Computing and Information Sciences.

[41] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.

[42] Paolo Rosso,et al. Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI) , 2018, EVALITA@CLiC-it.

[43] Paula Fortuna,et al. Toxic, Hateful, Offensive or Abusive? What Are We Really Classifying? An Empirical Analysis of Hate Speech Datasets , 2020, LREC.

[44] Peter Norvig,et al. The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.

[45] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[46] Ona de Gibert,et al. Hate Speech Dataset from a White Supremacy Forum , 2018, ALW.

[47] Eric Gilbert,et al. The Bag of Communities: Identifying Abusive Behavior Online with Preexisting Internet Data , 2017, CHI.

[48] Sérgio Nunes,et al. A Hierarchically-Labeled Portuguese Hate Speech Dataset , 2019, Proceedings of the Third Workshop on Abusive Language Online.