How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets?
暂无分享,去创建一个
[1] Vasudeva Varma,et al. Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.
[2] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[3] Bernard J. Jansen,et al. Detecting Toxicity Triggers in Online Discussions , 2019, HT.
[4] Noel Crespi,et al. A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media , 2019, COMPLEX NETWORKS.
[5] Preslav Nakov,et al. SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval) , 2019, *SEMEVAL.
[6] Ralf Krestel,et al. Challenges for Toxic Comment Classification: An In-Depth Error Analysis , 2018, ALW.
[7] Ritesh Kumar,et al. Benchmarking Aggression Identification in Social Media , 2018, TRAC@COLING 2018.
[8] Lei Gao,et al. Detecting Online Hate Speech Using Context Aware Models , 2017, RANLP.
[9] Bernard J. Jansen,et al. Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media , 2018, ICWSM.
[10] Paolo Rosso,et al. SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter , 2019, *SEMEVAL.
[11] Viviana Patti,et al. Misogyny Detection in Twitter: a Multilingual and Cross-Domain Study , 2020, Inf. Process. Manag..
[12] Michael Wiegand,et al. A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.
[13] Joachim Bingel,et al. Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection , 2018 .
[14] Yulan He,et al. Approaches to Automated Detection of Cyberbullying: A Survey , 2020, IEEE Transactions on Affective Computing.
[15] Gianluca Stringhini,et al. Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior , 2018, ICWSM.
[16] Fabrizio Sebastiani,et al. Machine learning in automated text categorization , 2001, CSUR.
[17] Dirk Hovy,et al. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.
[18] Endang Wahyu Pamungkas,et al. Cross-domain and Cross-lingual Abusive Language Detection: A Hybrid Approach with Deep Learning and a Multilingual Lexicon , 2019, ACL.
[19] Paolo Rosso,et al. Overview of the Task on Automatic Misogyny Identification at IberEval 2018 , 2018, IberEval@SEPLN.
[20] Yangqiu Song,et al. Multilingual and Multi-Aspect Hate Speech Analysis , 2019, EMNLP.
[21] Scott A. Hale,et al. Challenges and frontiers in abusive content detection , 2019, Proceedings of the Third Workshop on Abusive Language Online.
[22] Barbara Poblete,et al. Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation , 2019, SIGIR.
[23] Zeerak Waseem,et al. Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter , 2016, NLP+CSS@EMNLP.
[24] Leon Derczynski,et al. Directions in abusive language training data, a systematic review: Garbage in, garbage out , 2020, PloS one.
[25] Jan Snajder,et al. Cross-Domain Detection of Abusive Language Online , 2018, ALW.
[26] Irina Illina,et al. BERT and fastText Embeddings for Automatic Detection of Toxic Speech , 2020, 2020 International Multi-Conference on: “Organization of Knowledge and Advanced Technologies” (OCTA).
[27] Sérgio Nunes,et al. Stop PropagHate at SemEval-2019 Tasks 5 and 6: Are abusive language classification results reproducible? , 2019, *SEMEVAL.
[28] Maite Taboada,et al. The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments , 2019, Corpus pragmatics : international journal of corpus linguistics and pragmatics.
[29] Sérgio Nunes,et al. A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..
[30] Mauro Conti,et al. All You Need is "Love": Evading Hate Speech Detection , 2018, ArXiv.
[31] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.
[32] Björn Gambäck,et al. Studying Generalisability across Abusive Language Detection Datasets , 2019, CoNLL.
[33] Kevin Gimpel,et al. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.
[34] Tomas Mikolov,et al. Advances in Pre-Training Distributed Word Representations , 2017, LREC.
[35] Jorge Pérez,et al. Hate speech detection is not as easy as you may think: A closer look at model validation (extended version) , 2020, Inf. Syst..
[36] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[37] Jenq-Haur Wang,et al. Vulnerable community identification using hate speech detection on social media , 2020, Inf. Process. Manag..
[38] Lucas Dixon,et al. Ex Machina: Personal Attacks Seen at Scale , 2016, WWW.
[39] Cody Buntain,et al. A Large Labeled Corpus for Online Harassment Research , 2017, WebSci.
[40] Bernard J. Jansen,et al. Developing an online hate classifier for multiple social media platforms , 2020, Human-centric Computing and Information Sciences.
[41] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.
[42] Paolo Rosso,et al. Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI) , 2018, EVALITA@CLiC-it.
[43] Paula Fortuna,et al. Toxic, Hateful, Offensive or Abusive? What Are We Really Classifying? An Empirical Analysis of Hate Speech Datasets , 2020, LREC.
[44] Peter Norvig,et al. The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.
[45] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[46] Ona de Gibert,et al. Hate Speech Dataset from a White Supremacy Forum , 2018, ALW.
[47] Eric Gilbert,et al. The Bag of Communities: Identifying Abusive Behavior Online with Preexisting Internet Data , 2017, CHI.
[48] Sérgio Nunes,et al. A Hierarchically-Labeled Portuguese Hate Speech Dataset , 2019, Proceedings of the Third Workshop on Abusive Language Online.