论文信息 - LaSTUS/TALN at SemEval-2019 Task 6: Identification and Categorization of Offensive Language in Social Media with Attention-based Bi-LSTM model - 字舞流文

LaSTUS/TALN at SemEval-2019 Task 6: Identification and Categorization of Offensive Language in Social Media with Attention-based Bi-LSTM model

We present a bidirectional Long-Short Term Memory network for identifying offensive language in Twitter. Our system has been developed in the context of the SemEval 2019 Task 6 which comprises three different sub-tasks, namely A: Offensive Language Detection, B: Categorization of Offensive Language, C: Offensive Language Target Identification. We used a pre-trained Word Embeddings in tweet data, including information about emojis and hashtags. Our approach achieves good performance in the three sub-tasks.

Horacio Saggion | Lutfiye Seda Mut Altin | Àlex Bravo Serrano | Horacio Saggion

[1] Preslav Nakov,et al. Predicting the Type and Target of Offensive Posts in Social Media , 2019, NAACL.

[2] Yoshua Bengio,et al. Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[3] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[4] Horacio Saggion,et al. How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics , 2016, ACM Multimedia.

[5] Ingmar Weber,et al. Understanding Abuse: A Typology of Abusive Language Detection Subtasks , 2017, ALW@ACL.

[6] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[7] Tommaso Caselli,et al. Evalita 2018: Overview on the 6th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian , 2018, EVALITA@CLiC-it.

[8] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.

[9] Sérgio Nunes,et al. A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[10] Preslav Nakov,et al. SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval) , 2019, *SEMEVAL.

[11] Michael Wiegand,et al. A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.

[12] Walter Daelemans,et al. Automatic detection of cyberbullying in social media text , 2018, PloS one.

[13] Mai ElSherief,et al. Hate Lingo: A Target-based Linguistic Analysis of Hate Speech in Social Media , 2018, ICWSM.

[14] L. Hartling,et al. Prevalence and Effect of Cyberbullying on Children and Young People: A Scoping Review of Social Media Studies. , 2015, JAMA pediatrics.

[15] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[16] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[17] Theodore Chu,et al. Comment Abuse Classification with Deep Learning , 2017 .

[18] Ritesh Kumar,et al. Benchmarking Aggression Identification in Social Media , 2018, TRAC@COLING 2018.

[19] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[20] Björn Gambäck,et al. Using Convolutional Neural Networks to Classify Hate-Speech , 2017, ALW@ACL.

[21] David Robinson,et al. Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network , 2018, ESWC.

[22] Michael Wiegand,et al. Overview of the GermEval 2018 Shared Task on the Identification of Offensive Language , 2018 .

[23] Wei Shi,et al. Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification , 2016, ACL.