Paper information - Saagie at Semeval-2019 Task 5: From Universal Text Embeddings and Classical Features to Domain-specific Text Classification

This paper describes our contribution to SemEval 2019 Task 5: HatEval. We investigate how a domain-specific text classification task can benefit from pretrained state-of-the-art language models, and how these models can be combined with classical handcrafted features. For this purpose, we propose an approach based on a feature-level Meta-Embedding that lets the model choose which features to keep and how to use them.
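The abstract does not spell out how the feature-level Meta-Embedding is built, so the following is only a minimal sketch of one common way to realise the idea, not the authors' implementation. It assumes that each feature group (for example a pretrained sentence embedding and a vector of handcrafted features) is linearly projected into a shared space and that a softmax over per-group scores decides how much each group contributes. The names `meta_embed`, `projections` and `scorer`, the dimensions, and the random weights are all hypothetical; in a real model the projections and the scorer would be learned jointly with the classifier.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def meta_embed(feature_groups, projections, scorer):
    """Combine several feature groups into one vector per example.

    feature_groups: list of arrays, group k has shape (batch, d_k)
    projections:    list of arrays, projection k has shape (d_k, d)
    scorer:         array of shape (d,), scores each projected group
    """
    # Project every group into a shared d-dimensional space: (batch, groups, d)
    projected = np.stack(
        [x @ w for x, w in zip(feature_groups, projections)], axis=1
    )
    # One scalar score per group and example, normalised across groups
    weights = softmax(projected @ scorer, axis=1)              # (batch, groups)
    # The weighted sum over groups is the combined representation
    combined = (weights[..., None] * projected).sum(axis=1)    # (batch, d)
    return combined, weights

# Hypothetical stand-ins for real inputs (random values, illustration only)
rng = np.random.default_rng(0)
batch, d = 4, 8
sentence_emb = rng.normal(size=(batch, 16))  # e.g. a pretrained sentence embedding
handcrafted = rng.normal(size=(batch, 5))    # e.g. handcrafted lexical features
groups = [sentence_emb, handcrafted]
projections = [rng.normal(size=(g.shape[1], d)) for g in groups]
scorer = rng.normal(size=d)

combined, weights = meta_embed(groups, projections, scorer)
print(combined.shape, weights.shape)
```

In this sketch the `weights` array gives, for every example, a distribution over feature groups, which is one way to read "letting the model choose which features to keep". The `combined` vectors would then be passed to a downstream classifier.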

Miriam Benballa | Sebastien Collet | Romain Picot-Clémente
