论文信息 - SASICM A Multi-Task Benchmark For Subtext Recognition - 字舞流文

SASICM A Multi-Task Benchmark For Subtext Recognition

Subtext is a kind of deep semantics which can be acquired after one or more rounds of expression transformation. As a popular way of expressing one’s intentions, it is well worth studying. In this paper, we try to make computers understand whether there is a subtext by means of machine learning. We build a Chinese dataset whose source data comes from the popular social media (e.g. Weibo, Netease Music, Zhihu, and Bilibili). In addition, we also build a baseline model called SASICM to deal with subtext recognition. The F1 score of SASICMg, whose pretrained model is GloVe, is as high as 64.37%, which is 3.97% higher than that of BERT based model, 12.7% higher than that of traditional methods on average, including support vector machine, logistic regression classifier, maximum entropy classifier, naive bayes classifier and decision tree and 2.39% higher than that of the state-of-the-art, including MARIN and BTM. The F1 score of SASICMBERT , whose pretrained model is BERT, is 65.12%, which is 0.75% higher than that of SASICMg. The accuracy rates of SASICMg and SASICMBERT are 71.16% and 70.76%, respectively, which can compete with those of other methods which are mentioned before.

Shen Furao | Jian Zhao | Hua Yan | Weikang Xiao | Feng Han | S. Furao | Wei Xiao | Feng Han | Jian Zhao | Hua Yan

[1] Neel Kant,et al. Practical Text Classification With Large Pre-Trained Language Models , 2018, ArXiv.

[2] Lei Zhang,et al. Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[3] Shalom Lappin,et al. Predicting Human Metaphor Paraphrase Judgments with Deep Neural Networks , 2018, Fig-Lang@NAACL-HLT.

[4] Shuai Wang,et al. Deep learning for sentiment analysis: A survey , 2018, WIREs Data Mining Knowl. Discov..

[5] Ruifeng Xu,et al. Context-aware Embedding for Targeted Aspect-based Sentiment Analysis , 2019, ACL.

[6] Preslav Nakov,et al. SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[7] R. Plutchik. Emotions : a general psychoevolutionary theory , 1984 .

[8] Ivan Bondarenko,et al. Conditional Random Fields for Metaphor Detection , 2018, Fig-Lang@NAACL-HLT.

[9] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[10] Carolyn Penstein Rosé,et al. Metaphor Detection with Topic Transition, Emotion and Cognition in Context , 2016, ACL.

[11] Enkhbold Bataa,et al. An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese , 2019, ACL.

[12] Patrik Lambert,et al. Attention and Lexicon Regularized LSTM for Aspect-based Sentiment Analysis , 2019, ACL.

[13] Pushpak Bhattacharyya,et al. Automatic Sarcasm Detection: A Survey , 2016 .

[14] Martha Palmer,et al. Leveraging Syntactic Constructions for Metaphor Identification , 2018, Fig-Lang@NAACL-HLT.

[15] Ning Jin,et al. Multi-Task Learning Model Based on Multi-Scale CNN and LSTM for Sentiment Classification , 2020, IEEE Access.

[16] Ekaterina Shutova,et al. Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection , 2017, EMNLP.

[17] Pushpak Bhattacharyya,et al. Learning Cognitive Features from Gaze Data for Sentiment and Sarcasm Classification using Convolutional Neural Network , 2017, ACL.

[18] Yue Zhang,et al. Tweet Sarcasm Detection Using Deep Neural Network , 2016, COLING.

[19] Md. Shad Akhtar,et al. Multi-task learning for aspect term extraction and aspect sentiment classification , 2020, Neurocomputing.

[20] Pushpak Bhattacharyya,et al. Harnessing Cognitive Features for Sarcasm Detection , 2016, ACL.

[21] Paolo Rosso,et al. IDAT at FIRE2019: Overview of the Track on Irony Detection in Arabic Tweets , 2019, FIRE.

[22] A. Feinstein,et al. High agreement but low kappa: I. The problems of two paradoxes. , 1990, Journal of clinical epidemiology.

[23] Rada Mihalcea,et al. CASCADE: Contextual Sarcasm Detection in Online Discussion Forums , 2018, COLING.

[24] Erik Cambria,et al. Sentiment and Sarcasm Classification With Multitask Learning , 2019, IEEE Intelligent Systems.

[25] Jason Baldridge,et al. Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns , 2018, TACL.

[26] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[27] Preslav Nakov,et al. SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[28] Debanjan Ghosh,et al. A Report on the 2020 Sarcasm Detection Shared Task , 2020, FIGLANG.

[29] Shu-Kai Hsieh,et al. Sarcasm Detection in Chinese Using a Crowdsourced Corpus , 2016, ROCLING.

[30] Ron Artstein. Inter-Coder Agreement for Computational Linguistics , 2008 .

[31] Frank Guerin,et al. End-to-End Sequential Metaphor Identification Inspired by Linguistic Theories , 2019, ACL.

[32] J. Sim,et al. The kappa statistic in reliability studies: use, interpretation, and sample size requirements. , 2005, Physical therapy.

[33] Rui Mao,et al. Word Embedding and WordNet Based Metaphor Identification and Interpretation , 2018, ACL.

[34] Jian Su,et al. Reasoning with Sarcasm by Reading In-Between , 2018, ACL.

[35] Benjamin Roth,et al. Joint Aspect and Polarity Classification for Aspect-based Sentiment Analysis with End-to-End Neural Networks , 2018, EMNLP.

[36] Mikhail Khodak,et al. A Large Self-Annotated Corpus for Sarcasm , 2017, LREC.

[37] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[38] Zhifang Sui,et al. Towards Fine-grained Text Sentiment Transfer , 2019, ACL.

[39] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .