论文信息 - Recognition of Sarcasm in Microblogging Based on Sentiment Analysis and Coherence Identification - 字舞流文

Recognition of Sarcasm in Microblogging Based on Sentiment Analysis and Coherence Identification

Recognition of sarcasm in microblogging is important in a range of NLP applications, such as opinion mining. However, this is a challenging task, as the real meaning of a sarcastic sentence is the opposite of the literal meaning. Furthermore, microblogging messages are short and usually written in a free style that may include misspellings, grammatical errors, and complex sentence structures. This paper proposes a novel method for identifying sarcasm in tweets. It combines two supervised classifiers, a Support Vector Machine (SVM) using N-gram features and an SVM using our proposed features. Our features represent the intensity and contradictions of sentiment in a tweet, derived by sentiment analysis. The sentiment contradiction feature also considers coherence among multiple sentences in the tweet, and this is automatically identified by our proposed method using unsupervised clustering and an adaptive genetic algorithm. Furthermore, a method for identifying the concepts of unknown sentiment words is used to compensate for gaps in the sentiment lexicon. Our method also considers punctuation and the special symbols that are frequently used in Twitter messaging. Experiments using two datasets demonstrated that our proposed system outperformed baseline systems on one dataset, while producing comparable results on the other. Accuracy of 82% and 76% was achieved in sarcasm identification on the two datasets.

Masnizah Mohd | Piyoros Tungthamthiti | Kiyoaki Shirai | Kiyoaki Shirai | M. Mohd | Piyoros Tungthamthiti

[1] R. Singh. HUMOUR, IRONY AND SATIRE IN LITERATURE , 2013 .

[2] Masnizah Mohd,et al. Recognition of Sarcasms in Tweets Based on Concept Level Sentiment Analysis and Supervised Learning Approaches , 2014, PACLIC.

[3] G. McLachlan,et al. The EM algorithm and extensions , 1996 .

[4] Mike Thelwall,et al. Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[5] Horacio Saggion,et al. Modelling Irony in Twitter , 2014, EACL.

[6] Ellen Riloff,et al. Sarcasm as Contrast between a Positive Sentiment and Negative Situation , 2013, EMNLP.

[7] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.

[8] Vaibhavi N Patodkar,et al. Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[9] Ari Rappoport,et al. ICWSM - A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews , 2010, ICWSM.

[10] Philip Resnik,et al. Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[11] Patrick Paroubek,et al. Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[12] F. Stringfellow,et al. The Meaning of Irony: A Psychoanalytic Investigation , 1994 .

[13] R. Doerfler. A Comedy of Errors or, How I Learned to Stop Worrying and Love Sensibility-Invariantism about ‘Funny’ , 2012 .

[14] Chu-Ren Huang,et al. Sentiment Analyzer with Rich Features for Ironic and Sarcastic Tweets , 2015, PACLIC.

[15] R. Beaugrande,et al. Introduction to text linguistics , 1981 .

[16] F. Bartlett,et al. Remembering: A Study in Experimental and Social Psychology , 1932 .

[17] Hwee Tou Ng,et al. A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[18] Mário J. Silva,et al. Clues for detecting irony in user-generated contents: oh...!! it's "so easy" ;-) , 2009, TSA@CIKM.

[19] Herbert L. Colston,et al. Irony in Language and Thought : A Cognitive Science Reader , 2007 .

[20] Erik Cambria,et al. Sentic Computing: Techniques, Tools, and Applications , 2012 .

[21] Paolo Rosso,et al. A multidimensional approach for detecting irony in Twitter , 2013, Lang. Resour. Evaluation.

[22] Rutvija Pandya,et al. C5.0 Algorithm to Improved Decision Tree with Feature Selection and Reduced Error Pruning , 2015 .

[23] G. McLachlan,et al. The EM Algorithm and Extensions: Second Edition , 2008 .

[24] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[25] Chu-Ren Huang,et al. LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets , 2015, SemEval@NAACL-HLT.

[26] Ari Rappoport,et al. Semi-Supervised Recognition of Sarcasm in Twitter and Amazon , 2010, CoNLL.

[27] Paolo Rosso,et al. Making objective decisions from subjective data: Detecting irony in customer reviews , 2012, Decis. Support Syst..