A Pattern-Based Approach for Sarcasm Detection on Twitter

Sarcasm is a sophisticated form of irony widely used on social networks and microblogging websites. It is usually used to convey implicit information within the message a person transmits, and it may serve different purposes, such as criticism or mockery. However, it is hard to recognize, even for humans. Because a sarcastic statement often means something other than what it literally says, recognizing such statements can be very useful for improving automatic sentiment analysis of data collected from microblogging websites or social networks. Sentiment analysis refers to the identification and aggregation of attitudes and opinions expressed by Internet users toward a specific topic. In this paper, we propose a pattern-based approach to detect sarcasm on Twitter. We propose four sets of features that cover the different types of sarcasm we define, and we use them to classify tweets as sarcastic or non-sarcastic. Our proposed approach reaches an accuracy of 83.1% with a precision of 91.1%. We also study the importance of each of the proposed sets of features and evaluate its added value to the classification. In particular, we emphasize the importance of pattern-based features for the detection of sarcastic statements.
