Self-Deprecating Sarcasm Detection: An Amalgamation of Rule-Based and Machine Learning Approach

Sarcasm is a special category of figurative language, used mainly on online social media to convey messages with implicit meaning and criticism. Sarcastic remarks typically rely on words or phrases that express contempt, ridicule, bitterness, taunting, or mockery. Although sarcasm detection is a well-studied problem, to the best of our knowledge no prior work has considered self-deprecating sarcasm, a special category of sarcasm in which users deprecate or criticize themselves using sarcastic phrases. In this paper, we propose a novel self-deprecating sarcasm detection approach that combines rule-based and machine learning techniques. The rule-based techniques identify candidate self-referential tweets, whereas the machine learning techniques are used for feature extraction and classification. A total of 11 features, comprising six self-deprecating features and five hyperbolic features, are identified to train three different classifiers: decision tree, naïve Bayes, and bagging. The proposed approach is evaluated on a Twitter dataset containing 107,536 tweets and compared with some state-of-the-art methods for sarcasm detection.
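The two-stage idea described above (a rule-based filter for candidate tweets, followed by feature extraction for a classifier) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not list the actual rules or the 11 features, so the pronoun pattern, the cue word lists, and the function names `is_self_referential`, `hyperbolic_features`, and `candidate_features` are hypothetical stand-ins, and only four toy features are computed.

```python
import re

# Hypothetical first-person cues; the paper's actual rules are not given in the abstract.
SELF_PATTERN = re.compile(r"\b(i|me|my|myself|mine)\b", re.IGNORECASE)

# Hypothetical hyperbole cue lists, chosen only for illustration.
INTENSIFIERS = {"so", "totally", "absolutely", "really", "very"}
INTERJECTIONS = {"wow", "yay", "oh", "great", "ugh"}


def is_self_referential(tweet: str) -> bool:
    """Rule-based stage: keep tweets that mention the author in the first person."""
    return SELF_PATTERN.search(tweet) is not None


def hyperbolic_features(tweet: str) -> dict:
    """Feature stage: count a few toy hyperbole cues in the tweet."""
    tokens = re.findall(r"[a-z']+", tweet.lower())
    return {
        "intensifiers": sum(t in INTENSIFIERS for t in tokens),
        "interjections": sum(t in INTERJECTIONS for t in tokens),
        "exclamations": tweet.count("!"),
        # 1 if any word character repeats three or more times in a row ("sooo")
        "elongated": int(bool(re.search(r"(\w)\1{2,}", tweet))),
    }


def candidate_features(tweets):
    """Run the filter, then extract features only for the surviving candidates."""
    return [(t, hyperbolic_features(t)) for t in tweets if is_self_referential(t)]


if __name__ == "__main__":
    sample = ["Wow, I am sooo good at this!!", "The weather is nice today"]
    for tweet, feats in candidate_features(sample):
        print(tweet, feats)
```

In a full pipeline, the resulting feature dictionaries would be turned into vectors and passed to a classifier such as a decision tree, naïve Bayes, or a bagging ensemble; that training step is omitted here because the abstract gives no details about it.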
