Sarcasm detection using machine learning algorithms in Twitter: A systematic review

Recognizing both literal and figurative meanings is crucial to understanding users’ opinions on various topics or events in social media. Detecting the sarcastic posts on social media has received much attention recently, particularly because sarcastic comments in the form of tweets often include positive words that represent negative or undesirable characteristics. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement was used to understand the application of different machine learning algorithms for sarcasm detection in Twitter. Extensive database searching led to the inclusion of 31 studies classified into two groups: Adapted Machine Learning Algorithms (AMLA) and Customized Machine Learning Algorithms (CMLA). The review results revealed that Support Vector Machine (SVM) was the best and the most commonly used AMLA for sarcasm detection in Twitter. In addition, combining Convolutional Neural Network (CNN) and SVM was found to offer a high prediction accuracy. Moreover, our result showed that using lexical, pragmatic, frequency, and part-of-speech tagging can contribute to the performance of SVM, whereas both lexical and personal features can enhance the performance of CNN-SVM. This work also addressed the main challenges faced by prior scholars when predicting sarcastic tweets. Such knowledge can be useful for future researchers or machine learning developers to consider the major issues of classifying sarcastic posts in social media.

[1]  Horacio Saggion,et al.  UPF-taln: SemEval 2015 Tasks 10 and 11. Sentiment Analysis of Literal and Figurative Language in Twitter , 2015, SemEval@NAACL-HLT.

[2]  Prabhat Ranjan,et al.  Proposed Approach for Sarcasm Detection in Twitter , 2017 .

[3]  Saachi Jain,et al.  The Lowest Form of Wit : Identifying Sarcasm in Social Media , 2015 .

[4]  Praveen Thokala,et al.  Preferred Reporting Items for Systematic Reviews and Meta-analyses statement checklist , 2012 .

[5]  Tomoaki Ohtsuki,et al.  Multi-Class Sentiment Analysis in Twitter: What if Classification is Not the Answer , 2018, IEEE Access.

[6]  Masnizah Mohd,et al.  Recognition of Sarcasms in Tweets Based on Concept Level Sentiment Analysis and Supervised Learning Approaches , 2014, PACLIC.

[7]  Han Ren,et al.  Context-augmented convolutional neural networks for twitter sarcasm detection , 2018, Neurocomputing.

[8]  Erik Cambria,et al.  A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks , 2016, COLING.

[9]  Ari Rappoport,et al.  Semi-Supervised Recognition of Sarcasm in Twitter and Amazon , 2010, CoNLL.

[10]  Khaldoon Dhou An Innovative Employment of Virtual Humans to Explore the Chess Personalities of Garry Kasparov and Other Class-A Players , 2019, HCI.

[11]  Simon Fong,et al.  Hierarchical classification in text mining for sentiment analysis of online news , 2014, Soft Computing.

[12]  Hosam Al-Samarraie,et al.  Towards incorporating personality into the design of an interface: a method for facilitating users’ interaction with the display , 2018, User Modeling and User-Adapted Interaction.

[13]  Mihaela Cocea,et al.  Detecting Sarcasm from Students' Feedback in Twitter , 2015, EC-TEL.

[14]  Nivid Limbasiya,et al.  Feature based Composite Approach for Sarcasm Detection using MapReduce , 2018, 2018 Second International Conference on Computing Methodologies and Communication (ICCMC).

[15]  Yue Zhang,et al.  Tweet Sarcasm Detection Using Deep Neural Network , 2016, COLING.

[16]  Jian Su,et al.  Reasoning with Sarcasm by Reading In-Between , 2018, ACL.

[17]  Masnizah Mohd,et al.  Recognition of Sarcasm in Microblogging Based on Sentiment Analysis and Coherence Identification , 2016 .

[18]  Davide Buscaldi,et al.  From humor recognition to irony detection: The figurative language of social media , 2012, Data Knowl. Eng..

[19]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[20]  David Bamman,et al.  Contextualized Sarcasm Detection on Twitter , 2015, ICWSM.

[21]  M. Ragan,et al.  A novel alignment-free method for detection of lateral genetic transfer based on TF-IDF , 2016, Scientific Reports.

[22]  Herbert L. Colston,et al.  On necessary conditions for verbal irony comprehension , 2000 .

[23]  Dirk Hovy,et al.  Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations , 2016, ACL.

[24]  Osmar R. Zaïane,et al.  Current State of Text Sentiment Analysis from Opinion to Emotion Mining , 2017, ACM Comput. Surv..

[25]  Nina Wacholder,et al.  Identifying Sarcasm in Twitter: A Closer Look , 2011, ACL.

[26]  Byron C. Wallace,et al.  Modelling Context with User Embeddings for Sarcasm Detection in Social Media , 2016, CoNLL.

[27]  Elise Lavoué,et al.  Design for Teaching and Learning in a Networked World: 10th European Conference on Technology Enhanced Learning, EC-TEL 2015, Toledo, Spain, September 15-18, 2015, Proceedings , 2015, EC-TEL.

[28]  Samer Muthana Sarsam,et al.  Personality and individual differences: the potential of using preferences for visual stimuli to predict the Big Five traits , 2018, Cognition, Technology & Work.

[29]  Herbert L. Colston,et al.  Irony in Language and Thought : A Cognitive Science Reader , 2007 .

[30]  Navjyoti Singh,et al.  Sarcasm Detection : Building a Contextual Hierarchy , 2016, PEOPLES@COLING.

[31]  Jun Hong,et al.  Sarcasm Detection on Czech and English Twitter , 2014, COLING.

[32]  Tomoaki Ohtsuki,et al.  A Pattern-Based Approach for Sarcasm Detection on Twitter , 2016, IEEE Access.

[33]  Pushpak Bhattacharyya,et al.  Harnessing Context Incongruity for Sarcasm Detection , 2015, ACL.

[34]  Hosam Al-Samarraie,et al.  A First Look at the Effectiveness of Personality Dimensions in Promoting Users’ Satisfaction With the System , 2018 .

[35]  Hosam Al-Samarraie,et al.  Geo-spatial-based Emotions: A Mechanism for Event Detection in Microblogs , 2019, ICSCA.

[36]  Longzhuang Li,et al.  Recognizing and regulating e-learners' emotions based on interactive Chinese texts in e-learning systems , 2014, Knowl. Based Syst..

[38]  R. Gibbs Irony in Talk Among Friends , 2000 .

[39]  Sandeepa Kannangara,et al.  Mining Twitter for Fine-Grained Political Opinion Polarity Classification, Ideology Detection and Sarcasm Detection , 2018, WSDM.

[40]  Pushpak Bhattacharyya,et al.  How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text , 2016, LaTeCH@ACL.

[41]  Tony Veale,et al.  Fracking Sarcasm using Neural Network , 2016, WASSA@NAACL-HLT.

[42]  Tomoaki Ohtsuki,et al.  Opinion mining in Twitter: How to make use of sarcasm to enhance sentiment analysis , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[43]  B. Ramesh,et al.  Evaluation of Stemming Techniques for Text Classification , 2015 .

[44]  Erik Cambria,et al.  The CLSA Model: A Novel Framework for Concept-Level Sentiment Analysis , 2015, CICLing.

[45]  Sanjay D. Bhanderi,et al.  A Survey on Pre-processing Techniques for Text Mining , 2014 .

[46]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[47]  Héctor Cerezo-Costas,et al.  Gradiant-Analytics: Training Polarity Shifters with CRFs for Message Level Polarity Detection , 2015, SemEval@NAACL-HLT.

[48]  S. Glucksberg,et al.  How about another piece of pie: the allusional pretense theory of discourse irony. , 1995, Journal of experimental psychology. General.

[49]  Maria Leonor Pacheco,et al.  of the Association for Computational Linguistics: , 2001 .

[50]  Ellen Riloff,et al.  Sarcasm as Contrast between a Positive Sentiment and Negative Situation , 2013, EMNLP.

[51]  Khaldoon Dhou,et al.  Towards a Better Understanding of Chess Players' Personalities: A Study Using Virtual Chess Players , 2018, HCI.

[52]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[53]  Mona Masood,et al.  The role of personality characteristics in informing our preference for visual presentation: An eye movement study , 2016, J. Ambient Intell. Smart Environ..

[54]  Reza Zafarani,et al.  Sarcasm Detection on Twitter: A Behavioral Modeling Approach , 2015, WSDM.

[55]  Rodney D. Nielsen,et al.  Detecting Sarcasm is Extremely Easy ;-) , 2018 .

[56]  Roger J. Kreuz,et al.  Distinguishing Sarcasm From Literal Language: Evidence From Books and Blogging , 2013 .

[57]  Samer Muthana Sarsam Reinforcing the Decision-making Process in Chemometrics: Feature Selection and Algorithm Optimization , 2019, ICSCA.

[58]  Pushpak Bhattacharyya,et al.  Automatic Sarcasm Detection: A Survey , 2016 .

[59]  Debanjan Ghosh,et al.  Sarcastic or Not: Word Embeddings to Predict the Literal or Sarcastic Meaning of Words , 2015, EMNLP.

[60]  B. S. Harish,et al.  Sentiment analysis for sarcasm detection on streaming short text data , 2017, 2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA).

[61]  Derwin Suhartono,et al.  Single Document Automatic Text Summarization using Term Frequency-Inverse Document Frequency (TF-IDF) , 2016 .