Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series ‘Friends’

This paper presents a novel study that treats sarcasm detection in dialogue as a sequence labeling task, where a dialogue is a sequence of utterances. We create a manually labeled dataset of dialogue from the TV series ‘Friends’, annotated with sarcasm. Our goal is to predict sarcasm in each utterance using the sequential nature of a scene. We show a performance gain from sequence labeling over classification-based approaches. Our experiments use three sets of features: one derived from information in our dataset, and two taken from past work. Two sequence labeling algorithms (SVM-HMM and SEARN) outperform three classification algorithms (SVM, Naive Bayes) on all of these feature sets, with an increase in F-score of around 4%. Our observations highlight the viability of sequence labeling techniques for sarcasm detection in dialogue.
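To make the contrast between the two framings concrete, the following is a minimal sketch, not the SVM-HMM or SEARN systems used in the paper. It assumes each utterance already has a per-label score from some classifier, and that a table of label-to-label transition scores is available; the function names, the scores, and the transition values are all hypothetical and chosen only for illustration. The classification view labels each utterance independently, while the sequence labeling view uses Viterbi decoding to choose the best label sequence for the whole scene.

```python
LABELS = ("not_sarcastic", "sarcastic")


def independent_labels(emission_scores):
    """Classification view: label each utterance from its own scores only."""
    return [max(LABELS, key=lambda lab: scores[lab]) for scores in emission_scores]


def viterbi_labels(emission_scores, transition_scores):
    """Sequence labeling view: pick the label sequence with the highest
    total of per-utterance scores plus label-to-label transition scores."""
    if not emission_scores:
        return []
    # best[t][lab] = best total score of any label sequence ending in lab at t
    best = [{lab: emission_scores[0][lab] for lab in LABELS}]
    back = []  # back[t - 1][lab] = best previous label when utterance t gets lab
    for scores in emission_scores[1:]:
        row, ptr = {}, {}
        for lab in LABELS:
            prev = max(LABELS, key=lambda p: best[-1][p] + transition_scores[(p, lab)])
            row[lab] = best[-1][prev] + transition_scores[(prev, lab)] + scores[lab]
            ptr[lab] = prev
        best.append(row)
        back.append(ptr)
    # Follow the back-pointers from the best final label to recover the path.
    path = [max(LABELS, key=lambda lab: best[-1][lab])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return path[::-1]


# Hypothetical scores for a three-utterance scene (not data from the paper).
scene = [
    {"not_sarcastic": 2.0, "sarcastic": 0.0},
    {"not_sarcastic": 0.4, "sarcastic": 0.6},  # weak, ambiguous evidence
    {"not_sarcastic": 2.0, "sarcastic": 0.0},
]
# Hypothetical transition scores, keyed by (previous label, current label).
transitions = {
    ("not_sarcastic", "not_sarcastic"): 0.5,
    ("not_sarcastic", "sarcastic"): -0.5,
    ("sarcastic", "not_sarcastic"): 0.0,
    ("sarcastic", "sarcastic"): 0.0,
}

print(independent_labels(scene))
print(viterbi_labels(scene, transitions))
```

With these made-up numbers, the independent labeler marks the ambiguous middle utterance as sarcastic, whereas the sequence decoder labels all three utterances as not sarcastic because the transition scores outweigh the weak local evidence. Whether context pushes a prediction toward or away from sarcasm in practice depends on what a model learns from the annotated scenes.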
