Do We Criticise (and Laugh) in the Same Way? Automatic Detection of Multi-Lingual Satirical News in Twitter

In recent years, methods for automatically detecting and characterising the figurative traits of text have attracted growing interest. The ability to handle figurative language, and satire in particular, is fundamental to building robust approaches in several sub-fields of Artificial Intelligence, including Sentiment Analysis and Affective Computing. In this paper we investigate the automatic detection of Tweets that advertise satirical news in English, Spanish and Italian. To this end, we present a system that models Tweets from different languages with a set of language-independent features describing lexical, semantic and usage-related properties of the words in each Tweet. We treat satire identification as a binary classification problem: each Tweet is labelled as satirical or non-satirical. We evaluate the effectiveness of our features for satire detection in both monolingual and cross-language classification experiments. Our system outperforms a word-based baseline and recognises with good accuracy whether a news item on Twitter is satirical. We also analyse how the system behaves across the different languages.
