Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous

This paper introduces a new annotated French data set for Sentiment Analysis, a resource that is currently lacking for French. The data were collected from Twitter and concern the socio-political debate over the marriage reform bill in France. We describe the design of the annotation scheme, which extends a polarity label set with tags for marking target semantic areas and figurative language devices. We then present the annotation process and discuss annotator disagreement, in particular with respect to figurative language use and semantically oriented annotation, both of which remain open challenges for NLP systems.
