FB-NEWS15: A Topic-Annotated Facebook Corpus for Emotion Detection and Sentiment Analysis

In this paper we present the FB-NEWS15 corpus, a new Italian resource for sentiment analysis and emotion detection. The corpus was built by crawling the Facebook pages of the most important newspapers in Italy and was organized into topics using Latent Dirichlet Allocation (LDA). We provide a preliminary analysis of the corpus, including the most debated news of 2015.
