An Approach to Subjectivity Detection on Twitter Using the Structured Information

In this paper, we propose an approach to the subjectivity detection on Twitter micro texts that explores the uses of the structured information of the social network framework. The sentiment analysis on Twitter has been usually performed through the automatic processing of the texts. However, the established limit of 140 characters and the particular characteristics of the texts reduce drastically the accuracy of Natural Language Processing (NLP) techniques. Under these circumstances, it becomes necessary to study new data sources that allow us to extract new useful knowledge to represent and classify the texts. The structured information, also called meta-information or meta-data, provide us with alternative features of the texts that can improve the classification tasks. In this study we have analysed the use of features extracted from the structured information in the subjectivity detection task, as a first step of the polarity detection task, and their integration with classical features.

[1]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[2]  Miguel A. Alonso,et al.  A review on political analysis and social media , 2016, Proces. del Leng. Natural.

[3]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[4]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[5]  David Yarowsky,et al.  Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media , 2013, EMNLP.

[6]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[7]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[8]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[9]  Fermín L. Cruz,et al.  Explorando Twitter mediante la Integración de Información Estructurada y No Estructurada , 2015, Proces. del Leng. Natural.

[10]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[11]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[12]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[13]  Lluís F. Hurtado,et al.  Sentiment Analysis in Twitter for Spanish , 2014, NLDB.

[14]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[15]  José-Luis Sancho-Gómez,et al.  Word Normalization in Twitter Using Finite-state Transducers , 2013, Tweet-Norm@SEPLN.

[16]  Luis Alfonso Ureña López,et al.  Sentiment analysis in Twitter , 2012, Natural Language Engineering.

[17]  Alessandro Rozza,et al.  Modelling political disaffection from Twitter data , 2013, WISDOM '13.

[18]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[19]  Padmini Srinivasan,et al.  GOP primary season on twitter: "popular" political sentiment in social media , 2013, WSDM.

[20]  A. Smeaton,et al.  On Using Twitter to Monitor Political Sentiment and Predict Election Results , 2011 .

[21]  Fernando De la Torre,et al.  Facing Imbalanced Data--Recommendations for the Use of Performance Metrics , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[22]  Julio Villena-Román,et al.  Overview of TASS 2015 , 2015, TASS@SEPLN.

[23]  Timothy Baldwin,et al.  unimelb: Spanish Text Normalisation , 2013, Tweet-Norm@SEPLN.

[24]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[25]  Barry Smyth,et al.  Mining the real-time web: A novel approach to product recommendation , 2012, Knowl. Based Syst..

[26]  Ana-Maria Popescu,et al.  A Machine Learning Approach to Twitter User Classification , 2011, ICWSM.

[27]  Rim Faiz,et al.  Towards Events Tweet Contextualization Using Social Influence Model and Users Conversations , 2015, WIMS.

[28]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.