URWF: user reputation based weightage framework for Twitter micropost classification

Sentiment analysis is an emerging field that helps in understanding the sentiments of users on microblogging sites. Researchers have proposed many sentiment analysis techniques that classify and analyze the sentiments in microposts posted by various users. Most of these techniques perform only text-based classification, which does not allow the impact of a micropost to be predicted. Further, the huge volume of online content produced each day is very difficult to analyze. Therefore, an effective sentiment analysis technique is required that not only performs precise text-based classification but also eases the analysis by reducing the volume of data. Moreover, micropost impact must be determined in order to segregate the high-impact microposts in a corpus. In the present study, we present a sentiment analysis framework that can incorporate any text-based classification and that separates high-impact microposts from low-impact ones by calculating a user reputation factor. This user reputation is calculated from multiple factors relating to user activities, which may help organizations learn customer opinions and views about their products and services. Because only microposts posted by high-impact users are considered, the volume of data to be analyzed becomes small. For precise sentiment classification, multiple text classification classes are introduced instead of just positive, negative and neutral. The proposed framework also calculates the accumulated weight of each micropost by multiplying the user reputation by the assigned sentiment score. The user reputation calculation factors are validated using the Spearman rho and Kendall tau correlation coefficients. The framework is further evaluated using the Sanders topic-based corpus, and the results are presented.
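
The weighting step described in the abstract can be illustrated with a minimal sketch. It assumes the user reputation values are already available and that a text-based classifier has assigned each micropost a numeric sentiment score. The names `Micropost`, `accumulated_weight` and `high_impact`, the reputation scale, the score scale and the threshold are all hypothetical and chosen only for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Micropost:
    user: str
    text: str
    # Score assigned by any text-based classifier (scale is an assumption).
    sentiment_score: float


def accumulated_weight(post: Micropost, reputation: dict[str, float]) -> float:
    """Accumulated weight = user reputation * assigned sentiment score."""
    return reputation[post.user] * post.sentiment_score


def high_impact(posts: list[Micropost], reputation: dict[str, float],
                threshold: float) -> list[Micropost]:
    """Keep only microposts whose author meets a (hypothetical) reputation threshold."""
    return [p for p in posts if reputation[p.user] >= threshold]


# Illustrative data only; reputation values are taken as given inputs here.
reputation = {"alice": 0.75, "bob": 0.25}
posts = [
    Micropost("alice", "Battery life on this phone is excellent", 2.0),
    Micropost("bob", "Screen is a bit dim", -1.0),
]

weights = [accumulated_weight(p, reputation) for p in posts]
kept = high_impact(posts, reputation, threshold=0.5)
```

With these illustrative inputs, filtering by reputation leaves only the first micropost to be analyzed, which is the data reduction the abstract describes.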
