Estimating Reputation Polarity on Microblog Posts

We find that the reputation polarity of a post is different from sentiment. We model reputation polarity using feature classes from communication theory. We introduce new features based on the replies to a post. We propose different ways to operationalise the RepLab 2012 and 2013 tasks.

In reputation management, knowing what impact a tweet has on the reputation of a brand or company is crucial. The reputation polarity of a tweet is a measure of how the tweet influences the reputation of a brand or company. We consider the task of automatically determining the reputation polarity of a tweet. For this classification task, we propose a feature-based model built on three dimensions: the source of the tweet, the contents of the tweet, and the reception of the tweet, i.e., how the tweet is perceived. For evaluation, we use the RepLab 2012 and 2013 datasets. We study and contrast three training scenarios. The first is independent of the entity whose reputation is being managed. The second depends on the entity at stake, but has over 90% fewer training samples per model, on average. The third depends on the domain of the entities. We find that reputation polarity is different from sentiment, and that having less but entity-dependent training data is significantly more effective for predicting the reputation polarity of a tweet than an entity-independent training scenario. Features related to the reception of a tweet perform significantly better than most other features.
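The three training scenarios differ only in how the labelled tweets are grouped before a model is trained on each group. The following is a minimal sketch of that grouping, not the authors' implementation: the record layout, the `partition` function, the scenario names, and the toy tweets are all hypothetical and are not taken from the RepLab datasets.

```python
from collections import defaultdict

# Hypothetical toy records: (entity, domain, text, label). Not RepLab data.
TWEETS = [
    ("BrandA", "automotive", "recall announced for new model", "negative"),
    ("BrandA", "automotive", "wins safety award", "positive"),
    ("BrandB", "automotive", "factory opens, jobs created", "positive"),
    ("BankC", "banking", "fined by regulator", "negative"),
]


def partition(tweets, scenario):
    """Group training tweets into one training set per model.

    entity_independent: a single model trained on all tweets.
    entity_dependent:   one model per entity (fewer samples per model).
    domain_dependent:   one model per domain of entities.
    """
    key_fns = {
        "entity_independent": lambda t: "all",
        "entity_dependent": lambda t: t[0],
        "domain_dependent": lambda t: t[1],
    }
    key_fn = key_fns[scenario]
    groups = defaultdict(list)
    for tweet in tweets:
        groups[key_fn(tweet)].append(tweet)
    return dict(groups)


if __name__ == "__main__":
    for name in ("entity_independent", "entity_dependent", "domain_dependent"):
        sizes = {key: len(group) for key, group in partition(TWEETS, name).items()}
        print(name, sizes)
```

Each group would then be passed to whatever classifier is used. The sketch only makes visible why the entity-dependent scenario leaves far fewer training samples per model than the entity-independent one.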
