StarTrack: The Next Generation (of Product Review Management Tools)

Online product reviews are increasingly being recognized as a gold mine of information for determining product and brand positioning, and more and more companies look for ways of digging this gold mine for nuggets of knowledge that they can then bring to bear in decision making. We present a software system, called StarTrack, that automatically rates a product review according to a number of “stars,” i.e., according to how positive it is. In other words, given a text-only review (i.e., one with no explicit star-rating attached), StarTrack attempts to guess the star-rating that the reviewer would have attached to the review. StarTrack is thus useful for analysing unstructured word-of-mouth on products, such as the comments and reviews about products that are to be found in spontaneous discussion forums, such as newsgroups, blogs, and the like. StarTrack is based on machine learning technology, and as such does not require any re-programming for porting it from one product domain to another. Based on the star-ratings it attributes to reviews, StarTrack can subsequently rank the products in a given set according to how favourably they have been reviewed by consumers. We present controlled experiments in which we evaluate, on two large sets of product reviews crawled from the Web, the accuracy of StarTrack at (i) star-rating reviews, and (ii) ranking the reviewed products based on the automatically attributed star-ratings.

[1]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[2]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[3]  Ronald Fagin,et al.  Comparing and aggregating rankings with ties , 2004, PODS '04.

[4]  S. Sénécal,et al.  The influence of online product recommendations on consumers' online choices , 2004 .

[5]  Han Tong Loh,et al.  Gather customer concerns from online product reviews - A text summarization approach , 2009, Expert Syst. Appl..

[6]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[7]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[8]  Viktor Pekar,et al.  Discovery of subjective evaluations of product features in hotel reviews , 2008 .

[9]  Paul A. Pavlou,et al.  Overcoming the J-shaped distribution of product reviews , 2009, CACM.

[10]  Wei Chu,et al.  Support Vector Ordinal Regression , 2007, Neural Computation.

[11]  Sang-goo Lee,et al.  Feature-based Product Review Summarization Utilizing User Score , 2010, J. Inf. Sci. Eng..

[12]  Hsinchun Chen,et al.  A Lexicon-Enhanced Method for Sentiment Classification: An Experiment on Online Product Reviews , 2010, IEEE Intelligent Systems.

[13]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[14]  Yubo Chen,et al.  Online Consumer Review: Word-of-Mouth as a New Element of Marketing Communication Mix , 2004, Manag. Sci..

[15]  Kazutaka Shimada,et al.  Seeing Several Stars: A Rating Inference Task for a Document Containing Several Evaluation Criteria , 2008, PAKDD.

[16]  Alessandro Sperduti,et al.  Preferential text classification: learning algorithms and evaluation measures , 2008, Information Retrieval.

[17]  Peter R. R. White,et al.  The language of evaluation , 2005 .

[18]  Zhu Zhang,et al.  Utility scoring of product reviews , 2006, CIKM '06.

[19]  Barry Smyth,et al.  Using readability tests to predict helpful product reviews , 2010, RIAO.

[20]  Jong C. Park,et al.  Toward finer-grained sentiment identification in product reviews through linguistic and ontological analyses , 2009, ACL/IJCNLP.

[21]  Ee-Peng Lim,et al.  Detecting product review spammers using rating behaviors , 2010, CIKM.

[22]  Andrea Esuli,et al.  Evaluation Measures for Ordinal Text Classification , 2009 .

[23]  Kyung Hyan Yoo,et al.  Use and Impact of Online Travel Reviews , 2008, ENTER.

[24]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[25]  Jon Atle Gulla,et al.  Enhancing Negation-Aware Sentiment Classification on Product Reviews via Multi-Unigram Feature Generation , 2010, ICIC.

[26]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[27]  Chrysanthos Dellarocas,et al.  Exploring the value of online product reviews in forecasting sales: The case of motion pictures , 2007 .

[28]  Andrea Esuli,et al.  Feature selection for ordinal regression , 2010, SAC '10.

[29]  Jinhong Xie,et al.  Third-Party Product Review and Firm Marketing Strategy , 2005 .

[30]  Shlomo Argamon,et al.  Automatically Determining Attitude Type and Force for Sentiment Analysis , 2007, LTC.

[31]  Bin Gu,et al.  The Impact of Online Recommendations and Consumer Feedback on Sales , 2004, ICIS.

[32]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[33]  Xiaojin Zhu,et al.  Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .

[34]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[35]  Andrea Esuli,et al.  Evaluation Measures for Ordinal Regression , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[36]  Andrea Esuli,et al.  Multi-Faceted Rating of Product Reviews , 2009, ERCIM News.

[37]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[38]  Richong Zhang,et al.  An information gain-based approach for recommending useful product reviews , 2011, Knowledge and Information Systems.

[39]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[40]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[41]  Chien Chin Chen,et al.  Quality evaluation of product reviews using an information quality framework , 2011, Decis. Support Syst..

[42]  Jeannett Martin,et al.  The Language of Evaluation: Appraisal in English , 2005 .

[43]  Tim Macer,et al.  Cracking the Code: What customers say in their own words , 2007 .

[44]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[45]  Andrea Esuli,et al.  Feature Selection for Ordinal Text Classification , 2014, Neural Computation.

[46]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[47]  Claire Cardie,et al.  Finding Deceptive Opinion Spam by Any Stretch of the Imagination , 2011, ACL.

[48]  Bing Liu,et al.  Review spam detection , 2007, WWW '07.

[49]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .