Identifying Helpful Online Reviews with Word Embedding Features

The advent of Web 2.0 has enabled users to share their opinions via various social media websites. People’s decision-making process is strongly influenced by online reviews. Predicting the helpfulness of reviews can help to save time and find helpful suggestions. However, most of previous works focused on exploring new features with external data source, such as user’s profile, semantic dictionaries, etc. In this paper, we maintain that the helpfulness of an online review can be predicted by knowing only word embedding information. Word embedding information is a kind of word semantic representation computed with word context. We hypothesize that word embedding information would allow us to accurately predict the helpfulness of an online review. The experiments were conducted to prove this hypothesis and the results showed a substantial improvement compared with baselines of features previously used.

[1]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[2]  Forrest Sheng Bao,et al.  Semantic Analysis and Helpfulness Prediction of Text for Online Product Reviews , 2015, ACL.

[3]  Ming Gao,et al.  Review Comment Analysis for Predicting Ratings , 2015, WAIM.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Ming Zhou,et al.  Low-Quality Product Review Detection in Opinion Summarization , 2007, EMNLP.

[6]  Zhu Zhang,et al.  Utility scoring of product reviews , 2006, CIKM '06.

[7]  Srikumar Krishnamoorthy,et al.  Linguistic features for review helpfulness prediction , 2015, Expert Syst. Appl..

[8]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[9]  Ari Rappoport,et al.  RevRank: A Fully Unsupervised Algorithm for Selecting the Most Helpful Book Reviews , 2009, ICWSM.

[10]  Sangjae Lee,et al.  Predicting the helpfulness of online reviews using multilayer perceptron neural networks , 2014, Expert Syst. Appl..

[11]  Geert-Jan Houben,et al.  Identification of useful user comments in social media: a case study on flickr commons , 2013, JCDL '13.

[12]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[13]  Deepak Agarwal,et al.  Personalized Recommendation of User Comments via Factor Models , 2011, EMNLP.

[14]  Andrew Whinston,et al.  The Dynamics of Online Word-of-Mouth and Product Sales: An Empirical Investigation of the Movie Industry , 2008 .

[15]  Chien Chin Chen,et al.  Quality evaluation of product reviews using an information quality framework , 2011, Decis. Support Syst..

[16]  Guodong Zhou,et al.  What reviews are satisfactory: novel features for automatic helpfulness voting , 2012, SIGIR '12.

[17]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[18]  Soo-Min Kim,et al.  Automatically Assessing Review Helpfulness , 2006, EMNLP.

[19]  Wolfgang Nejdl,et al.  How useful are your comments?: analyzing and predicting youtube comments and comment ratings , 2010, WWW '10.

[20]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[21]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[22]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[23]  Richard Y. K. Fung,et al.  Identifying helpful online reviews: A product designer's perspective , 2013, Comput. Aided Des..

[24]  Peng Wang,et al.  Semantic Clustering and Convolutional Neural Network for Short Text Categorization , 2015, ACL.

[25]  Jahna Otterbacher,et al.  'Helpfulness' in online communities: a measure of message quality , 2009, CHI.

[26]  Diane J. Litman,et al.  Automatically Predicting Peer-Review Helpfulness , 2011, ACL.

[27]  Léon Bottou,et al.  From machine learning to machine reasoning , 2011, Machine Learning.