Detecting Spam Review through Sentiment Analysis

Online review can help people getting more information about store and product. The potential customers tend to make decision according to it. However, driven by profit, spammers post spurious reviews to mislead the customers by promoting or demoting target store. Previous studies mainly utilize rating as indicator for the detection. However, these studies ignore an important problem that the rating will not necessarily represent the sentiment accurately. In this paper, we first incorporate the sentiment analysis techniques into review spam detection. The proposed method compute sentiment score from the natural language text by a shallow dependency parser. We further discuss the relationship between sentiment score and spam reviews. A series of discriminative rules are established through intuitive observation. In the end, this paper establishes a time series combined with discriminative rules to detect the spam store and spam review efficiently. Experimental results show that the proposed methods in this paper have good detection result and outperform existing methods.

[1]  Philip S. Yu,et al.  Review spam detection via temporal pattern discovery , 2012, KDD.

[2]  Aoying Zhou,et al.  Exploiting shopping and reviewing behavior to re-score online evaluations , 2012, WWW.

[3]  Xuanjing Huang,et al.  Mining product reviews based on shallow dependency parsing , 2009, SIGIR.

[4]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[5]  Carolyn Penstein Rosé,et al.  Generalizing Dependency Features for Opinion Mining , 2009, ACL.

[6]  Desheng Dash Wu,et al.  Using text mining and sentiment analysis for online forums hotspot detection and forecast , 2010, Decis. Support Syst..

[7]  Arjun Mukherjee,et al.  Spotting fake reviewer groups in consumer reviews , 2012, WWW.

[8]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[9]  Philip S. Yu,et al.  Identify Online Store Review Spammers via Social Review Graph , 2012, TIST.

[10]  Alan F. Smeaton,et al.  Classifying sentiment in microblogs: is brevity an advantage? , 2010, CIKM.

[11]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[12]  Christopher S. G. Khoo,et al.  Aspect-based sentiment analysis of movie reviews on discussion boards , 2010, J. Inf. Sci..

[13]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[14]  Bruno Ohana,et al.  Sentiment Classification of Reviews Using SentiWordNet , 2009 .

[15]  Bing Liu,et al.  Mining Comparative Sentences and Relations , 2006, AAAI.

[16]  Xuanjing Huang,et al.  Phrase Dependency Parsing for Opinion Mining , 2009, EMNLP.

[17]  Claire Cardie,et al.  Finding Deceptive Opinion Spam by Any Stretch of the Imagination , 2011, ACL.

[18]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[19]  Bing Liu,et al.  Review spam detection , 2007, WWW '07.

[20]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[21]  Hongfei Yan,et al.  Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid , 2010, EMNLP.

[22]  Ee-Peng Lim,et al.  Detecting product review spammers using rating behaviors , 2010, CIKM.

[23]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[24]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[25]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[26]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[27]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[28]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[29]  Ee-Peng Lim,et al.  Finding unusual review patterns using unexpected rules , 2010, CIKM.

[30]  Maria Leonor Pacheco,et al.  of the Association for Computational Linguistics: , 2001 .

[31]  Mark Levene,et al.  Combining lexicon and learning based approaches for concept-level sentiment analysis , 2012, WISDOM '12.