Prodweakfinder: an Information Extraction System for Detecting Product Weaknesses in Online Reviews Based on Sentiment Analysis

Defective products cannot meet the needs of consumers, and they also bring security risks. Prior product weakness detection methods resort to time and money consuming questionnaire, and the accuracy could not be guaranteed. As a result, this paper proposes a novel opinion-aware approach, PRODWeakFinder, which extracts product weaknesses from online reviews through sentiment analysis. PRODWeakFinder detects product weakness by considering both comparative and non-comparative evaluations in online reviews. For comparative evaluation, we build a feature comparison network, where the authority score of each node is assessed. For non-comparative evaluation, sentiment score is calculated through sentiment analysis. The composite score is calculated by combing both comparative and non-comparative evaluations. The features with lower score represent potential product weaknesses. The experiment result shows that PRODWeakFinder outperforms baseline methods in terms of both accuracy and reducing of operating costs.

[1]  Hsinchun Chen,et al.  Selecting Attributes for Sentiment Classification Using Feature Relation Networks , 2011, IEEE Transactions on Knowledge and Data Engineering.

[2]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[3]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[4]  Li Chen,et al.  Comparison of feature-level learning methods for mining online consumer reviews , 2012, Expert Syst. Appl..

[5]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[6]  Stephen Shaoyi Liao,et al.  Classifying Consumer Comparison Opinions to Uncover Product Strengths and Weaknesses , 2011, Int. J. Intell. Inf. Technol..

[7]  Martin Ester,et al.  Opinion digger: an unsupervised opinion miner from unstructured product reviews , 2010, CIKM.

[8]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[9]  Bing Liu,et al.  Identifying comparative sentences in text documents , 2006, SIGIR.

[10]  Janyce Wiebe,et al.  A Corpus Study of Evaluative and Speculative Language , 2001, SIGDIAL Workshop.

[11]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[12]  Zheng Chen,et al.  CWS: a comparative web search system , 2006, WWW '06.

[13]  R. Groves Nonresponse Rates and Nonresponse Bias in Household Surveys , 2006 .

[14]  Hua Xu,et al.  Weakness Finder: Find product weakness from Chinese reviews by using aspects based sentiment analysis , 2012, Expert Syst. Appl..

[15]  Jian Liu,et al.  Sentiment Classification Using Information Extraction Technique , 2005, IDA.

[16]  Songbo Tan,et al.  A survey on sentiment detection of reviews , 2009, Expert Syst. Appl..

[17]  Chien Chin Chen,et al.  An Unsupervised Approach for Person Name Bipolarization Using Principal Component Analysis , 2012, IEEE Transactions on Knowledge and Data Engineering.

[18]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[19]  Zhu Zhang,et al.  Product Comparison Networks for Competitive Analysis of Online Word-of-Mouth , 2013, TMIS.

[20]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[21]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[22]  J. Keziya Rani,et al.  Mining Opinion Features in Customer Reviews. , 2016 .

[23]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[24]  E. D. de Leeuw,et al.  Trends in household survey nonresponse: a longitudinal and international comparison , 2002 .

[25]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[26]  Kyoungok Kim,et al.  Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction , 2014, Pattern Recognit..

[27]  Zhu Zhang,et al.  Deciphering word-of-mouth in social media: Text-based metrics of consumer reviews , 2012, TMIS.

[28]  Chunqiang Tang,et al.  Answering relationship queries on the web , 2007, WWW '07.

[29]  Bing Liu,et al.  Opinion Extraction and Summarization on the Web , 2006, AAAI.

[30]  Liu Quan-sheng Mining Chinese comparative sentences and relations based on CRF algorithm , 2010 .

[31]  Janyce Wiebe,et al.  Recognizing subjectivity: a case study in manual tagging , 1999, Natural Language Engineering.

[32]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[33]  M. Trusov,et al.  Estimating Aggregate Consumer Preferences from Online Product Reviews , 2010 .