Mining Product Features from Online Reviews

With the advance of the Internet, e-commerce systems have become extremely important and convenient to human being. More and more products are sold on the web, and more and more people are purchasing products online. As a result, an increasing number of customers post product reviews at merchant websites and express their opinions and experiences in any network space such as Internet forums, discussion groups, and blogs. So there is a large amount of data records related to products on the Web, which are useful for both manufacturers and customers. Mining product reviews becomes a hot research topic, and prior researches mostly base on product features to analyze the opinions. So mining product features is the first step to further reviews processing. In this paper, we present how to mine product features. The proposed extraction approach is different from the previous methods because we only mine the features of the product in opinion sentences which the customers have expressed their positive or negative experiences on. In order to find opinion sentence, a SentiWordNet-based algorithm is proposed. There are three steps to perform our task: (1) identifying opinion sentences in each review which is positive or negative via SentiWordNet; (2) mining product features that have been commented on by customers from opinion sentences; (3) pruning feature to remove those incorrect features. Compared to previous work, our experimental result achieves higher precision and recall.

[1]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[2]  Iryna Gurevych,et al.  A Comparative Study of Feature Extraction Algorithms in Customer Reviews , 2008, 2008 IEEE International Conference on Semantic Computing.

[3]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[4]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[5]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[6]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[7]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[8]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[9]  Esko Ukkonen,et al.  Two Algorithms for Approximate String Matching in Static Texts , 1991, MFCS.

[10]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[11]  Chin-Yew Lin,et al.  Automated Text Summarization , 2005, IJCNLP.

[12]  Christopher Scaffidi,et al.  Application of a Probability-Based Algorithm to Extraction of Product Features from Online Reviews , 2006 .

[13]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[14]  Ryoji Kataoka,et al.  Opinion Sentence Search Engine on Open-Domain Blog , 2007, IJCAI.

[15]  F. Scientific Online Shopping , 1998, Current Biology.

[16]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[17]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[18]  Kathleen R. McKeown,et al.  Generating natural language summaries from multiple on-line sources , 1998 .

[19]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[20]  Jian Liu,et al.  Sentiment classification using phrase patterns , 2004, The Fourth International Conference onComputer and Information Technology, 2004. CIT '04..

[21]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[22]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[23]  Chao Wang,et al.  A semantic classification approach for online product reviews , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).