A Comparative Study of Feature Extraction Algorithms in Customer Reviews

The paper systematically compares two feature extraction algorithms to mine product features commented on in customer reviews. The first approach [17] identifies candidate features by applying a set of POS patterns and pruning the candidate set based on the log likelihood ratio test. The second approach [11] applies association rule mining for identifying frequent features and a heuristic based on the presence of sentiment terms for identifying infrequent features. We evaluate the performance of the algorithms on five product specific document collections regarding consumer electronic devices. We perform an analysis of errors and discuss advantages and limitations of the algorithms.

[1]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[2]  R. Agarwal Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[3]  Janyce Wiebe,et al.  Recognizing subjectivity: a case study in manual tagging , 1999, Natural Language Engineering.

[4]  Janyce Wiebe,et al.  Development and Use of a Gold-Standard Data Set for Subjectivity Classifications , 1999, ACL.

[5]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[6]  Satoshi Morinaga,et al.  Mining product reputations on the Web , 2002, KDD.

[7]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[8]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[9]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[10]  Yuji Matsumoto,et al.  Collecting Evaluative Expressions for Opinion Extraction , 2004, IJCNLP.

[11]  Eric K. Ringger,et al.  Pulse: Mining Customer Opinions from Free Text , 2005, IDA.

[12]  Matthew Hurst,et al.  Deriving marketing intelligence from online discussion , 2005, KDD '05.

[13]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[14]  Guy Lapalme,et al.  Query-Based Summarization of Customer Reviews , 2007, Canadian Conference on AI.

[15]  Shlomo Argamon,et al.  Extracting Appraisal Expressions , 2007, NAACL.