Analyzing Product Comparisons on Discussion Boards

Product discussion boards are a rich source of information about consumer sentiment about products, which is being increasingly exploited. Most sentiment analysis has looked at single products in isolation, but users often compare different products, stating which they like better and why. We present a set of techniques for analyzing how consumers view product markets. Specifically, we extracted relative sentiment analysis and comparisons between products, to understand what attributes users compare products on, and which products they prefer on each dimension. We illustrate these methods in an extended case study analyzing the sedan car markets.

[1]  Jacob Goldenberg,et al.  Extracting Product Comparisons from Discussion Boards , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[2]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[3]  B. Schwartz The Paradox of Choice: Why More Is Less , 2004 .

[4]  Andrew McCallum,et al.  Information Extraction , 2005, ACM Queue.

[5]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[6]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[7]  Ronen Feldman,et al.  Using Corpus Statistics on Entities to Improve Semi-supervised Relation Extraction from the Web , 2007, ACL.

[8]  Bing Liu,et al.  Identifying comparative sentences in text documents , 2006, SIGIR.

[9]  Ronen Feldman,et al.  Visualization Techniques to Explore Data Mining Results for Document Collections , 1997, KDD.

[10]  Ralph Weischedel,et al.  Named Entity Extraction from Broadcast News , 1999 .

[11]  Wei Li,et al.  Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[12]  Soo-Min Kim,et al.  Identifying and Analyzing Judgment Opinions , 2006, NAACL.

[13]  Yonatan Aumann,et al.  Maximal Association Rules: A New Tool for Mining for Keyword Co-Occurrences in Document Collections , 1997, KDD.

[14]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[15]  Sabine Buchholz,et al.  Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[16]  P. Green,et al.  Analyzing multivariate data , 1978 .

[17]  Ronen Feldman,et al.  TEG—a hybrid approach to information extraction , 2005, Knowledge and Information Systems.

[18]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[19]  Timothy Chklovski,et al.  Deriving quantitative overviews of free text assessments on the web , 2006, IUI '06.

[20]  Bing Liu,et al.  Mining Comparative Sentences and Relations , 2006, AAAI.

[21]  Alexander A. Morgan,et al.  BioCreAtIvE Task 1A: gene mention finding evaluation , 2005, BMC Bioinformatics.

[22]  Andrea Esuli,et al.  Determining the semantic orientation of terms through gloss classification , 2005, CIKM '05.

[23]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[24]  Michelle L. Gregory,et al.  User-directed Sentiment Analysis: Visualizing the Affective Content of Documents , 2006 .