Learning Domain-Specific Sentiment Lexicons for Predicting Product Sales

Generic sentiment lexicons have been widely used for sentiment analysis these days. However, manually constructing sentiment lexicons is very time-consuming and it may not be feasible for certain application domains where annotation expertise is not available. One contribution of this paper is the development of a statistical learning based computational method for the automatic construction of domain-specific sentiment lexicons to enhance cross-domain sentiment analysis. Our initial experiments show that the proposed methodology can automatically generate domain-specific sentiment lexicons which contribute to improve the effectiveness of opinion retrieval at the document level. Another contribution of our work is that we show the feasibility of applying the sentiment metric derived based on the automatically constructed sentiment lexicons to predict product sales of certain product categories. Our research contributes to the development of more effective sentiment analysis system to extract business intelligence from numerous opinionated expressions posted to the Web.

[1]  Hongbo Xu,et al.  ICTNET at Blog Track TREC 2010 , 2010, TREC.

[2]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[3]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[4]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[5]  Songbo Tan,et al.  Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples , 2008, SIGIR '08.

[6]  Min Zhang,et al.  A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval , 2008, SIGIR '08.

[7]  Giorgio Gambosi,et al.  Automatic Construction of an Opinion-Term Vocabulary for Ad Hoc Retrieval , 2008, ECIR.

[8]  Craig MacDonald,et al.  Overview of the TREC 2007 Blog Track , 2007, TREC.

[9]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[10]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[11]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[12]  Raymond Y. K. Lau,et al.  Toward a Fuzzy Domain Ontology Extraction Method for Adaptive e-Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[13]  Xinying Xu,et al.  Hidden sentiment association in chinese web opinion mining , 2008, WWW.

[14]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[15]  Toshiki Kindo,et al.  Adaptive Personal Information Filtering System that Organizes Personal Profiles Automatically , 1997, IJCAI.

[16]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[17]  Janyce Wiebe,et al.  Just How Mad Are You? Finding Strong and Weak Opinion Clauses , 2004, AAAI.

[18]  Hui Zhang,et al.  WIDIT in TREC 2007 Blog Track: Combining Lexicon-Based Methods to Detect Opinionated Blogs , 2007, TREC.

[19]  Oren Etzioni,et al.  OPINE: Extracting Product Features and Opinions from Reviews , 2005, HLT/EMNLP.

[20]  Kam-Fai Wong,et al.  An intelligent information agent for document title classification and filtering in document-intensive domains , 2007, Decis. Support Syst..

[21]  Bin Li,et al.  UTDallas at TREC 2008 Blog Track , 2008, TREC.

[22]  Chris H. Q. Ding,et al.  Knowledge transformation for cross-domain sentiment classification , 2009, SIGIR.

[23]  Songbo Tan,et al.  A novel scheme for domain-transfer problem in the context of sentiment analysis , 2007, CIKM '07.

[24]  Andrea Esuli,et al.  Determining the semantic orientation of terms through gloss classification , 2005, CIKM '05.

[25]  Raymond Y. K. Lau,et al.  Leveraging the web context for context-sensitive opinion mining , 2009, 2009 2nd IEEE International Conference on Computer Science and Information Technology.

[26]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[27]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[28]  Diego Reforgiato Recupero,et al.  AVA: Adjective-Verb-Adverb Combinations for Sentiment Analysis , 2008, IEEE Intelligent Systems.

[29]  Raymond Y. K. Lau,et al.  Towards a belief-revision-based adaptive and context-sensitive information retrieval system , 2008, TOIS.

[30]  Panagiotis G. Ipeirotis,et al.  Show me the money!: deriving the pricing power of product features by mining consumer reviews , 2007, KDD '07.

[31]  Panagiotis G. Ipeirotis,et al.  Designing novel review ranking systems: predicting the usefulness and impact of reviews , 2007, ICEC.

[32]  Craig MacDonald,et al.  An effective statistical approach to blog post opinion retrieval , 2008, CIKM '08.