Cross-Domain Sentiment Analysis of Product Reviews by Combining Lexicon-Based and Learn-Based Techniques

Product reviews can direct consumers' purchasing behavior and sellers' marketing strategy. Therefor, in this paper, we propose a novel method which combines lexicon-based and learn-based techniques in order to solve cross-domain sentiment analysis of product reviews. We first create three domain lexicons based on the basic lexicon and corpus from three domains containing book, hotel and electronics. Furthermore, we use four kinds of features to build classifiers. Besides, we conduct a series of experiments to evaluate our method by using different lexicons and different classifiers. Experimental results on data of product reviews show that domain lexicons outperform the basic lexicon no matter in which domain. What's more, our method performs much better than the existing state-of-the-art method especially in domains of book and hotel, and is slightly inferior in the domian of electronics.

[1]  Jeffrey Pennington,et al.  Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions , 2011, EMNLP.

[2]  Mike Thelwall,et al.  Topic-based sentiment analysis for the social web: The role of mood and issue-related words , 2013, J. Assoc. Inf. Sci. Technol..

[3]  Stefan M. Rüger,et al.  Weakly Supervised Joint Sentiment-Topic Detection from Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[4]  Fei Song,et al.  Feature Selection for Sentiment Analysis Based on Content and Syntax Models , 2011, Decis. Support Syst..

[5]  TaboadaMaite,et al.  Lexicon-based methods for sentiment analysis , 2011 .

[6]  Songbo Tan,et al.  Optimizing modularity to identify semantic orientation of Chinese words , 2010, Expert Syst. Appl..

[7]  David Zimbra,et al.  Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network , 2013, Expert Syst. Appl..

[8]  Panagiotis Stamatopoulos,et al.  Summarization from Medical Documents: A Survey , 2005, Artif. Intell. Medicine.

[9]  Prem Melville,et al.  Sentiment analysis of blogs by combining lexical knowledge with text classification , 2009, KDD.

[10]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[11]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[12]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[13]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[14]  Jin Zhang,et al.  An empirical study of sentiment analysis for chinese documents , 2008, Expert Syst. Appl..

[15]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[16]  Japinder Singh,et al.  Feature-based opinion mining and ranking , 2012, J. Comput. Syst. Sci..

[17]  Songbo Tan,et al.  Combining learn-based and lexicon-based techniques for sentiment detection without using labeled examples , 2008, SIGIR '08.

[18]  Awais Athar,et al.  Sentiment Analysis of Citations using Sentence Structure-Based Features , 2011, ACL.

[19]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.