Opinion-enhanced collaborative filtering for recommender systems through sentiment analysis

The motivation of collaborative filtering (CF) comes from the idea that people often get the best recommendations from someone with similar tastes. With the growing popularity of opinion-rich resources such as online reviews, new opportunities arise as we can identify the preferences from user opinions. The main idea of our approach is to elicit user opinions from online reviews, and map such opinions into preferences that can be understood by CF-based recommender systems. We divide recommender systems into two types depending on the number of product category recommended: the multiple-category recommendation and the single-category recommendation. For the former, sentiment polarity in coarse-grained manner is identified while for the latter fine-grained sentiment analysis is conducted for each product aspect. If the evaluation frequency for an aspect by a user is greater than the average frequency by all users, it indicates that the user is more concerned with that aspect. If a user's rating for an aspect is lower than the average rating by all users, he or she is much pickier than others on that aspect. Through sentiment analysis, we then build an opinion-enhanced user preference model, where the higher the similarity between user opinions the more consistent preferences between users are. Experiment results show that the proposed CF algorithm outperforms baseline methods for product recommendation in terms of accuracy and recall.

[1]  Mehran Sahami,et al.  Evaluating similarity measures: a large-scale study in the orkut social network , 2005, KDD '05.

[2]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[3]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[4]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[5]  David Schuff,et al.  What Makes a Helpful Review? A Study of Customer Reviews on Amazon.com , 2010 .

[6]  SchuffDavid,et al.  What makes a helpful online review? a study of customer reviews on amazon.com , 2010 .

[7]  R. Merton The Matthew effect in science. The reward and communication systems of science are considered. , 1968, Science.

[8]  Mingyu Lu,et al.  Increasing Serendipity of Recommender System with Ranking Topic Model , 2014 .

[9]  Panagiotis G. Ipeirotis,et al.  Show me the money!: deriving the pricing power of product features by mining consumer reviews , 2007, KDD '07.

[10]  Cane Wing-ki Leung,et al.  Integrating Collaborative Filtering and Sentiment Analysis: A Rating Inference Approach , 2006 .

[11]  James Nga-Kwok Liu,et al.  Sentiment classification of online reviews: using sentence-based language model , 2014, J. Exp. Theor. Artif. Intell..

[12]  Naixue Xiong,et al.  Cold-Start Recommendation Using Bi-Clustering and Fusion for Large-Scale Social Recommender Systems , 2014, IEEE Transactions on Emerging Topics in Computing.

[13]  G. Somprasertsri,et al.  A maximum entropy model for product feature extraction in online customer reviews , 2008, 2008 IEEE Conference on Cybernetics and Intelligent Systems.

[14]  Miguel Ángel García Cumbreras,et al.  Pessimists and optimists: Improving collaborative filtering through sentiment analysis , 2013, Expert Syst. Appl..

[15]  Minyi Guo,et al.  An Efficient Collaborative Filtering Approach Using Smoothing and Fusing , 2009, 2009 International Conference on Parallel Processing.

[16]  Marcelo Freitas,et al.  Using Social Network Information to Identify User Contexts for Query Personalization , 2013 .

[17]  Hiroshi Nakagawa,et al.  A Simple but Powerful Automatic Term Extraction Method , 2002, COLING 2002.

[18]  Kartik Hosanagar,et al.  Recommender systems and their impact on sales diversity , 2007, EC '07.

[19]  XianghuaFu,et al.  Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon , 2013 .

[20]  Xianghua Fu,et al.  Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon , 2013, Knowl. Based Syst..

[21]  Jan Zizka,et al.  Grouping of Customer Opinions Written in Natural Language Using Unsupervised Machine Learning , 2012, 2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[22]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[23]  Hui Song,et al.  Extracting product features from online reviews for sentimental analysis , 2011, 2011 6th International Conference on Computer Sciences and Convergence Information Technology (ICCIT).

[24]  Hongwei Wang,et al.  Feature–opinion pair identification of product reviews in Chinese: a domain ontology modeling method , 2013, New Rev. Hypermedia Multim..

[25]  Hiroshi Kanayama,et al.  Unsupervised lexicon induction for clause-level detection of evaluations , 2012, Nat. Lang. Eng..

[26]  Kyoungok Kim,et al.  Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction , 2014, Pattern Recognit..

[27]  Mohd Abdul Hameed,et al.  Supervised Opinion Mining of Social Network Data Using a Bag-of-Words Approach on the Cloud , 2012, BIC-TA.

[28]  Rui Jiang,et al.  Constructing a user similarity network to remove adverse influence of popular objects for personalized recommendation , 2013, Expert Syst. Appl..

[29]  Korris Fu-Lai Chung,et al.  A probabilistic rating inference framework for mining user preferences from reviews , 2011, World Wide Web.

[30]  Andrei Z. Broder,et al.  Anatomy of the long tail: ordinary people with extraordinary tastes , 2010, WSDM '10.

[31]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[32]  Hongyan Liu,et al.  Combining user preferences and user opinions for accurate recommendation , 2013, Electron. Commer. Res. Appl..

[33]  Licia Capra,et al.  The effect of correlation coefficients on communities of recommenders , 2008, SAC '08.

[34]  Junghoo Cho,et al.  Impact of search engines on page popularity , 2004, WWW '04.

[35]  Bing Liu,et al.  Identifying comparative sentences in text documents , 2006, SIGIR.

[36]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[37]  Jiunn-Liang Guo,et al.  AN OPINION FEATURE EXTRACTION APPROACH BASED ON A MULTIDIMENSIONAL SENTENCE ANALYSIS MODEL , 2013, Cybern. Syst..

[38]  X. Zhang,et al.  Impact of Online Consumer Reviews on Sales: The Moderating Role of Product and Consumer Characteristics , 2010 .

[39]  Janyce Wiebe,et al.  Recognizing subjectivity: a case study in manual tagging , 1999, Natural Language Engineering.

[40]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[41]  Pattarachai Lalitrojwong,et al.  Mining Feature-Opinion in Online Customer Reviews for Opinion Summarization , 2010, J. Univers. Comput. Sci..

[42]  Rada Mihalcea,et al.  Computational approaches to subjectivity and sentiment analysis: Present and envisaged methods and applications , 2014, Comput. Speech Lang..

[43]  Guo Fang Kuang,et al.  The Development of Building Materials Recommendation System Based on Collaborative Filtering , 2013 .

[44]  A. Chris,et al.  Long Tail: How Endless Choice is Creating Unlimited Demand, London: Random House. , 1996 .

[45]  Ning Zheng,et al.  An automatic product features extracting method in Chinese customer reviews , 2012, SoSE 2012.

[46]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[47]  Athanasios V. Vasilakos,et al.  Predicting location using mobile phone calls , 2012, CCRV.

[48]  SangKeun Lee,et al.  Semantic Aspect Discovery for Online Reviews , 2012, 2012 IEEE 12th International Conference on Data Mining.

[49]  Chia-Ching Lin,et al.  Applying social bookmarking to collective information searching (CIS): An analysis of behavioral pattern and peer interaction for co-exploring quality online resources , 2011, Comput. Hum. Behav..

[50]  Mike Thelwall,et al.  Twitter, MySpace, Digg: Unsupervised Sentiment Analysis in Social Media , 2012, TIST.

[51]  M. de Rijke,et al.  Predicting IMDB Movie Ratings Using Social Media , 2012, ECIR.

[52]  Mehrbakhsh Nilashi,et al.  Hybrid recommendation approaches for multi-criteria collaborative filtering , 2014, Expert Syst. Appl..

[53]  D TurneyPeter,et al.  Measuring praise and criticism , 2003 .

[54]  Xiaolin Zheng,et al.  Personalized Recommendation Based on Reviews and Ratings Alleviating the Sparsity Problem of Collaborative Filtering , 2012, ICEBE.

[55]  Chien Chin Chen,et al.  An Unsupervised Approach for Person Name Bipolarization Using Principal Component Analysis , 2012, IEEE Transactions on Knowledge and Data Engineering.

[56]  Qun Liu,et al.  基於《知網》的辭彙語義相似度計算 (Word Similarity Computing Based on How-net) [In Chinese] , 2002, ROCLING/IJCLCLP.

[57]  Chris Anderson The long tail : how endless choice is creating unlimited demand , 2006 .

[58]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.