Mining affective text to improve social media item recommendation

We propose a sentiment-aware social media recommendation framework.An ensemble learning-based method is proposed to classify sentiments from affective texts.We conduct comprehensive experiments to verify the effectiveness of the proposed methods. Social media websites, such as YouTube and Flicker, are currently gaining in popularity. A large volume of information is generated by online users and how to appropriately provide personalized content is becoming more challenging. Traditional recommendation models are overly dependent on preference ratings and often suffer from the problem of "data sparsity". Recent research has attempted to integrate sentiment analysis results of online affective texts into recommendation models; however, these studies are still limited. The one class collaborative filtering (OCCF) method is more applicable in the social media scenario yet it is insufficient for item recommendation. In this study, we develop a novel sentiment-aware social media recommendation framework, referred to as SA_OCCF, in order to tackle the above challenges. We leverage inferred sentiment feedback information and OCCF models to improve recommendation performance. We conduct comprehensive experiments on a real social media web site to verify the effectiveness of the proposed framework and methods. The results show that the proposed methods are effective in improving the performance of the baseline OCCF methods.

[1]  Likun Qiu,et al.  SELC: a self-supervised model for sentiment classification , 2009, CIKM.

[2]  Andrei Popescu-Belis,et al.  Sentiment analysis of user comments for one-class collaborative filtering over ted talks , 2013, SIGIR.

[3]  Liang Chen,et al.  Trust-aware media recommendation in heterogeneous social networks , 2013, World Wide Web.

[4]  Antal van den Bosch,et al.  Fusing Recommendations for Social Bookmarking Web Sites , 2011, Int. J. Electron. Commer..

[5]  Lars Schmidt-Thieme,et al.  MyMediaLite: a free recommender system library , 2011, RecSys '11.

[6]  Lisa DaVia Rubenstein,et al.  Using TED Talks to Inspire Thoughtful Practice , 2012 .

[7]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[8]  Xiaohui Yu,et al.  Collaborative Filtering with Aspect-Based Opinion Mining: A Tensor Factorization Approach , 2012, 2012 IEEE 12th International Conference on Data Mining.

[9]  Iryna Gurevych,et al.  Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations , 2009, TSA@CIKM.

[10]  Domonkos Tikk,et al.  Scalable Collaborative Filtering Approaches for Large Recommender Systems , 2009, J. Mach. Learn. Res..

[11]  Li Chen,et al.  Generating virtual ratings from chinese reviews to augment online recommendations , 2013, TIST.

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[15]  Janyce Wiebe,et al.  RECOGNIZING STRONG AND WEAK OPINION CLAUSES , 2006, Comput. Intell..

[16]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[17]  Ophir Frieder,et al.  Repeatable evaluation of search services in dynamic environments , 2007, TOIS.

[18]  Chun Chen,et al.  Using rich social media information for music recommendation via hypergraph model , 2011, TOMCCAP.

[19]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  James Bennett,et al.  The Netflix Prize , 2007 .

[21]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[22]  Yi-Cheng Zhang,et al.  Information filtering via weighted heat conduction algorithm , 2011 .

[23]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[24]  John K. Debenham,et al.  Informed Recommender: Basing Recommendations on Consumer Product Reviews , 2007, IEEE Intelligent Systems.

[25]  Anil K. Jain,et al.  Likelihood Ratio-Based Biometric Score Fusion , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[27]  Yi-Cheng Zhang,et al.  Tag-Aware Recommender Systems: A State-of-the-Art Survey , 2011, Journal of Computer Science and Technology.

[28]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[29]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[30]  Korris Fu-Lai Chung,et al.  A probabilistic rating inference framework for mining user preferences from reviews , 2011, World Wide Web.

[31]  Antal van den Bosch,et al.  Recommending scientific articles using citeulike , 2008, RecSys '08.

[32]  Rui Xia,et al.  Ensemble of feature sets and classification algorithms for sentiment classification , 2011, Inf. Sci..

[33]  Wolfgang Nejdl,et al.  How useful are your comments?: analyzing and predicting youtube comments and comment ratings , 2010, WWW '10.

[34]  Hsinchun Chen,et al.  Affect Analysis of Web Forums and Blogs Using Correlation Ensembles , 2008, IEEE Transactions on Knowledge and Data Engineering.

[35]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[36]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[37]  Bruno Ohana,et al.  Sentiment Classification of Reviews Using SentiWordNet , 2009 .

[38]  Hsinchun Chen,et al.  Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace , 2008, TOIS.

[39]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[40]  Jian Ma,et al.  Sentiment classification: The contribution of ensemble learning , 2014, Decis. Support Syst..

[41]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[42]  Qing Zhu,et al.  Improve Top-K Recommendation by Extending Review Analysis , 2012, APWeb.

[43]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[44]  Andrés Montoyo,et al.  Detecting Implicit Expressions of Sentiment in Text Based on Commonsense Knowledge , 2011, WASSA@ACL.

[45]  ChengXiang Zhai,et al.  Exploiting rich user information for one-class collaborative filtering , 2012, Knowledge and Information Systems.

[46]  Kevin C. Almeroth,et al.  Social computing: an intersection of recommender systems, trust/reputation systems, and social networks , 2012, IEEE Network.

[47]  Li Chen,et al.  Preference-based clustering reviews for augmenting e-commerce recommendation , 2013, Knowl. Based Syst..

[48]  Hongyan Liu,et al.  Combining user preferences and user opinions for accurate recommendation , 2013, Electron. Commer. Res. Appl..

[49]  Qiang Yang,et al.  One-Class Collaborative Filtering , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[50]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.

[51]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[52]  Marie-Francine Moens,et al.  A machine learning approach to sentiment analysis in multilingual Web texts , 2009, Information Retrieval.

[53]  Yehuda Koren,et al.  Modeling relationships at multiple scales to improve accuracy of large recommender systems , 2007, KDD '07.

[54]  Mike Thelwall,et al.  Scholars on soap boxes: Science communication and dissemination in TED videos , 2013, J. Assoc. Inf. Sci. Technol..

[55]  Andrei Popescu-Belis,et al.  Combining content with user preferences for TED lecture recommendation , 2013, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI).

[56]  Ling Liu,et al.  Encyclopedia of Database Systems , 2009, Encyclopedia of Database Systems.

[57]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[58]  Mike Thelwall,et al.  Sentiment in Twitter events , 2011, J. Assoc. Inf. Sci. Technol..

[59]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[60]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[61]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[62]  Berkant Barla Cambazoglu,et al.  Review of "Search Engines: Information Retrieval in Practice" by Croft, Metzler and Strohman , 2010, Inf. Process. Manag..

[63]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[64]  Joemon M. Jose,et al.  Handling data sparsity in collaborative filtering using emotion and semantic based features , 2011, SIGIR.

[65]  João Magalhães,et al.  A recommender system for the TV on the web: integrating unrated reviews and movie ratings , 2013, Multimedia Systems.