Exploring demographic information in social media for product recommendation

On many e-commerce Web sites, product recommendation is essential for improving user experience and boosting sales. Most existing product recommender systems rely on consumers’ historical transaction records or Web-site browsing history to predict online users’ preferences. As such, they are constrained by the limited information available on a specific e-commerce Web site. With the prolific use of social media platforms, it is now possible to extract product demographics from online product reviews and from social networks built from microblogs. Moreover, users’ public profiles on social media often reveal demographic attributes such as age, gender, and education. In this paper, we propose to leverage the demographic information of both products and users, extracted from social media, for product recommendation. Specifically, we frame recommendation as a learning-to-rank problem that takes as input features derived from both product and user demographics. An ensemble method based on gradient-boosting regression trees is extended to suit our recommendation task. We have conducted extensive experiments to obtain both quantitative and qualitative evaluation results, as well as a user study to gauge the performance of the proposed recommender system in a real-world deployment. All the results show that our system generates recommendations that match users’ preferences better than competitive baselines do.
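As a rough illustration of the learning-to-rank framing described above, the following minimal sketch scores (user, product) pairs with gradient-boosted regression stumps. It is not the extended ensemble method of the paper: the feature definition (the share of a product's audience that has the user's attribute value), the toy data, the squared-error pointwise loss, and all names such as `match_features`, `fit_gbrt`, and `rank` are assumptions made here for illustration only.

```python
# Hypothetical attribute set; the abstract names age, gender, and education.
ATTRIBUTES = ["gender", "age"]

def match_features(user, product):
    """Assumed feature: share of the product's audience with the user's attribute value."""
    return [product.get(a, {}).get(user.get(a), 0.0) for a in ATTRIBUTES]

def fit_stump(X, residuals):
    """Pick the one-feature threshold split that minimizes squared error on the residuals."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({x[j] for x in X}):
            left = [r for x, r in zip(X, residuals) if x[j] <= t]
            right = [r for x, r in zip(X, residuals) if x[j] > t]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, t, lm, rm)
    if best is None:  # no valid split: fall back to a constant prediction
        mean = sum(residuals) / len(residuals)
        return 0, float("inf"), mean, mean
    return best[1:]

def fit_gbrt(X, y, n_trees=50, lr=0.1):
    """Plain gradient boosting with squared loss: each stump fits the current residuals."""
    base = sum(y) / len(y)
    preds = [base] * len(y)
    stumps = []
    for _ in range(n_trees):
        residuals = [yi - pi for yi, pi in zip(y, preds)]
        j, t, lm, rm = fit_stump(X, residuals)
        stumps.append((j, t, lm, rm))
        preds = [p + lr * (lm if x[j] <= t else rm) for x, p in zip(X, preds)]
    return base, stumps, lr

def score(model, x):
    base, stumps, lr = model
    return base + sum(lr * (lm if x[j] <= t else rm) for j, t, lm, rm in stumps)

def rank(model, user, products):
    """Return product names sorted by predicted relevance, best first."""
    return sorted(products,
                  key=lambda name: score(model, match_features(user, products[name])),
                  reverse=True)

# Toy product demographics (made-up numbers, not taken from the paper).
products = {
    "lipstick": {"gender": {"female": 0.9, "male": 0.1},
                 "age": {"18-24": 0.5, "25-34": 0.4, "35+": 0.1}},
    "razor": {"gender": {"female": 0.2, "male": 0.8},
              "age": {"18-24": 0.3, "25-34": 0.4, "35+": 0.3}},
}
# Toy training users with the single product each one purchased.
train = [
    ({"gender": "female", "age": "18-24"}, "lipstick"),
    ({"gender": "male", "age": "25-34"}, "razor"),
    ({"gender": "female", "age": "25-34"}, "lipstick"),
    ({"gender": "male", "age": "35+"}, "razor"),
]
X, y = [], []
for user, bought in train:
    for name, demo in products.items():
        X.append(match_features(user, demo))
        y.append(1.0 if name == bought else 0.0)

model = fit_gbrt(X, y)
print(rank(model, {"gender": "female", "age": "18-24"}, products))
print(rank(model, {"gender": "male", "age": "35+"}, products))
```

The pointwise squared-error objective is chosen here only because it keeps the sketch short; pairwise or listwise ranking losses would plug into the same boosting loop by changing how the residuals are computed.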
