Modeling impression discounting in large-scale recommender systems

Recommender systems have become very important for many online activities, such as watching movies, shopping for products, and connecting with friends on social networks. User behavioral analysis and user feedback (both explicit and implicit) modeling are crucial for the improvement of any online recommender system. Widely adopted recommender systems at LinkedIn such as "People You May Know" and "Endorsements" are evolving by analyzing user behaviors on impressed recommendation items. In this paper, we address modeling impression discounting of recommended items, that is, how to model user's no-action feedback on impressed recommended items. The main contributions of this paper include (1) large-scale analysis of impression data from LinkedIn and KDD Cup; (2) novel anti-noise regression techniques, and its application to learn four different impression discounting functions including linear decay, inverse decay, exponential decay, and quadratic decay; (3) applying these impression discounting functions to LinkedIn's "People You May Know" and "Endorsements" recommender systems.

[1]  G. Styan Hadamard products and multivariate statistical analysis , 1973 .

[2]  M. Wand,et al.  Multivariate Locally Weighted Least Squares Regression , 1994 .

[3]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[4]  UsingSmoothing SplinesbyXihong Liny,et al.  Inference in Generalized Additive Mixed Models , 1999 .

[5]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[6]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[7]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[8]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[9]  Pearl Pu,et al.  A recursive prediction algorithm for collaborative filtering recommender systems , 2007, RecSys '07.

[10]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[11]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[12]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[13]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[14]  Deepak Agarwal,et al.  Spatio-temporal models for estimating click-through rate , 2009, WWW '09.

[15]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[16]  Shuang-Hong Yang,et al.  Collaborative competitive filtering: learning recommender using context of user choice , 2011, SIGIR.

[17]  Abhimanyu Das,et al.  Factoring past exposure in display advertising targeting , 2012, KDD.

[18]  Diyi Yang,et al.  Local implicit feedback mining for music recommendation , 2012, RecSys.

[19]  Francesco Ricci,et al.  Optimal radio channel recommendations with explicit and implicit feedback , 2012, RecSys.

[20]  Foster J. Provost,et al.  Machine learning for targeted display advertising: transfer learning in action , 2013, Machine Learning.

[21]  Laks V. S. Lakshmanan,et al.  KeySee: supporting keyword search on evolving events in social streams , 2013, KDD.

[22]  Yixin Chen,et al.  Density-based logistic regression , 2013, KDD.

[23]  Vikas K. Garg,et al.  Adaptivity to Local Smoothness and Dimension in Kernel Regression , 2013, NIPS.

[24]  Sam Shah,et al.  The big data ecosystem at LinkedIn , 2013, SIGMOD '13.

[25]  James Bailey,et al.  Time Series Forecasting Using Distribution Enhanced Linear Regression , 2013, PAKDD.

[26]  Laks V. S. Lakshmanan,et al.  Incremental cluster evolution tracking from highly dynamic network data , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[27]  Foster Provost,et al.  Evaluating and Optimizing Online Advertising: Forget the Click, but There Are Good Proxies , 2015, Big Data.