Focused matrix factorization for audience selection in display advertising

Audience selection is a key problem in display advertising systems in which we need to select a list of users who are interested (i.e., most likely to buy) in an advertising campaign. The users' past feedback on this campaign can be leveraged to construct such a list using collaborative filtering techniques such as matrix factorization. However, the user-campaign interaction is typically extremely sparse, hence the conventional matrix factorization does not perform well. Moreover, simply combining the users feedback from all campaigns does not address this since it dilutes the focus on target campaign in consideration. To resolve these issues, we propose a novel focused matrix factorization model (FMF) which learns users' preferences towards the specific campaign products, while also exploiting the information about related products. We exploit the product taxonomy to discover related campaigns, and design models to discriminate between the users' interest towards campaign products and non-campaign products. We develop a parallel multi-core implementation of the FMF model and evaluate its performance over a real-world advertising dataset spanning more than a million products. Our experiments demonstrate the benefits of using our models over existing approaches.

[1]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[2]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[3]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[4]  Lars Schmidt-Thieme,et al.  Pairwise interaction tensor factorization for personalized tag recommendation , 2010, WSDM '10.

[5]  Rajesh Parekh,et al.  Large-Scale Customized Models for Advertisers , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[6]  Jerry Nedelman,et al.  Book review: “Bayesian Data Analysis,” Second Edition by A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin Chapman & Hall/CRC, 2004 , 2005, Comput. Stat..

[7]  Léon Bottou,et al.  The Tradeoffs of Large Scale Learning , 2007, NIPS.

[8]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[9]  Yann LeCun,et al.  Large Scale Online Learning , 2003, NIPS.

[10]  Deepak Agarwal,et al.  Regression-based latent factor models , 2009, KDD.

[11]  Bin Cao,et al.  Multi-Domain Collaborative Filtering , 2010, UAI.

[12]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[13]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[14]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[15]  Eric P. Xing,et al.  Sparse Additive Generative Models of Text , 2011, ICML.

[16]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[17]  Lars Schmidt-Thieme,et al.  Factorizing personalized Markov chains for next-basket recommendation , 2010, WWW '10.

[18]  A. Gelman A Bayesian Formulation of Exploratory Data Analysis and Goodness‐of‐fit Testing * , 2003 .

[19]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[20]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[21]  Sandeep Koranne,et al.  Boost C++ Libraries , 2011 .

[22]  Yehuda Koren,et al.  Yahoo! music recommendations: modeling music ratings with temporal dynamics and item taxonomy , 2011, RecSys '11.

[23]  Jun Wang,et al.  Unifying user-based and item-based collaborative filtering approaches by similarity fusion , 2006, SIGIR.

[24]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[25]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[26]  Vanja Josifovski,et al.  Finding the right consumer: optimizing for conversion in display advertising campaigns , 2012, WSDM '12.

[27]  Léon Bottou,et al.  Stochastic Learning , 2003, Advanced Lectures on Machine Learning.