Sequential Scenario-Specific Meta Learner for Online Recommendation

Cold-start problems are long-standing challenges for practical recommendations. Most existing recommendation algorithms rely on extensive observed data and are brittle to recommendation scenarios with few interactions. This paper addresses such problems usingfew-shot learning andmeta learning. Our approach is based on the insight that having a good generalization from a few examples relies on both a generic model initialization and an effective strategy for adapting this model to newly arising tasks. To accomplish this, we combine the scenario-specific learning with a model-agnostic sequential meta-learning and unify them into an integrated end-to-end framework, namely S cenario-specific S equential Meta learner (or s^2Meta). By doing so, ourmeta-learner produces a generic initial model through aggregating contextual information from a variety of prediction tasks while effectively adapting to specific tasks by leveraging learning-to-learn knowledge. Extensive experiments on various real-world datasets demonstrate that our proposed model can achieve significant gains over the state-of-the-arts for cold-start problems in online recommendation. Deployment is at the Guess You Like session, the front page of the Mobile Taobao; and the illustration video can also be watched from the link\footnote\urlhttps://youtu.be/TNHLZqWnQwc .

[1]  Sergio Ilarri,et al.  Datasets for Context-Aware Recommender Systems: Current Context and Possible Directions , 2018, 2018 IEEE 34th International Conference on Data Engineering Workshops (ICDEW).

[2]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[3]  Paul Covington,et al.  Deep Neural Networks for YouTube Recommendations , 2016, RecSys.

[4]  Ke Wang,et al.  Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding , 2018, WSDM.

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Peter Ingwersen,et al.  Information retrieval in context: IRiX , 2005, SIGF.

[7]  Gediminas Adomavicius,et al.  Incorporating contextual information in recommender systems using a multidimensional approach , 2005, TOIS.

[8]  Martha Larson,et al.  Cross-Domain Collaborative Filtering with Factorization Machines , 2014, ECIR.

[9]  H. Robbins A Stochastic Approximation Method , 1951 .

[10]  Marcin Andrychowicz,et al.  Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[11]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  Gediminas Adomavicius,et al.  Context-aware recommender systems , 2008, ACM Conference on Recommender Systems.

[13]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Metalearning and Recommender Systems: A literature review and empirical study on the algorithm selection problem for Collaborative Filtering , 2018, Inf. Sci..

[14]  Ricardo Vilalta,et al.  Metalearning - Applications to Data Mining , 2008, Cognitive Technologies.

[15]  John Riedl,et al.  When recommenders fail: predicting recommender failure for algorithm selection and combination , 2012, RecSys.

[16]  Paolo Cremonesi,et al.  Cross-Domain Recommender Systems , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[17]  Hugo Larochelle,et al.  Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[18]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[19]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[20]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[21]  Jan N. van Rijn,et al.  Hyperparameter Importance Across Datasets , 2017, KDD.

[22]  Evaggelia Pitoura,et al.  A context-aware preference database system , 2008, Int. J. Pervasive Comput. Commun..

[23]  R. Olshavsky,et al.  Task Complexity and Contingent Processing in Brand Choice , 1979 .

[24]  Hong Yu,et al.  Meta Networks , 2017, ICML.

[25]  Neil Davey,et al.  The importance of hyperparameters selection within small datasets , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[26]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[27]  Jingrui He,et al.  A Graph-based Recommendation across Heterogeneous Domains , 2015, CIKM.

[28]  Yu Zhang,et al.  CoNet: Collaborative Cross Networks for Cross-Domain Recommendation , 2018, UMCit@KDD.

[29]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[30]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[31]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[32]  Mo Yu,et al.  One-Shot Relational Learning for Knowledge Graphs , 2018, EMNLP.

[33]  Xiaolong Jin,et al.  Cross-Domain Recommendation: An Embedding and Mapping Approach , 2017, IJCAI.

[34]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[35]  Heng-Tze Cheng,et al.  Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.

[36]  Ee-Peng Lim,et al.  A Survey of Recommender Systems in Twitter , 2012, SocInfo.

[37]  Joshua Achiam,et al.  On First-Order Meta-Learning Algorithms , 2018, ArXiv.

[38]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[39]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[40]  Oriol Vinyals,et al.  Matching Networks for One Shot Learning , 2016, NIPS.

[41]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Selecting Collaborative Filtering Algorithms Using Metalearning , 2016, ECML/PKDD.

[42]  Alexander Tuzhilin,et al.  Using Context to Improve Predictive Modeling of Customers in Personalization Applications , 2008, IEEE Transactions on Knowledge and Data Engineering.

[43]  Julian J. McAuley,et al.  Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering , 2016, WWW.

[44]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Bill N. Schilit,et al.  Disseminating active map information to mobile hosts , 1994, IEEE Network.

[46]  R. J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[47]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[48]  Gregory R. Koch,et al.  Siamese Neural Networks for One-Shot Image Recognition , 2015 .

[49]  Hugo Larochelle,et al.  A Meta-Learning Perspective on Cold-Start Recommendations for Items , 2017, NIPS.

[50]  Dietmar Jannach,et al.  Sequence-Aware Recommender Systems , 2018, UMAP.

[51]  Alexandros Karatzoglou,et al.  Recurrent Neural Networks with Top-k Gains for Session-based Recommendations , 2017, CIKM.

[52]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[53]  Jude W. Shavlik,et al.  Transfer Learning via Advice Taking , 2010, Advances in Machine Learning I.

[54]  Maksims Volkovs,et al.  DropoutNet: Addressing Cold Start in Recommender Systems , 2017, NIPS.

[55]  John W. Payne,et al.  A Constructive Process View of Decision Making: Multiple Strategies in Judgment and Choice , 1992 .

[56]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.