Contextual and Sequential User Embeddings for Large-Scale Music Recommendation

Recommender systems play an important role in providing an engaging experience on online music streaming services. However, the musical domain presents distinctive challenges to recommender systems: tracks are short, listened to multiple times, typically consumed in sessions with other tracks, and relevance is highly context-dependent. In this paper, we argue that modeling users’ preferences at the beginning of a session is a practical and effective way to address these challenges. Using a dataset from Spotify, a popular music streaming service, we observe that a) consumption from the recent past and b) session-level contextual variables (such as the time of the day or the type of device used) are indeed predictive of the tracks a user will stream—much more so than static, average preferences. Driven by these findings, we propose CoSeRNN, a neural network architecture that models users’ preferences as a sequence of embeddings, one for each session. CoSeRNN predicts, at the beginning of a session, a preference vector, based on past consumption history and current context. This preference vector can then be used in downstream tasks to generate contextually relevant just-in-time recommendations efficiently, by using approximate nearest-neighbour search algorithms. We evaluate CoSeRNN on session and track ranking tasks, and find that it outperforms the current state of the art by upwards of 10% on different ranking metrics. Dissecting the performance of our approach, we find that sequential and contextual information are both crucial.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  David Maxwell Chickering,et al.  Using Temporal Data for Making Recommendations , 2001, UAI.

[3]  Guy Shani,et al.  An MDP-Based Recommender System , 2002, J. Mach. Learn. Res..

[4]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[5]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[6]  T. Pettijohn,et al.  Music for the Seasons: Seasonal Music Preferences in College Students , 2010 .

[7]  Paulo Villegas,et al.  Music recommendations with temporal context awareness , 2010, RecSys '10.

[8]  Robin Burke,et al.  Context-aware music recommendation based on latenttopic sequential patterns , 2012, RecSys.

[9]  Thorsten Joachims,et al.  Playlist prediction via metric embedding , 2012, KDD.

[10]  Francesco Ricci,et al.  Location-aware music recommendation using auto-tagging and hybrid matching , 2013, RecSys.

[11]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[12]  Tie-Yan Liu,et al.  Sequential Click Prediction for Sponsored Search with Recurrent Neural Networks , 2014, AAAI.

[13]  Sergei Vassilvitskii,et al.  The dynamics of repeat consumption , 2014, WWW.

[14]  Dietmar Jannach,et al.  Automated Generation of Music Playlists: Survey and Experiments , 2014, ACM Comput. Surv..

[15]  Ulrich Paquet,et al.  Speeding up the Xbox recommender system using a euclidean transformation for inner-product spaces , 2014, RecSys '14.

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Jianmin Wang,et al.  Will You "Reconsume" the Near Past? Fast Prediction on Short-Term Reconsumption Behaviors , 2015, AAAI.

[18]  Le Song,et al.  Deep Coevolutionary Network: Embedding User and Item Features for Recommendation , 2016, 1609.03675.

[19]  Alexandros Karatzoglou,et al.  Session-based Recommendations with Recurrent Neural Networks , 2015, ICLR.

[20]  Matthias Grossglauser,et al.  Collaborative Recurrent Neural Networks for Dynamic Recommender Systems , 2016, ACML.

[21]  Alex Beutel,et al.  Recurrent Recommender Networks , 2017, WSDM.

[22]  Ivan V. Oseledets,et al.  Tensor methods and recommender systems , 2016, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[23]  Markus Schedl,et al.  The Importance of Song Context in Music Playlists , 2017, RecSys Posters.

[24]  Zi Huang,et al.  Discrete Deep Learning for Fast Content-Aware Recommendation , 2018, WSDM.

[25]  Guoyin Wang,et al.  NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing , 2018, ACL.

[26]  Jia Li,et al.  Latent Cross: Making Use of Context in Recurrent Recommender Systems , 2018, WSDM.

[27]  Dietmar Jannach,et al.  Sequence-Aware Recommender Systems , 2018, UMAP.

[28]  Hamed Zamani,et al.  Current challenges and visions in music recommender systems research , 2017, International Journal of Multimedia Information Retrieval.

[29]  Lina Yao,et al.  Next Item Recommendation with Self-Attention , 2018, ArXiv.

[30]  Fernando Diaz,et al.  Towards a Fair Marketplace: Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems , 2018, CIKM.

[31]  Lina Yao,et al.  Deep Learning Based Recommender System , 2017, ACM Comput. Surv..

[32]  Henriette Cramer,et al.  Global music streaming data reveal diurnal and seasonal patterns of affective preference , 2019, Nature Human Behaviour.

[33]  Rishabh Mehrotra,et al.  The Music Streaming Sessions Dataset , 2018, WWW.

[34]  Christian Hansen,et al.  Unsupervised Neural Generative Semantic Hashing , 2019, SIGIR.

[35]  Jure Leskovec,et al.  Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks , 2019, KDD.

[36]  M. de Rijke,et al.  RepeatNet: A Repeat Aware Neural Recommendation Machine for Session-based Recommendation , 2018, AAAI.

[37]  Chen Ding,et al.  Progress in context-aware recommender systems - An overview , 2019, Comput. Sci. Rev..

[38]  Casper Hansen,et al.  Content-aware Neural Hashing for Cold-start Recommendation , 2020, SIGIR.

[39]  Casper Hansen,et al.  Unsupervised Semantic Hashing with Pairwise Reconstruction , 2020, SIGIR.

[40]  Quan Z. Sheng,et al.  A Survey on Session-based Recommender Systems , 2019, ArXiv.