Language Models for Next-Track Music Recommendation

Next-track music recommendation is the task of automatically determining the next song to play in a music listening session. Almost all music streaming platforms on the web provide their users with such a feature today. In this work, we propose the use of language modeling techniques for this task and investigate how well these techniques perform in the context of popular and also more diverse music. For this, we implement two basic language models, one based on n-grams and the other based on a recurrent neural network. We evaluate these models on two datasets, one limited to popular music and one consisting of more diverse tracks. Further, we also compare them with a nearest-neighbor model. Our results suggest that language models perform well in the context of popular music and can be used both as a basis for more sophisticated models and as a strong comparative baseline.

[1]  Markus Schedl,et al.  The LFM-1b Dataset for Music Retrieval and Recommendation , 2016, ICMR.

[2]  Liming Zhu,et al.  Hybrid Collaborative Recommendation via Semi-AutoEncoder , 2017, ICONIP.

[3]  Dietmar Jannach,et al.  Beyond "Hitting the Hits": Generating Coherent Music Playlist Continuations with the Right Tracks , 2015, RecSys.

[4]  Gert R. G. Lanckriet,et al.  The Natural Language of Playlists , 2011, ISMIR.

[5]  Dietmar Jannach,et al.  Automated Generation of Music Playlists: Survey and Experiments , 2014, ACM Comput. Surv..

[6]  Feng Yu,et al.  A Dynamic Recurrent Model for Next Basket Recommendation , 2016, SIGIR.

[7]  Dietmar Jannach,et al.  Biases in Automated Music Playlist Generation: A Comparison of Next-Track Recommending Techniques , 2016, UMAP.

[8]  R. Turrin,et al.  Large Scale Music Recommendation , 2015 .

[9]  Markus Schedl,et al.  Music Playlist Continuation by Learning from Hand-Curated Examples and Song Features: Alleviating the Cold-Start Problem for Rare and Out-of-Set Songs , 2017, DLRS@RecSys.

[10]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[11]  Zoubin Ghahramani,et al.  A Theoretically Grounded Application of Dropout in Recurrent Neural Networks , 2015, NIPS.

[12]  Dietmar Jannach,et al.  User Perception of Next-Track Music Recommendations , 2017, UMAP.

[13]  Scott Sanner,et al.  AutoRec: Autoencoders Meet Collaborative Filtering , 2015, WWW.

[14]  Dietmar Jannach,et al.  Sequence-Aware Recommender Systems , 2018, UMAP.

[15]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[16]  D. Jannach,et al.  EVALUATING THE QUALITY OF PLAYLISTS BASED ON HAND-CRAFTED SAMPLES , 2013 .

[17]  Gert R. G. Lanckriet,et al.  Hypergraph Models of Playlist Dialects , 2012, ISMIR.