Hit Song Prediction: Leveraging Low- and High-Level Audio Features

Assessing the potential success of a given song based on its acoustic characteristics is an important task in the music industry. This task has mostly been approached from an internal perspective, utilizing audio descriptors to predict the success of a given song, where either lowor high-level audio features have been utilized separately. In this work, we aim to jointly exploit lowand high-level audio features and model the prediction as a regression task. Particularly, we make use of a wide and deep neural network architecture that allows for jointly exploiting lowand high-level features. Furthermore, we enrich the set of features with information about the release year of tracks. We evaluate our approach based on the Million Song Dataset and characterize a song as a hit if it is contained in the Billboard Hot 100 at any point in time. Our findings suggest that the proposed approach is able to outperform baseline approaches as well as approaches utilizing lowor high-level features individually. Furthermore, we find that incorporating the release year as well as features describing the mood and vocals of a song improve prediction results.

[1]  Michael A. Casey,et al.  Study of Chinese and UK Hit Songs Prediction , 2013 .

[2]  Daniel Müllensiefen,et al.  Is it the song and not the singer? Hit song prediction using structural features of melodies. , 2015 .

[3]  Jong-Seok Lee,et al.  Music Popularity: Metrics, Characteristics, and Audio-Based Prediction , 2018, IEEE Transactions on Multimedia.

[4]  Yi-Hsuan Yang,et al.  Revisiting the problem of audio-based hit song prediction using convolutional neural networks , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Yi-Hsuan Yang,et al.  Hit Song Prediction for Pop Music by Siamese CNN with Ranking Loss , 2017, ArXiv.

[6]  T. D. Bie,et al.  Hit Song Science Once Again a Science? , 2011 .

[7]  Bongwon Suh,et al.  #nowplaying the future billboard: mining music listening behaviors of twitter users for hit song prediction , 2014, SoMeRA@SIGIR.

[8]  Beth Logan,et al.  Automatic Prediction of Hit Songs , 2005, ISMIR.

[9]  J. Stephen Downie,et al.  Exploring Mood Metadata: Relationships with Genre, Artist and Usage Metadata , 2007, ISMIR.

[10]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[11]  Zhaoxia Yu,et al.  Musical trends and predictability of success in contemporary songs in and out of the top charts , 2018, Royal Society Open Science.

[12]  Wyeth W. Wasserman,et al.  Deep Feature Selection: Theory and Application to Identify Enhancers and Promoters , 2015, RECOMB.

[13]  Eva Zangerle,et al.  Can Microblogs Predict Music Charts? An Analysis of the Relationship Between #Nowplaying Tweets and Music Charts , 2016, ISMIR.

[14]  F. Pachet Hit Song Science , 2011 .

[15]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[16]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[17]  Xavier Serra,et al.  Essentia: An Audio Analysis Library for Music Information Retrieval , 2013, ISMIR.

[18]  Xavier Serra,et al.  AcousticBrainz: A Community Platform for Gathering Music Information Obtained from Audio , 2015, ISMIR.

[19]  Matthew J. Salganik,et al.  Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market , 2006, Science.

[20]  Kenneth Sörensen,et al.  Dance Hit Song Prediction , 2014, ArXiv.

[21]  Heng-Tze Cheng,et al.  Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.

[22]  Joan Serrà,et al.  Music Mood Annotator Design and Integration , 2009, 2009 Seventh International Workshop on Content-Based Multimedia Indexing.

[23]  Rachel M. Bittner,et al.  Vocals in Music Matter: the Relevance of Vocals in the Minds of Listeners , 2018, ISMIR.

[24]  Dorien Herremans,et al.  Hit Song Prediction Based on Early Adopter Data and Audio Features , 2020, ArXiv.