The AcousticBrainz Genre Dataset : Music Genre Recognition with Annotations from Multiple Sources

This paper introduces the AcousticBrainz Genre Dataset, a large-scale collection of hierarchical multi-label genre annotations from different metadata sources. It allows researchers to explore how the same music pieces are annotated differently by different communities following their own genre taxonomies, and how this could be addressed by genre recognition systems. Genre labels for the dataset are sourced from both expert annotations and crowds, permitting comparisons between strict hierarchies and folksonomies. Music features are available via the Acoustic- Brainz database. To guide research, we suggest a concrete research task and provide a baseline as well as an evaluation method. This task may serve as an example of the development and validation of automatic annotation algorithms on complementary datasets with different taxonomies and coverage. With this dataset, we hope to contribute to developments in content-based music genre recognition as well as cross-disciplinary studies on genre metadata analysis.

[1]  Bob L. Sturm A Survey of Evaluation in Music Genre Recognition , 2012, Adaptive Multimedia Retrieval.

[2]  Xavier Serra,et al.  Quantifying Music Trends and Facts Using Editorial Metadata from the Discogs Database , 2017, ISMIR.

[3]  Daniel P. W. Ellis,et al.  A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures , 2004, Computer Music Journal.

[4]  Enric Guaus i Termens Audio content processing for automatic music genre classification: descriptors, databases, and classifiers , 2010 .

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Charles Elkan,et al.  Optimal Thresholding of Classifiers to Maximize F1 Measure , 2014, ECML/PKDD.

[7]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[8]  Masataka Goto,et al.  Development of the RWC Music Database , 2004 .

[9]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[10]  Xavier Serra,et al.  Multi-Label Music Genre Classification from Audio, Text and Images Using Deep Features , 2017, ISMIR.

[11]  Xavier Serra,et al.  Multimodal Deep Learning for Music Genre Classification , 2018, Trans. Int. Soc. Music. Inf. Retr..

[12]  Markus Schedl,et al.  MediaEval 2017 AcousticBrainz Genre Task: Multilayer Perceptron Approach , 2017, MediaEval.

[13]  Xavier Bresson,et al.  FMA: A Dataset for Music Analysis , 2016, ISMIR.

[14]  Andreas Rauber,et al.  Facilitating Comprehensive Benchmarking Experiments on the Million Song Dataset , 2012, ISMIR.

[15]  Xavier Serra,et al.  Essentia: An Audio Analysis Library for Music Information Retrieval , 2013, ISMIR.

[16]  J. Kepler,et al.  Album And Artist Effects For Audio Similarity At The Scale Of The Web , 2009 .

[17]  Hendrik Schreiber,et al.  Improving Genre Annotations for the Million Song Dataset , 2015, ISMIR.

[18]  J. Stephen Downie,et al.  K-Pop Genres: A Cross-Cultural Exploration , 2013, ISMIR.

[19]  George Tzanetakis,et al.  An experimental comparison of audio tempo induction algorithms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Hendrik Schreiber MediaEval 2018 AcousticBrainz Genre Task: A CNN Baseline Relying on Mel-Features , 2018, MediaEval.

[21]  Bob L. Sturm The State of the Art Ten Years After a State of the Art: Future Research in Music Information Retrieval , 2013, ArXiv.

[22]  Geraint A. Wiggins,et al.  How Many Beans Make Five? The Consensus Problem in Music-Genre Classification and a New Evaluation Method for Single-Genre Categorisation Systems , 2007, ISMIR.

[23]  Julián Urbano,et al.  The MediaEval 2018 AcousticBrainz Genre Task: Content-based Music Genre Recognition from Multiple Sources , 2017, MediaEval.

[24]  Benjamin Schrauwen,et al.  Audio-based Music Classification with a Pretrained Convolutional Network , 2011, ISMIR.

[25]  Xavier Serra,et al.  AcousticBrainz: A Community Platform for Gathering Music Information Obtained from Audio , 2015, ISMIR.

[26]  Xavier Serra,et al.  Mining metadata from the web for AcousticBrainz , 2016, DLfm.