Leveraging knowledge bases and parallel annotations for music genre translation

Considerable effort has gone into automatically inferring the genres of musical items. Yet the proposed solutions often rely on simplifications and fail to address the diversity and subjectivity of music genres. Accounting for these, however, has many benefits for aligning knowledge sources, integrating data, and enriching musical items with tags. Here, we take a new angle on genre study by seeking to predict the genres of musical items in a target tag system, given the genres assigned to them in source tag systems. We call this a translation task and identify three cases: 1) no common annotated corpus exists between the source and target tag systems, 2) a large such corpus exists, 3) only a few common annotations exist. We propose a solution for each case, respectively: a knowledge-based translation modeled as taxonomy mapping; a statistical translation modeled with maximum likelihood logistic regression; and a hybrid translation modeled with maximum a posteriori logistic regression, with priors given by the knowledge-based translation. In the evaluation, each solution fits its identified case well, and the hybrid translation is systematically the most effective w.r.t. multilabel classification metrics. This is a first attempt to unify genre tag systems by leveraging both representation and interpretation diversity.
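To make the hybrid case concrete, the following is a minimal sketch of maximum a posteriori logistic regression whose prior mean comes from a knowledge-based mapping. It is an illustration under stated assumptions, not the method as implemented in this work: the Gaussian form of the prior, the plain gradient-descent optimizer, and the names `fit_map_logistic`, `W_prior`, and `strength` are all hypothetical choices made here. Setting `strength` to 0 recovers the maximum likelihood (purely statistical) case.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def fit_map_logistic(X, Y, W_prior, strength=1.0, lr=0.5, steps=500):
    """Fit one logistic regression per target tag by gradient descent.

    X        : (n, s) binary matrix, source tags per item.
    Y        : (n, t) binary matrix, target tags per item.
    W_prior  : (s, t) prior mean of the weights, e.g. scores from a
               knowledge-based source-to-target mapping (assumed input).
    strength : precision of the assumed Gaussian prior; 0 gives
               maximum likelihood.
    """
    n = X.shape[0]
    W = W_prior.astype(float).copy()  # start from the prior mean
    b = np.zeros(Y.shape[1])
    for _ in range(steps):
        P = sigmoid(X @ W + b)
        # Gradient of the mean negative log-likelihood ...
        grad_W = X.T @ (P - Y) / n
        # ... plus the gradient of the Gaussian log-prior penalty.
        grad_W += strength * (W - W_prior) / n
        grad_b = (P - Y).mean(axis=0)
        W -= lr * grad_W
        b -= lr * grad_b
    return W, b


# Toy data: two source tags, one target tag, four parallel annotations.
X = np.array([[1, 0], [0, 1], [1, 0], [0, 1]], dtype=float)
Y = np.array([[1], [0], [1], [0]], dtype=float)
# Hypothetical knowledge-based prior: source tag 0 maps to the target tag.
W_prior = np.array([[1.0], [-1.0]])

W_ml, b_ml = fit_map_logistic(X, Y, W_prior, strength=0.0)
W_map, b_map = fit_map_logistic(X, Y, W_prior, strength=4.0)
```

With few parallel annotations, a larger `strength` keeps the learned weights close to the knowledge-based mapping, while a large corpus lets the likelihood term dominate.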
