The Quest for Ground Truth in Musical Artist Tagging in the Social Web Era

Research in Web music information retrieval traditionally focuses on the classication, clustering or categorizing of music into genres or other subdivisions. However, current community-based web sites provide richer descriptors (i.e. tags) for all kinds of products. Although tags have no well-dened semantics, they have proven to be an effective mechanism to label and retrieve items. Moreover, these tags are community-based and hence give a description of a product through the eyes of a community rather than an expert opinion. In this work we focus on Last.fm, which is currently the largest music community web service. We investigate whether the tagging of artists is consistent with the artist similarities found with collaborative ltering techniques. As the Last.fm data shows to be both consistent and descriptive, we propose a method to use this community-based data to create a ground truth for artist tagging and artist similarity.

[1]  Peter Knees,et al.  Building an Interactive Next-Generation Artist Recommender Based on Automatically Derived High-Level Concepts , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[2]  Jan Korst,et al.  Tagging Artists using Co-Occurrences on the Web , 2006 .

[3]  Peter Knees,et al.  Assigning and Visualizing Music Genres by Web-based Co-Occurrence Analysis , 2006, ISMIR.

[4]  Tao Li,et al.  A comparative study on content-based music genre classification , 2003, SIGIR.

[5]  François Pachet,et al.  A taxonomy of musical genres , 2000, RIAO.

[6]  Peter Knees,et al.  Artist Classification with Web-Based Data , 2004, ISMIR.

[7]  Peter Knees,et al.  A WEB-BASED APPROACH TO ASSESSING ARTIST SIMILARITY USING CO-OCCURRENCES , 2005 .

[8]  Ichiro Fujinaga,et al.  Musical genre classification: Is it worth pursuing and how can it be improved? , 2006, ISMIR.

[9]  Frank van Harmelen,et al.  APPROXIMATE SEMANTIC MATCHING OF MUSIC CLASSES ON THE INTERNET , 2006 .

[10]  Jan H. M. Korst,et al.  Web-Based Artist Categorization , 2006, ISMIR.

[11]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..

[12]  Daniel P. W. Ellis,et al.  The Quest for Ground Truth in Musical Artist Similarity , 2002, ISMIR.

[13]  Roberto Basili,et al.  Classification of musical genre: a machine learning approach , 2004, ISMIR.

[14]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[15]  François Pachet,et al.  Representing Musical Genre: A State of the Art , 2003 .

[16]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[17]  Gerhard Widmer,et al.  Improvements of Audio-Based Music Similarity and Genre Classificaton , 2005, ISMIR.