A Game-Based Approach for Collecting Semantic Annotations of Music

Games based on human computation are a valuable tool for collecting semantic information about images. We show how to transfer this idea to the music domain in order to collect high-quality semantic information about songs. We present Listen Game, an online, multiplayer game that measures the semantic relationship between music and words. In the normal mode, a player sees a list of semantically related words (e.g., instruments, emotions, usages, genres) and is asked to pick the best and the worst word to describe a song. In the freestyle mode, a player is asked to suggest a new word that describes the music. Each player receives real-time feedback about the agreement among all players. Using the data collected during a two-week pilot study of Listen Game, we learn a supervised multiclass labeling (SML) model and show that it can annotate a novel song with meaningful words and retrieve relevant songs from a database of audio content.
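As an illustration of the normal mode described above, the following minimal sketch tallies the best and worst picks of one round and turns them into per-word agreement fractions. The abstract does not specify how Listen Game computes agreement or scores, so the function names (`tally_round`, `player_agreement`), the vote format, and the fraction-based measure are all assumptions made for this example.

```python
from collections import Counter


def tally_round(votes):
    """Summarize one round of a best/worst word-picking game.

    `votes` is a list of (best_word, worst_word) pairs, one per player.
    Returns, for each word that was picked, the fraction of players who
    chose it as best and as worst. (Hypothetical format, not the
    actual Listen Game data model.)
    """
    n = len(votes)
    if n == 0:
        return {}
    best = Counter(b for b, _ in votes)
    worst = Counter(w for _, w in votes)
    words = set(best) | set(worst)
    # Counter returns 0 for words that were never picked in a given role.
    return {w: {"best": best[w] / n, "worst": worst[w] / n} for w in words}


def player_agreement(choice, votes):
    """Fraction of all players whose best pick matches `choice`.

    One possible form of feedback about agreement; assumed, not taken
    from the source.
    """
    if not votes:
        return 0.0
    return sum(1 for b, _ in votes if b == choice) / len(votes)


if __name__ == "__main__":
    round_votes = [
        ("guitar", "piano"),
        ("guitar", "drums"),
        ("piano", "drums"),
        ("guitar", "piano"),
    ]
    for word, stats in sorted(tally_round(round_votes).items()):
        print(word, stats)
    print(player_agreement("guitar", round_votes))
```

Fractions like these could serve as soft song-word association weights when training an annotation model, though how the collected votes are actually converted into training labels is not stated in the abstract.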
