Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features

This paper presents a principled method called MusicThumbnailer to transform musical pieces into visual thumbnail images based on acoustic features extracted from their audio signals. These thumbnails can help users immediately guess the musical contents of audio signals without trial listening. This method is consistent in ways that optimize thumbnails according to the characteristics of a target music collection. This means the appropriateness of transformation should be defined to eliminate ad hoc transformation rules. In this paper, we introduce three top-down criteria to improve memorability of thumbnails (generate gradations), deliver information more completely, and distinguish thumbnails more clearly. These criteria are mathematically implemented as minimization of brightness differences of adjacent pixels and maximization of brightness variances within and between thumbnails. The optimized parameters of a modified linear mapping model we assumed are obtained by minimizing a unified cost function based on the three criteria with a steepest descent method. Experimental results indicate that generated thumbnails can provide users with useful hints as to the musical contents of musical pieces.

[1]  Paul Lamere,et al.  Using 3D Visualizations to Explore and Discover Music , 2007, ISMIR.

[2]  Gerhard Widmer,et al.  Exploring Music Collections by Browsing Different Views , 2004, Computer Music Journal.

[3]  Stefan Leitich,et al.  Globe of Music - Music Library Visualization Using Geosom , 2007, ISMIR.

[4]  Mario Nöcker,et al.  Databionic Visualization of Music Collections According to Perceptual Distance , 2005, ISMIR.

[5]  Masataka Goto,et al.  An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Òscar Celma,et al.  Foafing the Music: A Music Recommendation System based on RSS Feeds and User Preferences , 2005, ISMIR.

[7]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[8]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[9]  Josep Lluís Arcos,et al.  Visualizing and Exploring Personal Music Libraries , 2004, ISMIR.

[10]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[11]  Keiichiro Hoashi,et al.  Personalization of user profiles for content-based music retrieval based on relevance feedback , 2003, ACM Multimedia.

[12]  Andreas Rauber,et al.  The Map of Mozart , 2006, ISMIR.

[13]  Beth Logan,et al.  Music Recommendation from Song Sets , 2004, ISMIR.

[14]  Justin Donaldson Music Recommendation Mapping and Interface Based on Structural Network Entropy , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[15]  William W. Cohen,et al.  Web-collaborative filtering: recommending music by crawling the Web , 2000, Comput. Networks.

[16]  Masataka Goto,et al.  Musicream: New Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces , 2005, ISMIR.

[17]  Peter Knees,et al.  An innovative three-dimensional user interface for exploring music collections enriched , 2006, MM '06.

[18]  Andreas Rauber,et al.  PlaySOM and PocketSOMPlayer, Alternative Interfaces to Large Music Collections , 2005, ISMIR.