Music Icons: Procedural Glyphs for Audio Files

Nowadays, a personal music collection may comprise thousands of MP3 files. Visualization can help the user to gain an overview and to find similar songs inside so large a set. We describe a method to create icons from audio files in such a way that songs which the user considers similar receive similar icons. This allows visual data mining in standard directory listings of window-based operating systems. The icons consist of bloom-like shapes, whose form and color depend on eight parameters. These parameters are controlled through a neural net, the input of which are audio features that are extracted algorithmically from the MP3 files. To adapt the system to the user's perception and interests, the neural net is initially trained with a small set of songs and icons. User studies done on the system demonstrate a strong perceptual relation between music and icons

[1]  Sougata Mukherjea,et al.  Glyphmaker: creating customized visualizations of complex data , 1994, Computer.

[2]  Fernando Pereira,et al.  MPEG-7 the generic multimedia content description standard, part 1 - Multimedia, IEEE , 2001 .

[3]  Mark Watson Neural Network Library , 1996 .

[4]  Stephen G. Eick,et al.  Glyphs for software visualization , 1997, Proceedings Fifth International Workshop on Program Comprehension. IWPC'97.

[5]  Daniel A. Keim,et al.  43 Visual Data-Mining Techniques* , 2004 .

[6]  Michael D. Byrne,et al.  Using icons to find documents: simplicity is critical , 1993, INTERCHI.

[7]  Herman Chernoff,et al.  The Use of Faces to Represent Points in k- Dimensional Space Graphically , 1973 .

[8]  Ulrich Neumann,et al.  VisualIDs: automatic distinctive icons for desktop interfaces , 2004, SIGGRAPH 2004.

[9]  J. Hartigan,et al.  Representing Points in Many Dimensions by Trees and Castles , 1981 .

[10]  Mira Dontcheva,et al.  Metadata Visualization for Image Browsing , 2005 .

[11]  David S. Ebert,et al.  Procedural Shape Generation for Multi-dimensional Data Visualization , 1999, VisSym.

[12]  Vidya Setlur,et al.  Semanticons: Visual Metaphors as File Icons , 2005, Comput. Graph. Forum.

[13]  Jörn Loviscach,et al.  Evolutionary Design of BRDFs , 2003, Eurographics.

[14]  Xavier Serra,et al.  ISMIR 2004 Audio Description Contest , 2006 .

[15]  François Pachet,et al.  Improving Timbre Similarity : How high’s the sky ? , 2004 .

[16]  G. Widmer,et al.  ON THE EVALUATION OF PERCEPTUAL SIMILARITY MEASURES FOR MUSIC , 2003 .

[17]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[18]  Ichiro Fujinaga,et al.  jAudio: An Feature Extraction Library , 2005, ISMIR.

[19]  Jeroen Breebaart,et al.  Features for audio and music classification , 2003, ISMIR.

[20]  Remco C. Veltkamp,et al.  A Survey of Music Information Retrieval Systems , 2005, ISMIR.