MusicGalaxy - An Adaptive User-Interface for Exploratory Music Retrieval

Sometimes users of a music retrieval system are not able to explicitly state what they are looking for. They rather want to browse a collection in order to get an overview and to discover interesting content. A common approach for browsing a collection relies on a similarity-preserving projection of objects (tracks, albums or artists) onto the (typically two-dimensional) display space. Inevitably, this implicates the use of dimension reduction techniques that cannot always preserve neighborhood and thus introduce distortions of the similarity space. MusicGalaxy is an interface for exploring large music collections (on the track level) using a galaxy metaphor that addresses the problem of distorted neighborhoods. Furthermore, the interface allows to adapt the underlying similarity measure to the user’s way of comparing tracks by weighting different facets of music similarity. Figure 1 shows a screenshot of the interface visualizing a music collection. Each track is displayed as a star (i.e. a point) with its brightness and (to some extend) its hue depending on a predefined importance measure (here a play count obtained from last.fm ‐ other measures such as a general popularity or ratings are possible). A spatially well distributed subset of the collection (specified by filters) is additionally displayed as a album cover for orientation. The arrangement of the stars is computed using multidimensional scaling (MDS) relying on a set of descriptive features to be extracted beforehand. (Alternatively, feature information may also be annotated manually or collected from external sources.) MDS is a popular neighborhoodpreserving projection technique that attempts to preserve the distances (dissimilarities) between the objects in the projection. The result of the MDS is optimal w.r.t. the minimization of the overall distance distortions. Thus, fixing one distorted neighborhood is not possible without damaging others. However, if the user shows interest in a specific neighborhood, this one can get a higher priority and be temporarily fixed (to some extend) at the cost of the other neighborhoods. To this end, an adaptive distortion technique called SpringLens [1] is applied that is guided by the user’s focus of interest. The SpringLens is a complex overlay of multiple fish-eye lenses divided into primary and secondary focus. The primary focus is a single large fish-eye lens used to zoom into regions of interest. At the same time, it compacts the surrounding space but does not hide it from the user to preserve overview. While the user can control the primary focus, the secondary focus is automatically adapted. It consists of a varying number of

[1]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[2]  Mark de Berg,et al.  Computational geometry: algorithms and applications , 1997 .

[3]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[4]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[5]  Elias Pampalk,et al.  Content-based organization and visualization of music archives , 2002, MULTIMEDIA '02.

[6]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[7]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[8]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[9]  François Pachet,et al.  Improving Timbre Similarity : How high’s the sky ? , 2004 .

[10]  Ichiro Fujinaga,et al.  jAudio: An Feature Extraction Library , 2005, ISMIR.

[11]  Daniel P. W. Ellis,et al.  Song-Level Features and Support Vector Machines for Music Classification , 2005, ISMIR.

[12]  Timo Götzelmann,et al.  SpringLens Distributed Nonlinear Magnifications , 2006, Eurographics.

[13]  Peter Knees,et al.  The CoMIRVA Toolkit for Visualizing Music-Related Data , 2007, EuroVis.

[14]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[15]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[16]  Andreas Nürnberger,et al.  MusicGalaxy: A Multi-focus Zoomable Interface for Multi-facet Exploration of Music Collections , 2010, CMMR.

[17]  Andreas Nürnberger,et al.  Evaluation of adaptive SpringLens: a multi-focus interface for exploring multimedia collections , 2010, NordiCHI.