Adaptive content-based music retrieval system

This paper presents a tunable content-based music retrieval (CBMR) system suitable the for retrieval of music audio clips. The audio clips are represented as extracted feature vectors. The CBMR system is expert-tunable by altering the feature space. The feature space is tuned according to the expert-specified similarity criteria expressed in terms of clusters of similar audio clips. The main goal of tuning the feature space is to improve retrieval performance, since some features may have more impact on perceived similarity than others. The tuning process utilizes our genetic algorithm. The R-tree index for efficient retrieval of audio clips is based on the clustering of feature vectors. For each cluster a minimal bounding rectangle (MBR) is formed, thus providing objects for indexing. Inserting new nodes into the R-tree is efficiently performed because of the chosen Quadratic Split algorithm. Our CBMR system implements the point query and the n-nearest neighbors query with the O(logn) time complexity. Different objective functions based on cluster similarity and dissimilarity measures are used for the genetic algorithm. We have found that all of them have similar impact on the retrieval performance in terms of precision and recall. The paper includes experimental results in measuring retrieval performance, reporting significant improvement over the untuned feature space.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Ada Wai-Chee Fu,et al.  Enhanced nearest neighbour search on the R-tree , 1998, SGMD.

[3]  George A. Tsihrintzis,et al.  A middleware system for Web-based digital music libraries , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[4]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[5]  Zora Konjovic,et al.  Tuning the Feature Space for Content-Based Music Retrieval , 2006, STAIRS.

[6]  Branko Milosavljevic,et al.  Models for extensible multimedia document retrieval , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[7]  Wolfgang Lehner,et al.  Eyes4Ears - More than a Classical Music Retrieval System , 2005 .

[8]  Remco C. Veltkamp,et al.  Using transportation distances for measuring melodic similarity , 2003, ISMIR.

[9]  Tim Crawford,et al.  Harmonic models for polyphonic music retrieval , 2002, CIKM '02.

[10]  Nao and Iba Hitoshi Tokui,et al.  Music Composition with Interactive Evolutionary Computation , 2000 .

[11]  Jeremy Pickens,et al.  A Survey of Feature Selection Techniques for Music Information Retrieval , 2001 .

[12]  Nando de Freitas,et al.  "Name That Song!" A Probabilistic Approach to Querying on Music and Text , 2002, NIPS.

[13]  Guodong Guo,et al.  Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[14]  Ning Hu,et al.  A comparison of melodic database retrieval techniques using sung queries , 2002, JCDL '02.

[15]  Remco C. Veltkamp,et al.  A Survey of Music Information Retrieval Systems , 2005, ISMIR.

[16]  Tsuhan Chen,et al.  Audio feature extraction and analysis for scene classification , 1997, Proceedings of First Signal Processing Society Workshop on Multimedia Signal Processing.

[17]  Zora Konjovic,et al.  Design of an XML-based extensible multimedia information retrieval system , 2002, Fourth International Symposium on Multimedia Software Engineering, 2002. Proceedings..

[18]  Yu-lung Lo,et al.  The numeric indexing for music data , 2002, Proceedings 22nd International Conference on Distributed Computing Systems Workshops.

[19]  Frank Kurth,et al.  Identification of Highly Distorted Audio Material for Querying Large Scale Data Bases , 2002 .

[20]  Yannis Manolopoulos,et al.  Audio Indexing for Efficient Music Information Retrieval , 2005, 11th International Multimedia Modelling Conference.

[21]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[22]  Thomas S. Huang,et al.  Supporting Ranked Boolean Similarity Queries in MARS , 1998, IEEE Trans. Knowl. Data Eng..

[23]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[24]  Holger H. Hoos,et al.  GUIDO/MIR - an Experimental Musical Information Retrieval System based on GUIDO Music Notation , 2001, ISMIR.

[25]  Seungmin Rho,et al.  Music Information Retrieval Using a GA-based Relevance Feedback , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[26]  Thomas S. Huang,et al.  A novel relevance feedback technique in image retrieval , 1999, MULTIMEDIA '99.

[27]  Man-Kwan Shan,et al.  Looking for new, not known music only: music retrieval by melody style , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[28]  Michael A. Casey,et al.  Song Intersection by Approximate Nearest Neighbor Search , 2006, ISMIR.

[29]  Jonathan Foote,et al.  Content-based retrieval of music and audio , 1997, Other Conferences.

[30]  Shlomo Dubnov,et al.  Robust temporal and spectral modeling for query By melody , 2002, SIGIR '02.

[31]  Jyh-Shing Roger Query by Tapping: A New Paradigm for Content-Based Music Retrieval from Acoustic Input , 2001 .

[32]  Shenghuo Zhu,et al.  Integrating Features from Different Sources for Music Information Retrieval , 2006, Sixth International Conference on Data Mining (ICDM'06).

[33]  J. T. Foote,et al.  "Content-Based Retrieval of Music and Audio," Multimedia Storage and Archiving System II , 1997 .