High-Level Audio Features: Distributed Extraction and Similarity Search

Today, automatic extraction of high-level audio features suffers from two main scalability issues. First, the extraction algorithms are very demanding in terms of memory and computation resources. Second, copyright laws prevent the audio files to be shared among computers, limiting the use of existing distributed computation frameworks and reducing the transparency of the methods evaluation process. The iSound Music Warehouse (iSoundMW), presented in this paper, is a framework to collect and query high-level audio features. It performs the feature extraction in a two-step process that allows distributed computations while respecting copyright laws. Using public computers, the extraction can be performed on large scale music collections. However, to be truly valuable, data management tools to search among the extracted features are needed. The iSoundMW enables similarity search among the collected high-level features and demonstrates its flexibility and efficiency by using a weighted combination of high-level features and constraints while showing good search performance results.

[1]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[2]  George Tzanetakis,et al.  Distributed Audio Feature Extraction for Music , 2005, ISMIR.

[3]  Torben Bach Pedersen,et al.  Using Fuzzy Song Sets in Music Warehouses , 2009, Scalable Fuzzy Algorithms for Data Management and Analysis.

[4]  J. Stephen Downie,et al.  Evaluation of a simple and effective music information retrieval method , 2000, SIGIR '00.

[5]  Edward Y. Chang,et al.  DynDex: a dynamic and non-metric space indexer , 2002, MULTIMEDIA '02.

[6]  Christian S. Jensen,et al.  Efficient Similarity Retrieval In Music Databases , 2006, COMAD.

[7]  Daniel P. W. Ellis,et al.  Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs , 2007, ISMIR.

[8]  David P. Anderson,et al.  SETI@home: an experiment in public-resource computing , 2002, CACM.

[9]  Beth Logan,et al.  A music similarity function based on signal analysis , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[10]  Ichiro Fujinaga,et al.  Overview of OMEN , 2006, ISMIR.

[11]  Tim Pohle,et al.  GENERATING SIMILARITY-BASED PLAYLISTS USING TRAVELING SALESMAN ALGORITHMS , 2005 .

[12]  J. Stephen Downie,et al.  The Music Information Retrieval Evaluation eXchange (MIREX) , 2006 .

[13]  Guojun Lu,et al.  Improved Perceptual Tempo Detection of Music , 2005, 11th International Multimedia Modelling Conference.

[14]  Andreas F. Ehmann,et al.  The Music Information Retrieval Evaluation Exchange "Do-It-Yourself" Web Service , 2007, ISMIR.

[15]  Guojun Lu,et al.  Perceptual rhythm determination of music signal for emotion-based classification , 2006, 2006 12th International Multi-Media Modelling Conference.