Influence of music representation on compression-based clustering

Multimedia Information Retrieval is currently a hot research topic due the popularity of the World Wide Web and the huge amount of multimedia data available. There exists an increasing interest to design and develop new methods and techniques to represent and classify this kind of information. Among the different sources of multimedia information currently available, we have decided to work with music audio files. Three different music representations (binary code, wave information, and SAX) have been used to study how the selection of a particular representation could affect a clustering process based on a set of similarity clusters. Two different algorithms (a hierarchical clustering method based on the quartet tree method and a genetic algorithm) have been applied to automatically perform the clustering. A compression distance, the Normalized Compression Distance (NCD), has been used to generate the similarities among the music files. This distance is parameter-free and widely applicable so we can use it directly with different formats and representations. The paper shows some experimental results using these representations and compares the behavior of both clustering methods.

[1]  Manuel Cebrián,et al.  Reducing the Loss of Information through Annealing Text Distortion , 2011, IEEE Transactions on Knowledge and Data Engineering.

[2]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[3]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[4]  David Parry,et al.  Use of Kolmogorov distance identification of web page authorship , topic and domain , 2005 .

[5]  Wei Peng,et al.  Music Clustering with Constraints , 2007, ISMIR.

[6]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[7]  Barbara Hammer,et al.  Graph-Based Representation of Symbolic Musical Data , 2009, GbRPR.

[8]  Wei Peng,et al.  Music Clustering With Features From Different Information Sources , 2009, IEEE Transactions on Multimedia.

[9]  Xian Zhang,et al.  Information distance from a question to an answer , 2007, KDD '07.

[10]  Luis Filipe Coelho Antunes,et al.  Clustering Fetal Heart Rate Tracings by Compression , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[11]  Jörg Kindermann,et al.  Text Categorization with Support Vector Machines. How to Represent Texts in Input Space? , 2002, Machine Learning.

[12]  Manuel Cebrián,et al.  Evaluating the Impact of Information Distortion on Normalized Compression Distance , 2008, ICMCTA.

[13]  Manuel Cebrián,et al.  Contextual information retrieval based on algorithmic information theory and statistical outlier detection , 2007, 2008 IEEE Information Theory Workshop.

[14]  Vittorio Loreto,et al.  Language trees and zipping. , 2002, Physical review letters.

[15]  Ronald de Wolf,et al.  Algorithmic Clustering of Music Based on String Compression , 2004, Computer Music Journal.

[16]  Boris Ryabko,et al.  Compression-Based Methods for Nonparametric Prediction and Estimation of Some Characteristics of Time Series , 2009, IEEE Transactions on Information Theory.

[17]  Dennis K. Peters,et al.  Software Documents: Comparison and Measurement , 2007, SEKE.

[18]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[19]  Alexander Kraskov,et al.  Published under the scientific responsability of the EUROPEAN PHYSICAL SOCIETY Incorporating , 2002 .

[20]  李明,et al.  New Information Distance Measure and Its Application in Question Answering System , 2008 .

[21]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[22]  Bin Ma,et al.  The similarity metric , 2001, IEEE Transactions on Information Theory.

[23]  Ming Li,et al.  Clustering by compression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..

[24]  Hrishikesh Deshpande,et al.  CLASSIFICATION OF MUSIC SIGNALS IN THE VISUAL DOMAIN , 2001 .