FUSING BLOCK-LEVEL FEATURES FOR MUSIC SIMILARITY ESTIMATION

In this paper we present a novel approach to computing music similarity based on block-level features. We first introduce three novel block-level features — the Variance Delta Spectral Pattern (VDSP), the Correlation Pattern (CP) and the Spectral Contrast Pattern (SCP). Then we describe how to combine the extracted features into a single similarity function. A comprehensive evaluation based on genre classification experiments shows that the combined block-level similarity measure (BLS) is comparable, in terms of quality, to the best current method from the literature. But BLS has the important advantage of being based on a vector space representation, which directly facilitates a number of useful operations, such as PCA analysis, k-means clustering, visualization etc. We also show that there is still potential for further improve of music similarity measures by combining BLS with another stateof-the-art algorithm; the combined algorithm then outperforms all other algorithms in our evaluation. Additionally, we discuss the problem of album and artist effects in the context of similaritybased recommendation and show that one can detect the presence of such effects in a given dataset by analyzing the nearest neighbor classification results.

[1]  Elias Pampalk,et al.  Content-based organization and visualization of music archives , 2002, MULTIMEDIA '02.

[2]  Lie Lu,et al.  Music type classification by spectral contrast feature , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[3]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[4]  Masataka Goto,et al.  SmartMusicKIOSK: music listening station with chorus-search function , 2003, UIST '03.

[5]  Elias Pampalk,et al.  Please Scroll down for Article Journal of New Music Research the Som-enhanced Jukebox: Organization and Visualization of Music Collections Based on Perceptual Models , 2022 .

[6]  Andreas Rauber,et al.  Evaluation of Feature Extractors and Psycho-Acoustic Transformations for Music Genre Classification , 2005, ISMIR.

[7]  Katharina Morik,et al.  A Benchmark Dataset for Audio Classification and Clustering , 2005, ISMIR.

[8]  Mark Levy,et al.  Lightweight measures for timbral similarity of musical audio , 2006, AMCMM '06.

[9]  François Pachet,et al.  The bag-of-frames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music. , 2007, The Journal of the Acoustical Society of America.

[10]  D. Schnitzer,et al.  STRIVING FOR AN IMPROVED AUDIO SIMILARITY MEASURE , 2007 .

[11]  Klaus Seyerlehner,et al.  FRAME LEVEL AUDIO SIMILARITY - A CODEBOOK APPROACH , 2008 .

[12]  Peter Knees,et al.  On Rhythm and General Music Similarity , 2009, ISMIR.

[13]  Yannis Stylianou,et al.  A scale transform based method for rhythmic similarity of music , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Søren Holdt Jensen,et al.  A tempo-insensitive representation of rhythmic patterns , 2009, 2009 17th European Signal Processing Conference.

[15]  J. Kepler,et al.  Album And Artist Effects For Audio Similarity At The Scale Of The Web , 2009 .

[16]  Markus Schedl,et al.  Block-Level Audio Features for Music Genre Classification , 2009 .

[17]  Klaus Seyerlehner,et al.  INFORMED SELECTION OF FRAMES FOR MUSIC SIMILARITY COMPUTATION , 2009 .