An evaluation of methodologies for melodic similarity in audio recordings of Indian art music

We perform a comparative evaluation of methodologies for computing similarity between short-time melodic fragments of audio recordings of Indian art music. We experiment with 560 different combinations of procedures and parameter values. These include the choices made for the sampling rate of the melody representation, pitch quantization levels, normalization techniques and distance measures. The dataset used for evaluation consists of 157 and 340 annotated melodic fragments of Carnatic and Hindustani music recordings, respectively. Our results indicate that melodic fragment similarity is particularly sensitive to distance measures and normalization techniques. Sampling rates do not have a significant impact for Hindustani music, but can significantly degrade the performance for Carnatic music. Overall, the performed evaluation provides a better understanding of the processing steps and parameter settings for melodic similarity in Indian art music. Importantly, it paves the way for developing unsupervised melodic pattern discovery approaches, whose evaluation is a challenging and, many times, ill-defined task.

[1]  Eamonn J. Keogh,et al.  Everything you know about Dynamic Time Warping is Wrong , 2004 .

[2]  Alain Daniélou The Ragas of Northern Indian music , 1968 .

[3]  Xavier Serra A Multicultural Approach in Music Information Research , 2011, ISMIR.

[4]  Xavier Serra,et al.  Essentia: An Audio Analysis Library for Music Information Retrieval , 2013, ISMIR.

[5]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[6]  José Miguel Díaz-Báñez,et al.  Tracking Melodic Patterns in Flamenco Singing by Analyzing Polyphonic Music Recordings , 2012, ISMIR.

[7]  Joe Cheri Ross,et al.  Detecting Melodic Motifs from Audio for Hindustani Classical Music , 2012, ISMIR.

[8]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[9]  Roger B. Dannenberg,et al.  Melody Matching Directly From Audio , 2001 .

[10]  Dimitrios Gunopulos,et al.  A survey of query-by-humming similarity methods , 2012, PETRA '12.

[11]  Hema A. Murthy,et al.  Motif Spotting in an Alapana in Carnatic Music , 2013, ISMIR.

[12]  Xavier Serra,et al.  ESSENTIA: an open-source library for sound and music analysis , 2013, ACM Multimedia.

[13]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[14]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[15]  Hema A. Murthy,et al.  Classification of Melodic Motifs in Raga Music with Time-series Matching , 2014 .

[16]  Xavier Serra,et al.  Automatic Tonic Identification in Indian Art Music: Approaches and Evaluation , 2014 .

[17]  Emilia Gómez,et al.  Tonal representations for music retrieval: from version identification to query-by-humming , 2012, International Journal of Multimedia Information Retrieval.

[18]  Emilia Gómez,et al.  Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Ning Hu,et al.  A comparative evaluation of search techniques for query-by-humming using the MUSART testbed , 2007, J. Assoc. Inf. Sci. Technol..

[20]  Preeti Rao,et al.  Detection of Raga-characteristic phrases from Hindustani Classical Music Audio , 2012 .

[21]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[22]  Eleanor Selfridge-Field,et al.  Melodic Similarity : concepts, procedures, and applications , 1998 .

[23]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[24]  Anssi Klapuri,et al.  Query by humming of midi and audio using locality sensitive hashing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[25]  Dennis Shasha,et al.  Warping indexes with envelope transforms for query by humming , 2003, SIGMOD '03.