A Query by Humming System for Music Information Retrieval

In this paper we propose a music Query by Humming System made of two main functional blocks; the first implements a voice-to-midi transcription algorithm to process the query, the second implements a search engine based on a novel template matching technique for Dynamic Time Warping. The voice-to-midi algorithm transforms the sung or hummed query in a MIDI file by segmenting and identifying the notes' sequence. The search engine uses a Template Matching technique to produce a list of possible melodies that best match the searched one. In the test phase, first, we investigated performance of the search engine in retrieval using a synthetic test bench; a set of artificial queries is build placing and adjusting, in the queries, patterns of typical disturbance. Second, we use a genetic algorithm to automatically optimize the performance of the overall system using a real-life test bench. Results highlight that the proposed MIR system has good performances and is robust enough to be employed in real life applications.

[1]  J. Príncipe,et al.  A Novel Pitch Determination Algorithm based on Generalized Correlation Function , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[2]  Ning Hu,et al.  A comparison of melodic database retrieval techniques using sung queries , 2002, JCDL '02.

[3]  Avery Wang,et al.  The Shazam music recognition service , 2006, CACM.

[4]  S. Dixon ONSET DETECTION REVISITED , 2006 .

[5]  Anssi Klapuri,et al.  Query by humming of midi and audio using locality sensitive hashing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Jenq-Shiou Leu,et al.  Design and Implementation of Music Information Retrieval and Gathering Engine (MIRAGE) , 2009, 2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks.

[7]  Seungmin Rho,et al.  An Efficient Voice Transcription Scheme for Music Retrieval , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[8]  Antonello Rizzi,et al.  A Correntropy-based voice to MIDI transcription algorithm , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[9]  José Manuel Iñesta Quereda,et al.  A Pattern Recognition Approach for Melody Track Selection in MIDI Files , 2006, ISMIR.

[10]  Ömer Egecioglu,et al.  An efficient uniform-cost normalized edit distance algorithm , 1999, 6th International Symposium on String Processing and Information Retrieval. 5th International Workshop on Groupware (Cat. No.PR00268).

[11]  Fernando Pereira,et al.  Multimedia Retrieval and Delivery: Essential Metadata Challenges and Standards , 2008, Proceedings of the IEEE.