AnEffective Design for Fast Query-by-Humming System with Melody Segmentation and Feature Extraction

A melody segmentation scheme based on pivotal points combined Deep Auto-Encoder (DAE) for Query-by-Humming (QBH) systems is investigated. In order to deal with personalization of human voice, especially the instability in rhythm, we adopt a joint melody segmentation and encoded features extraction method. Pitch sequence between pivotal points shows characteristic differences largely due to monolithic deviation on tones, dynamic diversification in vocal range and mutative scale of tempo. The pivotal points in melody involve extreme and some extreme midpoints which depict the global trends, simultaneously exhibit robust to humming melody. Based on feature extraction, a novel melody segmentation scheme is derived to solve the problem. Through the trained DAE from the deep learning, the deep level coding features in melody are acquired. The experiment results are carried out to demonstrate the validity of the proposed scheme.

[1]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[2]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[3]  Christos Faloutsos,et al.  Efficiently supporting ad hoc queries in large datasets of time sequences , 1997, SIGMOD '97.

[4]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[5]  Anssi Klapuri,et al.  Query by humming of midi and audio using locality sensitive hashing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Craig G. Nevill-Manning,et al.  Distance metrics and indexing strategies for a digital library of popular music , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[7]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[8]  Mohan S. Kankanhalli,et al.  Similarity matching of continuous melody contours for humming querying of melody databases , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[9]  Heggere S. Ranganath,et al.  An effective subsequence-to-subsequence time series matching approach , 2014, 2014 Science and Information Conference.

[10]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[11]  Qiang Wang,et al.  A query by humming system based on locality sensitive hashing indexes , 2013, Signal Process..