Audio Retrieval Based on Manifold Ranking

This paper proposes an audio information retrieval model based on Manifold Ranking (MR) and improving ranking results by relevance feedback algorithm. Timbre component has been employed as the main feature. To compute the timbre similarity, it is necessary to extract the spectrum features for each frame. The large set of frames is clustered by a Gaussian Mixture Model (GMM) and Expectation Maximization. The typical spectra frame from GMM is drawn as the data points, manifold ranking assigns each data point a relative ranking score, which is treated as a distance instead of traditional similarity metrics based on pair-wise distance. Furthermore, manifold ranking algorithm can be easily generalized by adding these positive examples by relevance feedback algorithm, and improves the final result. Experimental results show the proposed approach is effective to improve the ranking capability of the existing distance functions.

[1]  Jyh-Shing Roger Jang,et al.  Continuous HMM and Its Enhancement for Singing/Humming Query Retrieval , 2005, ISMIR.

[2]  Brian Christopher Smith,et al.  Query by humming: musical information retrieval in an audio database , 1995, MULTIMEDIA '95.

[3]  George Tzanetakis,et al.  MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[4]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[5]  Yong Zhu,et al.  Fast Manifold-Ranking for Content-Based Image Retrieval , 2009, 2009 ISECS International Colloquium on Computing, Communication, Control, and Management.

[6]  François Pachet,et al.  Music Similarity Measures: What's the use? , 2002, ISMIR.

[7]  Anssi Klapuri,et al.  Query by humming of midi and audio using locality sensitive hashing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Jyh-Shing Roger Jang,et al.  A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Masataka Goto,et al.  A Stochastic Representation of the Dynamics of Sung Melody , 2007, ISMIR.

[10]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[11]  Jingrui He,et al.  Manifold-ranking based image retrieval , 2004, MULTIMEDIA '04.

[12]  J.-S. Roger Jang MIREX SYMBOLIC MELODIC SIMILARITY AND QUERY BY SINGING / HUMMING , 2006 .

[13]  Jingrui He,et al.  Generalized Manifold-Ranking-Based Image Retrieval , 2006, IEEE Transactions on Image Processing.

[14]  Xiaojun Wan,et al.  Manifold-Ranking Based Topic-Focused Multi-Document Summarization , 2007, IJCAI.

[15]  Yuxin Peng,et al.  Audio retrieval by segment-based manifold-ranking , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[16]  Chun Chen,et al.  Efficient manifold ranking for image retrieval , 2011, SIGIR.

[17]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[18]  Elias Pampalk,et al.  Computational Models of Music Similarity and their Application in Music Information Retrieval , 2006 .

[19]  J. Stephen Downie,et al.  Ten Years of ISMIR: Reflections on Challenges and Opportunities , 2009, ISMIR.

[20]  Franz de Leon,et al.  USING TIMBRE MODELS FOR AUDIO MUSIC SIMILARITY ESTIMATION , 2013 .

[21]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.