The dangers of parsimony in query-by-humming applications

Query-by-humming systems attempt to address the needs of the non-expert user, for whom the most natural query format -- for the purposes of finding a tune, hook or melody of unknown providence -- is to sing it. While human listeners are quite tolerant of error in these queries, a music retrieval mechanism must explicitly model such errors in order to perform its task. We will present a unifying view of existing models, illuminating the assumptions underlying their respective designs, and demonstrating where such assumptions succeed and fail, through analysis and real-world experiments.

[1]  Ian H. Witten,et al.  Towards the digital music library: tune retrieval from acoustic input , 1996, DL '96.

[2]  David Sankoff,et al.  Comparison of musical sequences , 1990, Comput. Humanit..

[3]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[4]  Ian H. Witten,et al.  The New Zealand Digital Library MELody inDEX , 1997, D Lib Mag..

[5]  William P. Birmingham,et al.  Encoding Timing Information for Musical Query Matching , 2002, ISMIR.

[6]  Barry Vercoe,et al.  Melody retrieval on the web , 2001, IS&T/SPIE Electronic Imaging.

[7]  Emanuele Pollastri An Audio Front End for Query-by-Humming Systems , 2001, ISMIR.

[8]  Geraint A. Wiggins,et al.  SIA(M)ESE: An Algorithm for Transposition Invariant, Polyphonic Content-Based Music Retrieval , 2002, ISMIR.

[9]  Yuen-Hsien Tseng,et al.  Content-based retrieval for music collections , 1999, SIGIR '99.

[10]  William P. Birmingham,et al.  Johnny Can't Sing: A Comprehensive Error Model for Sung Music Queries , 2002, ISMIR.

[11]  J. Stephen Downie,et al.  Evaluating a simple approach to music information retrieval : conceiving melodic n-grams as text , 1999 .

[12]  William P. Birmingham,et al.  HMM-based musical query retrieval , 2002, JCDL '02.

[13]  W. D. Ward,et al.  Recognition of musical key: Exploratory study , 1982 .

[14]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[15]  Donna K. Harman,et al.  Overview of the Fifth Text REtrieval Conference (TREC-5) , 1996, TREC.

[16]  Jyri Huopaniemi,et al.  Melodic Resolution in Music Retrieval , 2001 .

[17]  Steffen Pauws,et al.  CubyHum: a fully operational "query by humming" system , 2002, ISMIR.

[18]  Roger B. Dannenberg,et al.  Melody Matching Directly From Audio , 2001 .