论文信息 - The dangers of parsimony in query-by-humming applications

The dangers of parsimony in query-by-humming applications

Query-by-humming systems attempt to address the needs of the non-expert user, for whom the most natural query format -- for the purposes of finding a tune, hook or melody of unknown providence -- is to sing it. While human listeners are quite tolerant of error in these queries, a music retrieval mechanism must explicitly model such errors in order to perform its task. We will present a unifying view of existing models, illuminating the assumptions underlying their respective designs, and demonstrating where such assumptions succeed and fail, through analysis and real-world experiments.

William P. Birmingham | Colin Meek

[1] Ian H. Witten,et al. Towards the digital music library: tune retrieval from acoustic input , 1996, DL '96.

[2] David Sankoff,et al. Comparison of musical sequences , 1990, Comput. Humanit..

[3] L. Baum,et al. A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[4] Ian H. Witten,et al. The New Zealand Digital Library MELody inDEX , 1997, D Lib Mag..

[5] William P. Birmingham,et al. Encoding Timing Information for Musical Query Matching , 2002, ISMIR.

[6] Barry Vercoe,et al. Melody retrieval on the web , 2001, IS&T/SPIE Electronic Imaging.

[7] Emanuele Pollastri. An Audio Front End for Query-by-Humming Systems , 2001, ISMIR.

[8] Geraint A. Wiggins,et al. SIA(M)ESE: An Algorithm for Transposition Invariant, Polyphonic Content-Based Music Retrieval , 2002, ISMIR.

[9] Yuen-Hsien Tseng,et al. Content-based retrieval for music collections , 1999, SIGIR '99.

[10] William P. Birmingham,et al. Johnny Can't Sing: A Comprehensive Error Model for Sung Music Queries , 2002, ISMIR.

[11] J. Stephen Downie,et al. Evaluating a simple approach to music information retrieval : conceiving melodic n-grams as text , 1999 .

[12] William P. Birmingham,et al. HMM-based musical query retrieval , 2002, JCDL '02.

[13] W. D. Ward,et al. Recognition of musical key: Exploratory study , 1982 .

[14] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[15] Donna K. Harman,et al. Overview of the Fifth Text REtrieval Conference (TREC-5) , 1996, TREC.

[16] Jyri Huopaniemi,et al. Melodic Resolution in Music Retrieval , 2001 .

[17] Steffen Pauws,et al. CubyHum: a fully operational "query by humming" system , 2002, ISMIR.

[18] Roger B. Dannenberg,et al. Melody Matching Directly From Audio , 2001 .