论文信息 - A Graphical Model for Recognizing Sung Melodies

A Graphical Model for Recognizing Sung Melodies

A method is presented for automatic transcription of sung melodic fragments to score-like representation, including metric values and pitch. A joint model for pitch, rhythm, segmentation, and tempo is dened for a sung fragment. We then discuss the identication of the globally optimal musical transcription, given the observed audio data. A post process estimates the location of the tonic, so the transcription can be presented into they key of C. Experimental results are presented for a small test collection.

Christopher Raphael

[1] Masataka Goto,et al. An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds , 2001 .

[2] Christopher Raphael,et al. A hybrid graphical model for rhythmic parsing , 2002, Artif. Intell..

[3] Ali Taylan Cemgil,et al. Bayesian Music Transcription , 1997 .

[4] Steffen Pauws,et al. CubyHum: a fully operational "query by humming" system , 2002, ISMIR.

[5] William P. Birmingham,et al. Name that tune: A pilot study in finding a melody from a sung query , 2004, J. Assoc. Inf. Sci. Technol..

[6] Mark D. Plumbley,et al. Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[7] William P. Birmingham,et al. Johnny Can't Sing: A Comprehensive Error Model for Sung Music Queries , 2002, ISMIR.

[8] Christopher Raphael,et al. Harmonic analysis with probabilistic graphical models , 2003, ISMIR.

[9] Christopher Raphael,et al. A Hybrid Graphical Model for Aligning Polyphonic Audio with Musical Scores , 2004, ISMIR.

[10] Ali Taylan Cemgil,et al. Monte Carlo Methods for Tempo Tracking and Rhythm Quantization , 2011, J. Artif. Intell. Res..

[11] Eric D. Scheirer,et al. Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[12] Ian H. Witten,et al. Towards the digital music library: tune retrieval from acoustic input , 1996, DL '96.

[13] Marc Leman,et al. An Auditory Model Based Transcriber of Singing Sequences , 2002, ISMIR.

[14] Simon Dixon,et al. Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[15] Emanuele Pollastri. An Audio Front End for Query-by-Humming Systems , 2001, ISMIR.

[16] Kyoungro Yoon,et al. Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System , 2002, ISMIR.