A Discriminative Model for Polyphonic Piano Transcription

We present a discriminative model for polyphonic piano transcription. Support vector machines trained on spectral features are used to classify frame-level note instances. The classifier outputs are temporally constrained via hidden Markov models, and the proposed system is used to transcribe both synthesized and real piano recordings. A frame-level transcription accuracy of 68% was achieved on a newly generated test set, and direct comparisons to previous approaches are provided.
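
Below is a minimal sketch of the kind of pipeline described above: per-note binary SVMs over spectral frame features, followed by a two-state hidden Markov model that temporally smooths each note's frame-level outputs. The feature dimensions, RBF kernel, and transition probability used here are illustrative assumptions, not the paper's exact settings.

```python
# Hypothetical sketch of the frame-level SVM + HMM transcription pipeline.
# Features, note count, and HMM parameters are illustrative assumptions.
import numpy as np
from sklearn.svm import SVC

def viterbi_smooth(frame_probs, p_stay=0.9):
    """Two-state (off/on) Viterbi smoothing of per-frame note probabilities."""
    T = len(frame_probs)
    log_trans = np.log(np.array([[p_stay, 1 - p_stay],
                                 [1 - p_stay, p_stay]]))
    eps = 1e-9
    # Emission log-likelihoods for states off (0) and on (1).
    log_emit = np.stack([np.log(1 - frame_probs + eps),
                         np.log(frame_probs + eps)], axis=1)
    delta = np.zeros((T, 2))
    psi = np.zeros((T, 2), dtype=int)
    delta[0] = np.log([0.5, 0.5]) + log_emit[0]
    for t in range(1, T):
        for s in range(2):
            scores = delta[t - 1] + log_trans[:, s]
            psi[t, s] = np.argmax(scores)
            delta[t, s] = scores[psi[t, s]] + log_emit[t, s]
    path = np.zeros(T, dtype=int)
    path[-1] = np.argmax(delta[-1])
    for t in range(T - 2, -1, -1):
        path[t] = psi[t + 1, path[t + 1]]
    return path

# One binary SVM per note: X holds spectral feature vectors per frame,
# y[:, k] marks whether note k is sounding in that frame (toy random data).
rng = np.random.default_rng(0)
n_frames, n_features, n_notes = 200, 64, 3
X_train = rng.normal(size=(n_frames, n_features))
y_train = rng.integers(0, 2, size=(n_frames, n_notes))
X_test = rng.normal(size=(50, n_features))

piano_roll = np.zeros((50, n_notes), dtype=int)
for k in range(n_notes):
    clf = SVC(kernel="rbf", probability=True).fit(X_train, y_train[:, k])
    probs = clf.predict_proba(X_test)[:, 1]   # P(note k on | frame)
    piano_roll[:, k] = viterbi_smooth(probs)  # HMM temporal constraint
```

In this sketch the HMM simply discourages single-frame onsets and offsets; replacing the toy random arrays with real spectral features and aligned note labels would yield a frame-level transcriber of the general form the abstract describes.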
