Poisson point process modeling for polyphonic music transcription.

Peaks detected in the frequency domain spectrum of a musical chord are modeled as realizations of a nonhomogeneous Poisson point process. When several notes are superimposed to make a chord, the processes for individual notes combine to give another Poisson process, whose likelihood is easily computable. This avoids a data association step linking individual harmonics explicitly with detected peaks in the spectrum. The likelihood function is ideal for Bayesian inference about the unknown note frequencies in a chord. Here, maximum likelihood estimation of fundamental frequencies shows very promising performance on real polyphonic piano music recordings.

[1]  Simon J. Godsill,et al.  Bayesian harmonic models for musical signal analysis , 2003 .

[2]  Anssi Klapuri,et al.  Multiple fundamental frequency estimation based on harmonicity and spectral smoothness , 2003, IEEE Trans. Speech Audio Process..

[3]  Julius O. Smith,et al.  Bayesian identification of closely-spaced chords frim single-frame STFT peaks , 2004 .

[4]  D. Salmond,et al.  Spatial distribution model for tracking extended objects , 2005 .

[5]  Simon J. Godsill Computational Modeling of Musical Signals , 2004 .

[6]  Simon J. Godsill,et al.  Bayesian harmonic models for musical pitch estimation and analysis , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  David Barber,et al.  A generative model for music transcription , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  H. Akaike A new look at the statistical model identification , 1974 .

[9]  Mark B. Sandler,et al.  Automatic Piano Transcription Using Frequency and Time-Domain Information , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Simon J. Godsill,et al.  Poisson models for extended target and group tracking , 2005, SPIE Optics + Photonics.

[11]  Kunio Kashino,et al.  A fast search algorithm for background music signals based on the search for numerous small signal components , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[12]  J. Beauchamp,et al.  Fundamental frequency estimation of musical signals using a two‐way mismatch procedure , 1994 .