Bayesian harmonic models for musical pitch estimation and analysis

Estimating the pitch of musical signals is complicated by the presence of partials in addition to the fundamental frequency. In this paper, we propose developments to an earlier Bayesian model that describes each component signal in terms of its fundamental frequency, its partials (‘harmonics’), and their amplitudes. For greater realism, this basic model is extended to include a non-white residual spectrum, time-varying amplitudes, and partials ‘detuned’ from the natural linear relationship. The unknown parameters of the new model are simulated using a reversible jump MCMC algorithm, leading to a highly accurate pitch estimator. The models and algorithms can be applied to feature extraction, polyphonic music transcription, source separation, and restoration of musical sources.
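As a rough illustration of the kind of signal model described above, the following sketch synthesizes a single note whose partials are slightly detuned from integer multiples of the fundamental, then recovers the fundamental with a simple grid search. Everything here is an assumption made for illustration: the parameterisation of partial m as (m + detune) times the fundamental, the function names, the sampling rate, and the amplitude and detuning values. The estimator is a plain harmonic-summed periodogram, used as a simple stand-in; it is not the reversible jump MCMC algorithm of the paper, and it omits the non-white residual and time-varying amplitudes.

```python
import cmath
import math


def harmonic_signal(f0, amps, detune, n, fs):
    """Sum of cosines; partial m has frequency (m + detune[m-1]) * f0 (assumed form)."""
    return [
        sum(a * math.cos(2 * math.pi * (m + d) * f0 * t / fs)
            for m, (a, d) in enumerate(zip(amps, detune), start=1))
        for t in range(n)
    ]


def harmonic_power(y, f0, n_partials, fs):
    """Periodogram power of y summed over the exact multiples f0, 2*f0, ..."""
    total = 0.0
    for m in range(1, n_partials + 1):
        w = 2 * math.pi * m * f0 / fs
        c = sum(v * cmath.exp(-1j * w * t) for t, v in enumerate(y))
        total += abs(c) ** 2
    return total


def estimate_f0(y, candidates, n_partials, fs):
    """Pick the candidate fundamental with the largest harmonic-summed power."""
    return max(candidates, key=lambda f: harmonic_power(y, f, n_partials, fs))


# Hypothetical test signal: 220 Hz note, three partials, mild detuning.
fs = 8000.0
y = harmonic_signal(220.0, amps=[1.0, 0.6, 0.3],
                    detune=[0.0, 0.004, -0.006], n=800, fs=fs)

# Candidate grid from 150 Hz to 350 Hz in 2 Hz steps (excludes the 110 Hz subharmonic).
candidates = [150.0 + 2.0 * k for k in range(101)]
f0_hat = estimate_f0(y, candidates, n_partials=3, fs=fs)
print(f0_hat)
```

The candidate grid is restricted to a single octave-wide range on purpose: a harmonic-sum score of this kind is prone to subharmonic (octave) errors, which is one of the ambiguities a full probabilistic model over the number of partials is meant to handle more carefully.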
