Paper information - Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes


This paper proposes a conceptually simple and computationally efficient fundamental frequency (F0) estimator for polyphonic music signals. The studied class of estimators calculates the salience, or strength, of an F0 candidate as a weighted sum of the amplitudes of its harmonic partials. A mapping from the Fourier spectrum to an “F0 salience spectrum” is found by optimization using generated training material. Based on the resulting function, three different estimators are proposed: a “direct” method, an iterative estimation and cancellation method, and a method that estimates multiple F0s jointly. The latter two performed as well as a considerably more complex reference method. The number of concurrent sounds is estimated along with their F0s.
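The core idea of the abstract, scoring an F0 candidate by a weighted sum of the spectral amplitudes at its harmonic positions, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names `harmonic_salience` and `estimate_f0`, the fixed harmonic count, the Hann window, and in particular the `1/m` weight are assumptions chosen for illustration, whereas the paper obtains its weighting by optimization on training material. The sketch picks only the single most salient candidate and does not cover the iterative cancellation or joint estimators.

```python
import numpy as np


def harmonic_salience(magnitudes, sample_rate, f0, num_harmonics=10,
                      weight=lambda m: 1.0 / m):
    """Weighted sum of spectral magnitudes at the harmonics of f0.

    `magnitudes` is the magnitude of np.fft.rfft of an even-length frame.
    The default 1/m weight is a placeholder assumption, not a learned weight.
    """
    n_fft = 2 * (len(magnitudes) - 1)  # frame length implied by the rfft size
    total = 0.0
    for m in range(1, num_harmonics + 1):
        # Nearest FFT bin to the m-th harmonic partial of the candidate.
        k = int(round(m * f0 * n_fft / sample_rate))
        if k >= len(magnitudes):
            break  # partial lies above the Nyquist frequency
        total += weight(m) * magnitudes[k]
    return total


def estimate_f0(frame, sample_rate, candidates):
    """Return the candidate F0 (in Hz) with the highest salience."""
    windowed = frame * np.hanning(len(frame))
    magnitudes = np.abs(np.fft.rfft(windowed))
    saliences = [harmonic_salience(magnitudes, sample_rate, f0)
                 for f0 in candidates]
    return candidates[int(np.argmax(saliences))]
```

With a decaying weight, a candidate at half the true F0 collects the same partials but at higher harmonic indices, so it scores lower than the true F0. Without some decay, such sub-octave candidates would tie with the correct one.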

Anssi Klapuri
