Paper information - Multi-pitch estimation for polyphonic musical signals

The goal of automatic score transcription is to obtain a score-like representation (note pitches over time) from musical signals. Reliable pitch extraction methods exist for monophonic signals, but polyphonic signals are much more difficult to analyze and are often ambiguous. We propose a computationally efficient technique for automatic recognition of notes in a polyphonic signal. It looks for correctly shaped peaks (in both magnitude and phase) in a multiscale decomposition of the signal that is oversampled in time and frequency. Peaks (partial candidates) are accepted or discarded according to how well they match the window spectrum shape and whether they satisfy continuity-across-scale constraints. The final partial list is used to build a resharpened and equalized spectrum. Note candidates are found by searching for harmonic patterns. Perceptual and source-based rejection criteria help discard false notes, frame by frame. Slightly non-causal postprocessing uses continuity (over an observation time of less than 150 ms) to remove notes that are too short, fill in gaps, and correct (sub)octave jumps.
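
The harmonic pattern search mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no algorithmic detail, so the function names, the `(frequency, magnitude)` partial layout, the 3% relative tolerance, the eight-harmonic limit, and the minimum of four matched harmonics are all assumptions chosen for illustration. The sketch also assumes that the fundamental of each note appears in the partial list, and it omits the perceptual and source-based rejection criteria as well as the postprocessing stage.

```python
from typing import List, Tuple

# Hypothetical layout for one detected partial: (frequency in Hz, magnitude).
Partial = Tuple[float, float]


def harmonic_match(f0: float, partials: List[Partial],
                   n_harmonics: int = 8, tol: float = 0.03) -> Tuple[int, float]:
    """Count the harmonics of f0 that have a partial within a relative tolerance.

    Returns (number of matched harmonics, summed magnitude of the matches).
    """
    matched, energy = 0, 0.0
    if not partials:
        return matched, energy
    for h in range(1, n_harmonics + 1):
        target = h * f0
        # Partial closest in frequency to the h-th harmonic of the hypothesis.
        nearest = min(partials, key=lambda p: abs(p[0] - target))
        if abs(nearest[0] - target) <= tol * target:
            matched += 1
            energy += nearest[1]
    return matched, energy


def find_note_candidates(partials: List[Partial],
                         f0_min: float = 55.0, f0_max: float = 1760.0,
                         min_matched: int = 4) -> List[Tuple[float, int, float]]:
    """Use every partial in [f0_min, f0_max] as a fundamental hypothesis.

    Keeps hypotheses supported by at least min_matched harmonics and returns
    (f0, matched harmonics, matched energy) tuples sorted by ascending f0.
    """
    candidates = []
    for freq, _ in partials:
        if f0_min <= freq <= f0_max:
            matched, energy = harmonic_match(freq, partials)
            if matched >= min_matched:
                candidates.append((freq, matched, energy))
    return sorted(candidates)


# Synthetic frame: two notes (220 Hz and 330 Hz) with four harmonics each;
# the shared 660 Hz partial appears once, as it would in a real spectrum.
frame = [(220.0, 1.0), (330.0, 0.8), (440.0, 0.5), (660.0, 0.6),
         (880.0, 0.3), (990.0, 0.2), (1320.0, 0.2)]
for f0, matched, energy in find_note_candidates(frame):
    print(f"f0={f0:.1f} Hz, matched harmonics={matched}, energy={energy:.2f}")
```

In this synthetic frame the 440 Hz partial matches only three harmonics (440, 880, and 1320 Hz), so the four-harmonic threshold rejects it as an octave error. Real signals call for more careful rejection rules, which is the role the abstract assigns to the perceptual and source-based criteria.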

Francisco Javier Casajús-Quirós | Pablo Fernandez-Cid
