Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
暂无分享,去创建一个
The authors propose an approach for estimating pitch of speech in noisy environments based on instantaneous frequency (IF). First, they define the IF amplitude spectrum, which is obtained by projecting the STFT amplitude spectrum onto the IF axis. Based on the IF amplitude spectrum, one can perform harmonics enhancement by suppressing the aperiodic components. Next, they define an evaluation function to find pitch. This is done by expanding the IF amplitude spectrum to the time region. Then they propose a method for obtaining a continuous pitch contour using dynamic programming. Experiments show accuracy and robustness of the method especially when noise exists.
[1] Hermann Ney. A dynamic programming technique for nonlinear smoothing , 1981, ICASSP.
[2] Takao Kobayashi,et al. Harmonics Estimation Based on Instantaneous Frequency and Its Application to Pitch Determination of Speech , 1995, IEICE Trans. Inf. Syst..
[3] P. Vaidyanathan. Multirate Systems And Filter Banks , 1992 .
[4] Martin Cooke,et al. Tracking spectral dominances in an auditory model , 1993 .
[5] J. L. Flanagan,et al. PHASE VOCODER , 2008 .