A New Pitch Estimation Method Based on AMDF

In this paper, a new modified average magnitude difference function (MAMDF) is proposed which is robust for noise-corrupt speech pitch estimation. The traditional technology in pitch estimation can easily give rise to the problem of detecting error pitch period. And their estimation performance behaves badly with the occurrence of background noise. In the process of calculation on speech samples, MAMDF presented in this paper has the property of strengthening the characteristic of pitch period and reducing the influence of background noise. And therefore, MAMDF can not only decrease the disadvantage brought by the decreasing tendency of pitch period but also overcome the error caused by severe variation between neighboring samples. The experiment which is implemented in CSTR database shows that MAMDF is greatly superior to AMDF and CAMDF both in clean speech environment and noisy speech environment, representing prominent precision and robustness in pitch estimation.

[1]  B. Yegnanarayana,et al.  Epoch extraction of voiced speech , 1975 .

[2]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[3]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[4]  DeLiang Wang,et al.  A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Hui Li,et al.  A Pitch Detection Algorithm Based on AMDF and ACF , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[6]  Chong Kwan Un,et al.  A performance comparison of pitch extraction algorithms for noisy speech , 1984, ICASSP.

[7]  W. Bastiaan Kleijn,et al.  Estimation of the Instantaneous Pitch of Speech , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  David A. Krubsack,et al.  An autocorrelation pitch detector and voicing decision with confidence measures developed for noise-corrupted speech , 1991, IEEE Trans. Signal Process..

[9]  Wang Yu-guo Circular AMDF and Pitch Estimation Based on It , 2003 .

[10]  M. Schroeder Period histogram and product spectrum: new methods for fundamental-frequency measurement. , 1968, The Journal of the Acoustical Society of America.

[11]  Liang Gu,et al.  Perceptual harmonic cepstral coefficients for speech recognition in noisy environment , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  B. Atal Automatic Speaker Recognition Based on Pitch Contours , 1969 .

[13]  Shwu-Huey Yen,et al.  Solving the Antisymmetry Problem Caused by Pitch Interval and Duration Ratio in Geometric Matching of Music , 2010, J. Multim..

[14]  Wei-Ping Zhu,et al.  A Robust Pitch Estimation Algorithm in Noise , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Hyun-Woo Lee,et al.  Speech Quality Measurement Methods with Applying PLC Algorithms on Real-time Transmission Control Scheme for VoIP Service , 2006, J. Multim..

[16]  Chin-Teng Lin,et al.  Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure , 2001, IEEE Trans. Speech Audio Process..

[17]  Y. Kuroiwa,et al.  An improvement of LPC based on noise reduction using pitch synchronous addition , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[18]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[19]  Keum-Young Jang,et al.  Pitch alteration technique in a speech synthesis system , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[20]  Ariel Salomon,et al.  Use of temporal information: detection of periodicity, aperiodicity, and pitch in speech , 2005, IEEE Transactions on Speech and Audio Processing.

[21]  Rakesh Taori,et al.  Speech compression using pitch synchronous interpolation , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[22]  Douglas D. O'Shaughnessy,et al.  Automatic and reliable estimation of glottal closure instant and period , 1989, IEEE Trans. Acoust. Speech Signal Process..

[23]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..

[24]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[25]  Myung-Jin Bae,et al.  Pitch alteration technique in speech synthesis system , 2001, IEEE Trans. Consumer Electron..

[26]  Alan B. Bradley,et al.  Speech compression by vector quantization of epochs , 1999, ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359).