Pitch extraction using dyadic wavelet transform and modified higher order moment

Pitch detection is the process of determining the period of the vocal cords closure or in another word the time duration of one glottal closed, open and returning phase. Dyadic wavelets transform (DyWT) and modified higher order moment, which is based on the autocorrelation function, are two pitch detection methods. DyWT is an accurate pitch detection method, however it has less accuracy compared to modified higher order moment. On the other hand modified higher order moment has high computational complexity and is time consuming. In this paper, we propose a pitch detection method based on DyWT which has use modified higher order moment. Modified higher order moment is applied only in some specific parts to improve the accuracy and keep the computational complexity low. Finally, our simulations and evaluations on some speech utterances indicate that the new pitch detection method improves previous DyWT method.

[1]  M. Puckette,et al.  Calculation of a "narrowed" autocorrelation function. , 1989, The Journal of the Acoustical Society of America.

[2]  Stéphane Mallat,et al.  Characterization of Signals from Multiscale Edges , 2011, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Gloria Faye Boudreaux-Bartels,et al.  A comparison of a wavelet functions for pitch detection of speech signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Juan Carlos,et al.  Review of "Discrete-Time Speech Signal Processing - Principles and Practice", by Thomas Quatieri, Prentice-Hall, 2001 , 2003 .

[5]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..

[6]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[7]  Saeed Vaseghi,et al.  Pitch extraction using modified higher order moments , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[9]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[10]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[11]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[12]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[13]  Thomas Quatieri,et al.  Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .

[14]  José A. R. Fonollosa,et al.  Pitch determination of noisy speech using higher order statistics , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[16]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[17]  LI Jing,et al.  A pitch detector based on the dyadic wavelet transform and the autocorrelation function , 2002, 6th International Conference on Signal Processing, 2002..

[18]  M. J. Cheng,et al.  Comparative performance study of several pitch detection algorithms , 1975 .

[19]  Bing Song,et al.  A new pitch detection algorithm based on wavelet transform , 2005 .