A Multiple Functions Multiplication Approach for Pitch Extraction of Noisy Speech

This paper addresses a new concept to produce a noise robust pitch extraction function, which is called multiple functions multiplication. A modified version of the autocorrelation function (ACF) (weighted ACF) is recognized as a multiplication of two functions; ACF and inverse of average magnitude difference function (AMDF). Extending the weighted ACF, a three functions multiplication version is derived where the cumulant based ACF (Cum-ACF) is utilized. A cepstrum (CEP) version of the Cum-ACF (Cum-CEP) is also considered instead of the Cum-ACF. The resulting function consists of a multiplication of three functions; ACF, inverse of AMDF, and Cum-ACF (or Cum-CEP). Through experiments, the performance of the proposed method is investigated. It is shown that the proposed method provides an excellent pitch extraction in several noise environments.

[1]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[2]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[3]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[4]  Wendi B. Heinzelman,et al.  BaNa: A Noise Resilient Fundamental Frequency Detection Algorithm for Speech and Music , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[5]  Tetsuya Shimamura,et al.  A modified cepstrum method for pitch extraction , 1998, IEEE. APCCAS 1998. 1998 IEEE Asia-Pacific Conference on Circuits and Systems. Microelectronics and Integrating Systems. Proceedings (Cat. No.98EX242).

[6]  José Carlos Príncipe,et al.  A Pitch Detector Based on a Generalized Correlation Function , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Tetsuya Shimamura,et al.  Windowless-Autocorrelation-Based Cepstrum Method for Pitch Extraction of Noisy Speech , 2012 .

[8]  PITCH DETERMINATION OF NOISY SPEECH USING HIGHER ORDER STATISTICS , 2004 .

[9]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[10]  Tetsuya Shimamura,et al.  Pitch extraction by using autocorrelation function on the log spectrum , 2000 .

[11]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[12]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[13]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[14]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..