A bifeature voiced/unvoiced discrimination algorithm for speech signals in the presense of noise

In this paper, a bifeature algorithm for the voiced/unvoiced (V/UV) discrimination of noise degraded speech signals is presented. A measure of merit for periodicity, and a normalized low frequency energy ratio are encompassed as two speech features which are proposed to derive from the linear prediction (LP) residual of the pre-processed noisy speech. Based on the statistically determined signal-dependent initial-thresholds of the features, a logical and multi-check V/UV classification criterion is developed which is capable of significantly removing the artifacts in the transitional frames between voiced and unvoiced regions. Through simulation results, a superior performance of the proposed V/UV discrimination scheme for noisy speech has been illustrated compared to some of the existing methods.

[1]  José A. R. Fonollosa,et al.  A comparison of several recent methods of fundamental frequency and voicing decision estimation , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  Takao Kobayashi,et al.  IFAS-based voiced/unvoiced classification of speech signal , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  Wei-Ping Zhu,et al.  A multifeature voiced/unvoiced decision algorithm for noisy speech , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[4]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[5]  Fabrice Plante,et al.  A pitch extraction reference database , 1995, EUROSPEECH.

[6]  Ariel Salomon,et al.  Use of temporal information: detection of periodicity, aperiodicity, and pitch in speech , 2005, IEEE Transactions on Speech and Audio Processing.

[7]  Wolfgang J. Hess,et al.  Pitch and voicing determination , 1992 .

[8]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.