Paper information - On preprocessing of speech signals

On preprocessing of speech signals

Preprocessing of speech signals is considered a crucial step in developing a robust and efficient speech or speaker recognition system. In this paper, we present some popular strategies, based on statistical outlier detection, for separating the silence/unvoiced part of a speech signal from the voiced part. The proposed methods rely on the 3σ edit rule and the Hampel identifier, and are compared with two conventional techniques: (i) short-time energy (STE) based methods and (ii) distribution-based methods. The results obtained by applying the proposed strategies to some test voice signals are encouraging.
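The abstract names the two outlier rules without giving their formulas, so the following is only a minimal sketch of their textbook forms, not the paper's implementation. The 3σ edit rule flags a value that lies more than three standard deviations from the mean; the Hampel identifier replaces the mean with the median and the standard deviation with the scaled median absolute deviation (MAD). The function names, the threshold `k=3.0`, the conventional Gaussian scale factor 1.4826, and the `frame_energies` values are all assumptions made for illustration. Applying the rules to per-frame energies is likewise an assumption; the abstract does not say which quantity the paper tests.

```python
import statistics


def three_sigma_outliers(values, k=3.0):
    """Flag values farther than k standard deviations from the mean."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [False] * len(values)
    return [abs(v - mean) > k * std for v in values]


def hampel_outliers(values, k=3.0):
    """Flag values farther than k scaled MADs from the median."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    scale = 1.4826 * mad  # makes the MAD comparable to a Gaussian std
    if scale == 0:
        return [False] * len(values)
    return [abs(v - med) > k * scale for v in values]


# Hypothetical per-frame energies: a quiet background with two loud frames.
frame_energies = [0.010, 0.012, 0.009, 0.011, 0.010,
                  0.950, 0.900, 0.011, 0.008, 0.010]

sigma_flags = three_sigma_outliers(frame_energies)
hampel_flags = hampel_outliers(frame_energies)

# Indices of the frames each rule treats as outliers.
sigma_idx = [i for i, f in enumerate(sigma_flags) if f]
hampel_idx = [i for i, f in enumerate(hampel_flags) if f]
print("3-sigma:", sigma_idx)
print("Hampel: ", hampel_idx)
```

On this toy input the 3σ rule flags nothing, because the two loud frames inflate the mean and standard deviation they are tested against, while the Hampel identifier flags frames 5 and 6. The example is constructed to show that difference in robustness and says nothing about the paper's reported results.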
