On preprocessing of speech signals

Preprocessing of speech signals is considered a crucial step in the development of a robust and efficient speech or speaker recognition system. In this paper, we present some popular statistical outlier-detection based strategies to segregate the silence/unvoiced part of the speech signal from the voiced portion. The proposed methods are based on the utilization of the 3 σ edit rule, and the Hampel Identifier which are compared with the conventional techniques: (i) short-time energy (STE) based methods, and (ii) distribution based methods. The results obtained after applying the proposed strategies on some test voice signals are encouraging.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Lawrence R. Rabiner,et al.  A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition , 1976 .

[3]  Douglas M. Hawkins Identification of Outliers , 1980, Monographs on Applied Probability and Statistics.

[4]  Chris Chatwin,et al.  On shadow elimination after moving region segmentation based on different threshold selection strategies , 2007 .

[5]  Donald G. Childers,et al.  Silent and voiced/unvoiced/mixed excitation (four-way) classification of speech , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[7]  N. Cox Statistical Models in Engineering , 1970 .

[8]  Abhijit Mitra,et al.  Recognition of Isolated Speech Signals using Simplified Statistical Parameters , 2007 .

[9]  Ronald K. Pearson,et al.  Mining imperfect data - dealing with contamination and incomplete records , 2005 .

[10]  Goutam Saha,et al.  A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applications , 2006 .

[11]  A. Madansky Identification of Outliers , 1988 .

[12]  Ronald K. Pearson,et al.  Outliers in process modeling and identification , 2002, IEEE Trans. Control. Syst. Technol..

[13]  G. J. Hahn,et al.  Statistical models in engineering , 1967 .

[14]  Abhijit Mitra,et al.  Identification of Primitive Speech Signals using TMS320C54x DSP Processor , 2009 .

[15]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[16]  David G. Stork,et al.  Pattern Classification , 1973 .

[17]  V. Sarma,et al.  Studies on pattern recognition approach to voiced-unvoiced-silence classification , 1978, ICASSP.