Review of the anti-noise method in the speech recognition technology

With the development of science and technology, the requirement in the aspect of dealing with the relationship between human and machine is higher and higher. Speech recognition is one subject based on the pattern recognition and is closely related to computer science, psychology, linguistics, and signal processing, etc. The key step of speech recognition is the pretreatment process of speech signal. This paper introduces three common anti-noise speech processing technology, i.e., speech enhancement algorithm, feature compensation technology, and model compensation technology. Finally, some existing problems are enumerated in the speech recognition technology.

[1]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[2]  T. Martin,et al.  On the effects of varying filter bank parameters on isolated word recognition , 1982 .

[3]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[4]  Phil D. Green,et al.  Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study , 1999, EUROSPEECH.

[5]  K. Davis,et al.  Automatic Recognition of Spoken Digits , 1952 .

[6]  John S. D. Mason,et al.  Speaker recognition models , 1995, EUROSPEECH.

[7]  Seiichi Nakagawa,et al.  A Survey on Automatic Speech Recognition , 2002 .

[8]  Chin-Hui Lee,et al.  On Automatic Speech Recognition at the Dawn of the 21st Century , 2003 .

[9]  Zdravko Kacic,et al.  A noise robust feature extraction algorithm using joint wavelet packet subband decomposition and AR modeling of speech signals , 2007, Signal Process..

[10]  Naoto Iwahashi Training Method for Pattern Classifier Based on the Performance after Adaptation , 2000 .

[11]  C. D. Forgie,et al.  Automatic Recognition of Spoken Digits , 1958 .

[12]  Hsiao-Chuan Wang,et al.  Bandwidth-adjusted LPC analysis for robust speech recognition , 2003, Pattern Recognit. Lett..

[13]  Chung-Hsien Wu,et al.  Multi-keyword spotting of telephone speech using a fuzzy search algorithm and keyword-driven two-level CBSM , 2001, Speech Commun..

[14]  Harry F. Olson,et al.  Phonetic typewriter , 1957 .

[15]  Haizhou Li,et al.  A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition , 2010, IEEE Transactions on Audio, Speech, and Language Processing.