Paper Information - Efficient System for Speech Recognition using General Regression Neural Network

Efficient System for Speech Recognition using General Regression Neural Network

In this paper we present an efficient system for speaker-independent speech recognition based on a neural network approach. The proposed architecture comprises two phases: a preprocessing phase, consisting of segmental normalization and feature extraction, and a classification phase, which uses a neural network based on nonparametric density estimation, namely the general regression neural network (GRNN). The performance of the proposed model is compared with that of similar recognition systems based on the Multilayer Perceptron (MLP), the Recurrent Neural Network (RNN), and the well-known discrete Hidden Markov Model (HMM-VQ), which we also implemented. Experimental results obtained with Arabic digits show that using nonparametric density estimation with an appropriate smoothing factor (spread) improves the generalization power of the neural network. The word error rate (WER) is reduced significantly relative to the baseline HMM method. GRNN computation is a successful alternative to the other neural networks and to the DHMM.

Keywords: Speech Recognition, General Regression Neural Network, Hidden Markov Model, Recurrent Neural Network, Arabic Digits.
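To make the role of the smoothing factor concrete, here is a minimal sketch of a GRNN used as a classifier, assuming the standard Gaussian-kernel formulation from Specht's GRNN paper (reference [2]). The function names, the two-dimensional toy vectors, and the spread value of 0.5 are illustrative assumptions; they are not the features, data, or settings used in the paper.

```python
import math


def grnn_predict(x, train_x, train_y, spread=0.5):
    """Return the kernel-weighted average of the training targets.

    Each training vector contributes a Gaussian weight
    exp(-||x - x_i||^2 / (2 * spread^2)); the output is the
    weighted mean of the corresponding target vectors.
    """
    weights = []
    for xi in train_x:
        d2 = sum((a - b) ** 2 for a, b in zip(x, xi))
        weights.append(math.exp(-d2 / (2.0 * spread ** 2)))
    total = sum(weights)
    if total == 0.0:
        # Every kernel underflowed (spread too small for this input):
        # fall back to a plain average of the targets.
        weights = [1.0] * len(train_x)
        total = float(len(train_x))
    n_out = len(train_y[0])
    return [
        sum(w * y[k] for w, y in zip(weights, train_y)) / total
        for k in range(n_out)
    ]


def grnn_classify(x, train_x, train_labels, n_classes, spread=0.5):
    """Classify by regressing onto one-hot targets and taking the argmax."""
    one_hot = [
        [1.0 if label == k else 0.0 for k in range(n_classes)]
        for label in train_labels
    ]
    scores = grnn_predict(x, train_x, one_hot, spread)
    return max(range(n_classes), key=lambda k: scores[k])


# Hypothetical toy "feature vectors" for two classes.
train_x = [[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]]
train_labels = [0, 0, 1, 1]

print(grnn_classify([0.2, 0.1], train_x, train_labels, n_classes=2))
print(grnn_classify([0.8, 0.9], train_x, train_labels, n_classes=2))
```

In this formulation a very small spread makes the output depend almost entirely on the nearest training vector, while a very large spread pushes it toward the average over all training targets, which is why the choice of spread affects generalization.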

Abderrahmane Amrouche | Jean Michel Rouvaen

[1] Geoffrey E. Hinton, et al. Phoneme recognition using time-delay neural networks, 1989, IEEE Trans. Acoust. Speech Signal Process.

[2] Donald F. Specht, et al. A general regression neural network, 1991, IEEE Trans. Neural Networks.

[3] T. Cacoullos. Estimation of a multivariate density, 1966.

[4] Kevin J. Lang. A time delay neural network architecture for speech recognition, 1989.

[5] Mokhtar Sellami, et al. Arabic Word Recognition by Classifiers and Context, 2005, Journal of Computer Science and Technology.

[6] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proc. IEEE.

[7] Richard P. Lippmann, et al. Review of Neural Networks for Speech Recognition, 1989, Neural Computation.

[8] Donald F. Specht, et al. Probabilistic neural networks and general regression neural networks, 1996.

[9] Frederick Jelinek, et al. Statistical methods for speech recognition, 1997.

[10] Heekuck Oh, et al. Neural Networks for Pattern Recognition, 1993, Adv. Comput.

[11] H. Bourlard, et al. Link between Markov Models and Multi-layer Perceptron, 1990.