Robust speech parameters extraction for word recognition in noise using neural networks
暂无分享,去创建一个
An attempt was made to enhance the performance of a DTW (dynamic time warping) speech recognizer by preprocessing speech parameters using a neural network transformation. A multilayer perceptron trained with speech utterances of a single speaker has been used in front of a DTW recognizer. Results show an improvement of about 15% in the recognition rate in all cases, even with a speaker that was not used for training. If the network is not completely speaker independent, a dynamic adaptation to the speaker could be performed.<<ETX>>
[1] Frank Fallside,et al. An adaptive training algorithm for back propagation networks , 1987 .
[2] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .
[3] G. Chollet,et al. Evaluating speech recognizers and data bases , 1988 .
[4] Alex Waibel,et al. Noise reduction using connectionist models , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.