论文信息 - Learning phoneme recognition using neural networks

Learning phoneme recognition using neural networks

The authors have applied two neural-network models (back-propagation network and radial-basis-functions network) to a static speech recognition problem. The radial-basis-functions network offers training times of over two orders of magnitude faster than back-propagation, when training networks to similar power and generality. The authors have computed recognition statistics of the two models with varying numbers of hidden units on this recognition problem. The back-propagation network may offer increased generalization and robustness. Both models compare favorably with a vector-quantized hidden Markov model on the same problem.<<ETX>>

Steve Renals | Richard Rohwer

[1] M. J. D. Powell,et al. Radial basis functions for multivariable interpolation: a review , 1987 .

[2] Thomas M. Cover,et al. Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[3] Richard P. Lippmann,et al. A neural net approach to speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[4] D. Broomhead,et al. Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[5] A W Huggins,et al. Speech quality evaluation using "phoneme-specific" sentences. , 1985, The Journal of the Acoustical Society of America.

[6] Jonathan Harrington,et al. A connectionist approach to speech recognition using peripheral auditory modelling , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[8] Alex Waibel,et al. Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[9] Bernard Widrow,et al. Adaptive Signal Processing , 1985 .

[10] Richard W. Prager,et al. The modified Kanerva model for automatic speech recognition , 1989 .

[11] M. Jack,et al. Globally optimising formant tracker using generalised centroids , 1987 .

[12] Yasuo Ariki,et al. Parameter re-estimation in semicontinuous hidden Markov modelling of speech with feedback to vector quantisation codebook , 1988 .

[13] John Moody,et al. Speedy alternatives to back propagation , 1988, Neural Networks.