Learning phoneme recognition using neural networks

The authors have applied two neural-network models (back-propagation network and radial-basis-functions network) to a static speech recognition problem. The radial-basis-functions network offers training times of over two orders of magnitude faster than back-propagation, when training networks to similar power and generality. The authors have computed recognition statistics of the two models with varying numbers of hidden units on this recognition problem. The back-propagation network may offer increased generalization and robustness. Both models compare favorably with a vector-quantized hidden Markov model on the same problem.<<ETX>>

[1]  M. J. D. Powell,et al.  Radial basis functions for multivariable interpolation: a review , 1987 .

[2]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[3]  Richard P. Lippmann,et al.  A neural net approach to speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[4]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[5]  A W Huggins,et al.  Speech quality evaluation using "phoneme-specific" sentences. , 1985, The Journal of the Acoustical Society of America.

[6]  Jonathan Harrington,et al.  A connectionist approach to speech recognition using peripheral auditory modelling , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[8]  Alex Waibel,et al.  Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[9]  Bernard Widrow,et al.  Adaptive Signal Processing , 1985 .

[10]  Richard W. Prager,et al.  The modified Kanerva model for automatic speech recognition , 1989 .

[11]  M. Jack,et al.  Globally optimising formant tracker using generalised centroids , 1987 .

[12]  Yasuo Ariki,et al.  Parameter re-estimation in semicontinuous hidden Markov modelling of speech with feedback to vector quantisation codebook , 1988 .

[13]  John Moody,et al.  Speedy alternatives to back propagation , 1988, Neural Networks.