Phoneme classification experiments using radial basis functions

The application of a radial basis functions network to a static speech pattern classification problem is described. The radial basis functions network offers training times two to three orders of magnitude faster than backpropagation, when training networks of similar power and generality. Recognition results compare well with those obtained using backpropagation and a vector-quantized hidden Markov model on the same problem. A computationally efficient method of exactly solving linear networks in a noniterative fashion is also described. The method was applied to classification of vowels into 20 classes using three different types of input analysis and varying numbers of radial basis functions. The three types of input vectors consisted of linear-prediction-coding cepstral coefficient; formant tracks with frequency, amplitude, and bandwidth information; and bark-scaled formant tracks. All input analyses were supplemented with duration information. The best test results were obtained using the cepstral coefficients and 170 or more radial basis functions.<<ETX>>

[1]  R. L. Grimsdale Electronic Computers , 1957, Nature.

[2]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[3]  A W Huggins,et al.  Speech quality evaluation using "phoneme-specific" sentences. , 1985, The Journal of the Acoustical Society of America.

[4]  William H. Press,et al.  Numerical recipes in C. The art of scientific computing , 1987 .

[5]  M. Jack,et al.  Globally optimising formant tracker using generalised centroids , 1987 .

[6]  M. J. D. Powell,et al.  Radial basis functions for multivariable interpolation: a review , 1987 .

[7]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[8]  John Moody,et al.  Speedy alternatives to back propagation , 1988, Neural Networks.

[9]  David Lowe,et al.  A Hybrid Optimisation Strategy for Adaptive Feed-Forward Layered Networks , 1988 .

[10]  David S. Broomhead,et al.  Multivariable Functional Interpolation and Adaptive Networks , 1988, Complex Syst..

[11]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[12]  Yasuo Ariki,et al.  Hierarchical phoneme discrimination by hidden Markov modelling using cepstrum and formant information , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[13]  Richard W. Prager,et al.  The modified Kanerva model for automatic speech recognition , 1989 .

[14]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[15]  Steve Renals,et al.  Learning phoneme recognition using neural networks , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[16]  Mahesan Niranjan,et al.  Neural networks and radial basis functions in classifying static speech patterns , 1990 .