Phoneme recognition using time-dependent versions of self-organizing maps

Two modifications of the self-organizing map (SOM) are proposed that, unlike the original algorithm, take into account time-dependent features of the input signal. In the first, a time average of a sequence of responses of one SOM is found, and this is recognized by another SOM. In the second, successive input patterns are concatenated together and recognized by the SOM. Comparing the results to those of a recognition system utilizing the original SOM, it was found that one could improve the recognition of isolated phonemes from 10.4% of errors to 7.0% and 5.0% of errors for the integration model and concatenation model, respectively. The improvement in a full-scale system where phoneme segments are also to be located is from 9.2% of errors to 8.2% and 7.6% of errors for the new methods, respectively.<<ETX>>

[1]  K.-F. Lee,et al.  Speaker-independent recognition of connected utterances using recurrent and non-recurrent neural networks , 1989, International 1989 Joint Conference on Neural Networks.

[2]  Alex Waibel,et al.  Consonant recognition by modular construction of large phonemic time-delay neural networks , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[5]  Jari Kangas,et al.  Time-delayed self-organizing maps , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[6]  Shigeru Katagiri,et al.  Shift-invariant, multi-category phoneme recognition using Kohonen's LVQ2 , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7]  Teuvo Kohonen,et al.  Transient map method in stop consonant discrimination , 1989, EUROSPEECH.

[8]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9]  Olli Ventä,et al.  Phonetic typewriter for Finnish and Japanese , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[10]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .