Use of Multi-Layered Networks for Coding Speech with Phonetic Features

Preliminary results on speaker-independant speech recognition are reported. A method that combines expertise on neural networks with expertise on speech recognition is used to build the recognition systems. For transient sounds, event-driven property extractors with variable resolution in the time and frequency domains are used. For sonorant speech, a model of the human auditory system is preferred to FFT as a front-end module.

[1]  Jean Rouat,et al.  Use of Procedural Knowledge for Automatic Speech Recognition , 1987, IJCAI.

[2]  Renato De Mori,et al.  Learning and Plan Refinement in a Knowledge-Based System for Automatic Speech Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  M. Sachs,et al.  Effects of nonlinearities on speech encoding in the auditory nerve. , 1979, The Journal of the Acoustical Society of America.

[4]  Pietro Laface,et al.  Parallel Algorithms for Syllable Recognition in Continuous Speech , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[6]  Stephanie Seneff,et al.  Pitch and spectral analysis of speech based on an auditory synchrony model , 1985 .

[7]  S. Seneff A joint synchrony/mean-rate model of auditory speech processing , 1990 .

[8]  Stephanie Seneff,et al.  A computational model for the peripheral auditory system: Application of speech recognition research , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Stephanie Seneff Pitch and spectral estimation of speech based on auditory synchrony model , 1984, ICASSP.

[10]  B. Delgutte,et al.  Speech coding in the auditory nerve: I. Vowel-like sounds. , 1984, The Journal of the Acoustical Society of America.

[11]  M. Sachs,et al.  Representation of stop consonants in the discharge patterns of auditory-nerve fibers. , 1983, The Journal of the Acoustical Society of America.

[12]  C D Geisler,et al.  Responses of auditory-nerve fibers to consonant-vowel syllables. , 1981, The Journal of the Acoustical Society of America.

[13]  Yoshua Bengio,et al.  Data-Driven Execution of Multi-Layered Networks for Automatic Speech Recognition , 1988, AAAI.

[14]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[15]  B. Delgutte Representation of speech-like sounds in the discharge patterns of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.