论文信息 - Use of Multi-Layered Networks for Coding Speech with Phonetic Features

Use of Multi-Layered Networks for Coding Speech with Phonetic Features

Preliminary results on speaker-independant speech recognition are reported. A method that combines expertise on neural networks with expertise on speech recognition is used to build the recognition systems. For transient sounds, event-driven property extractors with variable resolution in the time and frequency domains are used. For sonorant speech, a model of the human auditory system is preferred to FFT as a front-end module.

[1] Jean Rouat,et al. Use of Procedural Knowledge for Automatic Speech Recognition , 1987, IJCAI.

[2] Renato De Mori,et al. Learning and Plan Refinement in a Knowledge-Based System for Automatic Speech Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] M. Sachs,et al. Effects of nonlinearities on speech encoding in the auditory nerve. , 1979, The Journal of the Acoustical Society of America.

[4] Pietro Laface,et al. Parallel Algorithms for Syllable Recognition in Continuous Speech , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] M. Sachs,et al. Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[6] Stephanie Seneff,et al. Pitch and spectral analysis of speech based on an auditory synchrony model , 1985 .

[7] S. Seneff. A joint synchrony/mean-rate model of auditory speech processing , 1990 .

[8] Stephanie Seneff,et al. A computational model for the peripheral auditory system: Application of speech recognition research , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9] Stephanie Seneff. Pitch and spectral estimation of speech based on auditory synchrony model , 1984, ICASSP.

[10] B. Delgutte,et al. Speech coding in the auditory nerve: I. Vowel-like sounds. , 1984, The Journal of the Acoustical Society of America.

[11] M. Sachs,et al. Representation of stop consonants in the discharge patterns of auditory-nerve fibers. , 1983, The Journal of the Acoustical Society of America.

[12] C D Geisler,et al. Responses of auditory-nerve fibers to consonant-vowel syllables. , 1981, The Journal of the Acoustical Society of America.

[13] Yoshua Bengio,et al. Data-Driven Execution of Multi-Layered Networks for Automatic Speech Recognition , 1988, AAAI.

[14] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[15] B. Delgutte. Representation of speech-like sounds in the discharge patterns of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.