Speech recognition experiments with linear predication, bandpass filtering, and dynamic programming
暂无分享,去创建一个
Automatic speech recognition experiments are described in which several popular preprocessing and classification strategies are compared. Preprocessing is done either by linear predictive analysis or by bandpass filtering. The two approaches are shown to produce similar recognition scores. The classifier uses either linear time stretching or dynamic programming to achieve time alignment. It is shown that dynamic programming is of major importance for recognition of polysyllabic words. The speech is compressed into a quasi-phoneme character string or preserved uncompressed. Best results are obtained with uncompressed data, using nonlinear time registration for multisyllabic words.
[1] Hiroaki Sakoe,et al. A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .
[2] F. Itakura,et al. Minimum prediction residual principle applied to speech recognition , 1975 .
[3] J. Shearme,et al. Some experiments with a simple word recognition system , 1968 .
[4] George White. Speech recognition with character string encoding , 1972, CDC 1972.
[5] O. Fujimura,et al. Syllable as a unit of speech recognition , 1975 .