Speaker-independent isolated-word recognition LSI
暂无分享,去创建一个
Describes the architecture of a newly designed LSI for speaker-independent speech recognition. The recognition algorithm used in this LSI is based on a vector quantization technique and a dynamic time-warping technique using multiple word templates. In order to efficiently execute the complicated recognition algorithm, this LSI has the following architectural features: (1) address-generator independent of the data-calculating circuit, (2) pipelined architecture (3) structure of separate data buses, (4) multiplexed data bus with timing distribution, (5) horizontal-type micro-program. This LSI can recognize up to 32 speaker-independent isolated words (or up to 512 speaker dependent isolated words) within 0.4 seconds after speech endpoint detection. An average of recognition rate for Japanese 10-digit words is 97%. By using this LSI, a speech recognition system can be easily constructed on single board.<<ETX>>
[1] R. Nakatsu,et al. Speaker-independent isolated word recognition for telephone voice using phoneme-like templates , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[2] Y. Suzuki. Design of an efficient dynamic time warping LSI , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Kiyohiro Shikano,et al. Isolated word recognition using phoneme-like templates , 1983, ICASSP.