A flexible linear array oriented VLSI processor for continuous speech recognition

A PE (Processing Element) LSI for a DTW (Dynamic Time Warping) linear array processor has been designed. In designing this LSI, major effort has been focused on achieving regular data-flow among adjacent PEs maintaining pipelined operation in the array. A three data channel structure, a triple buffer structure and sophisticated control schemes make it possible for the designed LSI to carry out MIMD (Multiple Instruction and Multiple Data streams) and continuous pipelined DTW processing in sync with regular pattern input. Due to the high speed real time operation, and the versatile function of this PE-LSI, a high performance linear array processor can be constructed using a small number of PEs. The high speed operation of LSI is pursuited to achieve real time processing. Continuous speech recognition with an approximate vocabulary of 1000 words can be achieved using only 20 to 30 of these PE-LSIs.

[1]  H. Kitazawa,et al.  An Integrated Design Automation System for VLSI Circuits , 1985, IEEE Design & Test of Computers.

[2]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[3]  Atsushi Iwata,et al.  A ring array processor architecture for highly parallel dynamic time warping , 1986, IEEE Trans. Acoust. Speech Signal Process..

[4]  B. Ackland,et al.  A systolic processing element for speech recognition , 1982, 1982 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.

[5]  R. Nakatsu,et al.  Japanese text input system based on continuous speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Bryan D. Ackland,et al.  Array configurations for dynamic time warping , 1984 .