A General Purpose Chipset For Speech Recognition

A high performance, flexible, low cost chipset for automatic speech recognition is presented which can be incorporated in a large range of consumer products. The chipset recognizes isolated words in speakerdependent mode. It implements a robust, modified, two stage DTW algorithm, achieving a recognition rate above 99%, when tested with a 10 words vocabulary in an office environment. This result is obtained using only two user trained templates per word. For enabling user-friendly man-machine interface, speech compression and synthesis are also performed by the chipset.

[1]  R. Haimi-Cohen,et al.  Dynamic Time Warping with Boundaries Constraint Relaxation , 1989, The Sixteenth Conference of Electrical and Electronics Engineers in Israel,.

[2]  J. Lynch,et al.  Speech/Silence segmentation for real-time coding via rule based adaptive endpoint detection , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[4]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .