The analysis and design of architecture systems for speech recognition on modern handheld-computing devices

Growing demand for high performance in embedded systems is creating new opportunities to use speech recognition systems. In several ways, the needs of embedded computing differ from those of more traditional general-purpose systems. Embedded systems have more stringent constraints on cost and power consumption that lead to design bottlenecks for many computationally-intensive applications. This paper characterizes the speech recognition process on handheld mobile devices and evaluates the use of modern architecture features and compiler techniques for performing real-time speech recognition. We evaluate the University of Colorado sonic speech recognition software on the IMPACT architectural simulator and compiler framework. Experimental results show that by using a strategic set of compiler optimization, a 500 MHz processor with moderate levels of instruction-level parallelism and cache resources can meet the real-time computing and power constraints of an advanced speech recognition application.

[1]  J.H.L. Hansen,et al.  Fast likelihood computation techniques in nearest-neighbor based search for continuous speech recognition , 2001, IEEE Signal Processing Letters.

[2]  Kadri Hacioglu,et al.  Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  Richard A. Uhlig,et al.  Performance of Natural I/O Applications , 2000 .

[4]  Margaret Martonosi,et al.  Wattch: a framework for architectural-level power analysis and optimizations , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[5]  Hsiao-Wuen Hon,et al.  An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..

[6]  Steve Young,et al.  Token passing: a simple conceptual model for connected speech recognition systems , 1989 .

[7]  Mosur Ravishankar,et al.  Efficient Algorithms for Speech Recognition. , 1996 .

[8]  A. Guyot,et al.  Low power CMOS digital design , 1998, Proceedings of the Tenth International Conference on Microelectronics (Cat. No.98EX186).

[9]  Doug Burger,et al.  A characterization of speech recognition on modern computer systems , 2001 .

[10]  R. Cole,et al.  THE OGI KIDS’ SPEECH CORPUS AND RECOGNIZERS , 2000 .

[11]  I Memory Multicomputers Center for Reliable and High-Performance Computing , 1994 .