Architectural Design and Implementation of an FPGA Softcore Based Speech Recognition System

Despite the ever-increasing logic capacity of FPGAs, implementing a large-vocabulary speech recognition system on them runs into insufficient I/O bandwidth and internal memory capacity. This paper describes a speech recognition system architecture based on a softcore processor with hardware accelerators for the emission probability computation and the Viterbi beam search. The accelerator for emission probability computation is equipped with internal memory that effectively captures the access pattern of the acoustic model data, which depend on the language model. The optimal memory configuration is determined by the proposed data partitioning strategy. The system has been implemented on a Xilinx Virtex-4 FPGA with a MicroBlaze softcore processor and various peripherals. Experimental results show that the proposed architecture speeds up recognition by reducing the memory bandwidth requirement, so that the system can perform real-time recognition on the DARPA Resource Management task, a continuous speech recognition task with a vocabulary of about 1,000 words.
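To make the two accelerated kernels concrete, the following is a minimal software sketch of emission probability computation and Viterbi beam search. It is not the paper's hardware design. It assumes one diagonal-covariance Gaussian per state, a non-emitting start state, and at least one outgoing transition from every active state. The names `log_gaussian`, `viterbi_beam`, `states`, and `log_trans` are hypothetical and chosen only for illustration.

```python
import math


def log_gaussian(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def viterbi_beam(frames, states, log_trans, start, beam):
    """Return the best final state and its log score.

    states:    {name: (mean, var)} emission parameters (assumed layout)
    log_trans: {src: {dst: log transition probability}}
    start:     non-emitting entry state
    beam:      states scoring more than `beam` below the best are pruned
    """
    active = {start: 0.0}
    for x in frames:
        # Propagate each surviving path along its outgoing transitions,
        # keeping only the best incoming score per destination state.
        candidates = {}
        for src, score in active.items():
            for dst, lt in log_trans.get(src, {}).items():
                s = score + lt
                if s > candidates.get(dst, -math.inf):
                    candidates[dst] = s
        # Emission scores are evaluated only for reachable states, so the
        # set of model parameters touched per frame follows the search.
        scored = {
            s: v + log_gaussian(x, *states[s]) for s, v in candidates.items()
        }
        best = max(scored.values())
        active = {s: v for s, v in scored.items() if v >= best - beam}
    best_state = max(active, key=active.get)
    return best_state, active[best_state]


# Toy usage with two one-dimensional states (illustrative values only).
toy_states = {"a": ([0.0], [1.0]), "b": ([5.0], [1.0])}
toy_trans = {
    "start": {"a": math.log(0.5), "b": math.log(0.5)},
    "a": {"a": math.log(0.5), "b": math.log(0.5)},
    "b": {"b": 0.0},
}
toy_frames = [[0.1], [0.0], [5.1], [4.9]]
final_state, final_score = viterbi_beam(
    toy_frames, toy_states, toy_trans, "start", beam=10.0
)
```

In this sketch, only states that survive pruning have their Gaussian parameters read in each frame, which illustrates in software why the acoustic model access pattern depends on the search path.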
