Paper information - Open and extendable speech recognition application architecture for mobile environments

This paper describes a cloud-based speech recognition architecture primarily intended for mobile environments. The system consists of a speech recognition server and a client for the Android mobile operating system. It makes it possible to implement Android's Voice Input functionality for languages that are not supported by the default Google implementation. The architecture supports both large-vocabulary speech recognition and grammar-based recognition, where grammars can be written in JSGF or Grammatical Framework. The system is open source and easily extendable. We used the architecture to implement Estonian speech recognition for Android.

Kaarel Kaljurand | Tanel Alumäe