Open and extendable speech recognition application architecture for mobile environments
暂无分享,去创建一个
This paper describes a cloud-based speech recognition architecture primarily intended for mobile environments. The system consists of a speech recognition server and a client for the Android mobile operating system. The system enables to implement Android’s Voice Input functionality for languages that are not supported by the default Google implementation. The architecture supports both large vocabulary speech recognition as well as grammar-based recognition, where grammars can be implemented in JSGF or Grammatical Framework. The system is open source and easily extendable. We used the architecture to implement Estonian speech recognition for Android.
[1] Francoise Beaufays,et al. “Your Word is my Command”: Google Search by Voice: A Case Study , 2010 .
[2] Einar Meister,et al. Strengthening the Estonian Language Technology , 2008, LREC.
[3] Tanel Alumäe,et al. TSAB - Web Interface for Transcribed Speech Collections , 2011, INTERSPEECH.
[4] Einar Meister,et al. Estonian Large Vocabulary Speech Recognition System for Radiology , 2010, Baltic HLT.