A review of speech recognition with Sphinx engine in language detection

Speech recognition is the process of the computer i dentifying human speech to generate a string of wor ds or commands. The output of speech recognition syste ms can be applied in various fields. Besides, there are many artificial intelligent techniques available fo r Automatic Speech Recognition (ASR) development, a d hybrid technology is one of it. The common hybrid t echnique in speech recognition is the combination o f Hidden Markov Models (HMMs) and Artificial Neural N etworks (ANNs). In this research, Sphinx approach is applied to integrate the advantage of t he sequential modeling structure and its pattern classification. Outcome from this paper will assist in next phase of the research which is focusing on building an Arab language speech recognizer by Sphi nx4 engine process approach.

[1]  Jonathan G. Fiscus,et al.  2000 NIST EVALUATION OF CONVERSATIONAL SPEECH RECOGNITION OVER THE TELEPHONE: ENGLISH AND MANDAR IN PERFORMANCE RESULTS , 2000 .

[2]  Steve Young A review of large-vocabulary continuous-speech , 1996 .

[3]  Jie Zhao,et al.  Implementation and analysis of speech recognition front-ends , 1999, Proceedings IEEE Southeastcon'99. Technology on the Brink of 2000 (Cat. No.99CH36300).

[4]  Alex Acero,et al.  Spoken Language Processing , 2001 .

[5]  Hsiao-Wuen Hon,et al.  An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..

[6]  Paul Lamere,et al.  Sphinx-4: a flexible open source framework for speech recognition , 2004 .

[7]  I.A. Maaly,et al.  Speech Recognition using Artificial Neural Networks , 2006, 2006 2nd International Conference on Information & Communication Technologies.

[8]  Kai-Fu Lee,et al.  Automatic Speech Recognition , 1989 .

[9]  Chee Peng Lim,et al.  Speech recognition using artificial neural networks , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.

[10]  Douglas D. O'Shaughnessy,et al.  Speech Recognition by Intelligent Machines , 2001 .

[11]  Shrikanth S. Narayanan,et al.  A review of ASR technologies for children's speech , 2009, WOCCI.

[12]  Lawrence R. Rabiner,et al.  Automatic Speech Recognition - A Brief History of the Technology Development , 2004 .

[13]  Steve Young,et al.  A review of large-vocabulary continuous-speech , 1996, IEEE Signal Process. Mag..

[14]  Jay G. Wilpon,et al.  Voice communication between humans and machines , 1994 .