Speech Recognition System: A Review

able to control devices by voice has always intrigued mankind. Today after intense research, Speech Recognition System, have made a niche for themselves and can be seen in many walks of life. The accuracy of Speech Recognition Systems remains one of the most important research challenges e.g. noise, speaker variability, language variability, vocabulary size and domain. The design of speech recognition system requires careful attentions to the challenges such as various types of Speech Classes and Speech Representation, Speech Preprocessing stages, Feature Extraction techniques, Database and Performance evaluation. This paper presents the advances made as well as highlights the pressing problems for a speech recognition system. The paper also classifies the system into Front End and Back End for better understanding and representation of speech recognition system in each part.

[1]  Saudi Arabia,et al.  FAST ALGORITHM FOR NOISY SPEAKER RECOGNITION USING ANN , 2014 .

[2]  Anand singh,et al.  Database Development and Analysis of Spoken Hindi Hybrid Words Using Endpoint Detection , 2012 .

[3]  Andreas Spanias,et al.  Cepstrum-based pitch detection using a new statistical V/UV classification algorithm , 1999, IEEE Trans. Speech Audio Process..

[4]  Shaila D. Apte,et al.  Speech and Audio Processing , 2012 .

[5]  Chang D. Yoo,et al.  Wavelet speech enhancement based on voiced/unvoiced decision , 2003 .

[6]  Achyuta Nand Mishra,et al.  Robust Features for Connected Hindi Digits Recognition , 2011 .

[7]  Sadaoki Furui,et al.  Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[8]  Nidhi Srivastava Speech Recognition using Artificial Neural Network , 2014 .

[9]  K. Dutta,et al.  Dynamic segmentation of vocal extract for Assamese Speech to Text Conversion using RNN , 2012, 2012 2nd National Conference on Computational Intelligence and Signal Processing (CISP).

[10]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[11]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[12]  Douglas D. O'Shaughnessy,et al.  Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition , 1999, IEEE Trans. Speech Audio Process..

[13]  Lingyun Gu,et al.  A new robust algorithm for isolated word endpoint detection , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  G. Bachur,et al.  1 Separation of Voiced and Unvoiced using Zero crossing rate and Energy of the Speech Signal , 2008 .

[15]  Keikichi Hirose,et al.  A minimax search algorithm for robust continuous speech recognition , 2000, IEEE Trans. Speech Audio Process..

[16]  Qiru Zhou,et al.  Robust endpoint detection and energy normalization for real-time speech and speaker recognition , 2002, IEEE Trans. Speech Audio Process..

[17]  Jacob Benesty,et al.  Springer handbook of speech processing , 2007, Springer Handbooks.

[18]  M. J. Cheng,et al.  Comparative performance study of several pitch detection algorithms , 1975 .

[19]  Ibrahim Patel,et al.  Speech Recognition Using HMM with MFCC-An Analysis Using Frequency Specral Decomposion Technique , 2010 .

[20]  Vikash Kumar Singh,et al.  Broad Acoustic Classification of Spoken Hindi Hybrid Paired Words using Artificial Neural Networks , 2012 .

[21]  Manan Vyas A Gaussian Mixture Model Based Speech Recognition System Using Matlab , 2013 .

[22]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[23]  K. Dutta,et al.  Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application , 2012, 2012 International Conference on Communications, Devices and Intelligent Systems (CODIS).

[24]  Khurram Waheed,et al.  A robust algorithm for detecting speech segments using an entropic contrast , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[25]  Valeri Mladenov,et al.  Neural networks used for speech recognition , 2010 .

[26]  Lai-Wan Chan,et al.  Isolated word recognition using modular recurrent neural networks , 1998, Pattern Recognit..

[27]  Hazarathaiah Malepati,et al.  Speech and Audio Processing , 2010 .

[28]  N. N. Lokhande,et al.  Voice activity detection Algorithm for Speech Recognition Applications , 2012 .

[29]  Kuldip K. Paliwal Effect of preemphasis on vowel recognition performance , 1984, Speech Commun..