Isolated and continuous bangla speech recognition: implementation, performance and application perspective
暂无分享,去创建一个
Research on automatic speech recognition has been approach progressively since 1930 and the major advances are since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov Model (HMM) and the stochastic language model (B. H. Juang, 2005). However, the existing reported research works on Bangla speech recognition didn’t yet incorporate the HMM technique and language model. This paper presents two different type of Bangla speech recognition from the implementation, performance and application perspective. We used HMM technique for pattern classification and also incorporate stochastic language model with the system. At the signal preprocessing level we perform adaptive noise elimination and end point detection. Spectral feature vectors such as Mel Frequency Cepstral Coefficients (MFCC) with the addition of first and second order coefficients are extracted from each speech wave signal. HMM is used for pattern classification. The system is implemented using the Cambridge Hidden Markov Modeling Toolkit (HTK) (S. Young, 2001-2005).
[1] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[2] 古井 貞煕,et al. Digital speech processing, synthesis, and recognition , 1989 .
[3] G. Clark,et al. Reference , 2008 .
[4] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[5] Steve Young,et al. The HTK book , 1995 .