HMM-based speech recognition using adaptive framing

A common approach to mapping a signal to discrete events is to define a set of symbols that correspond to useful acoustic features of the signal over a short, constant time interval. This paper proposes a hidden Markov model (HMM) based speech recognition method that uses cepstrum features of the signal computed over an adaptive time interval. First, the pitch period is detected with the dyadic wavelet transform, and the voiced speech signal is divided into frames according to the detected period. The system then performs HMM-based speech recognition, using the cepstrum features to classify the speech signals. Two speech recognition systems were developed, one based on constant-time framing and the other on adaptive framing. A comparison of the results shows that the adaptive framing method performs better in both data distribution and recognition rate.
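The adaptive framing idea can be illustrated with a minimal sketch. It is not the authors' implementation: the pitch detector here is a plain autocorrelation search standing in for the dyadic wavelet transform, the signal is assumed to have a single constant pitch period, and the function names, the `periods_per_frame` parameter, the log floor, and the synthetic test signal are all assumptions made for illustration.

```python
import cmath
import math


def estimate_pitch_period(x, min_lag, max_lag):
    """Return the lag (in samples) with the largest autocorrelation.

    Assumption: a simple stand-in for the dyadic-wavelet pitch detector
    described in the abstract, not the method used in the paper.
    """
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(x[n] * x[n + lag] for n in range(len(x) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def adaptive_frames(x, period, periods_per_frame=2):
    """Cut x into non-overlapping frames that each span whole pitch periods."""
    size = period * periods_per_frame
    return [x[i:i + size] for i in range(0, len(x) - size + 1, size)]


def real_cepstrum(frame, n_coeffs):
    """First n_coeffs real-cepstrum coefficients, using a naive O(N^2) DFT."""
    n = len(frame)
    log_mag = []
    for k in range(n):
        s = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        log_mag.append(math.log(abs(s) + 1e-6))  # floor avoids log(0)
    # The log-magnitude spectrum of a real frame is real and symmetric,
    # so its inverse DFT reduces to a cosine sum.
    return [
        sum(log_mag[k] * math.cos(2 * math.pi * k * q / n) for k in range(n)) / n
        for q in range(n_coeffs)
    ]


# Hypothetical voiced-like test signal: fundamental plus one harmonic.
signal = [
    math.sin(2 * math.pi * n / 40) + 0.5 * math.sin(4 * math.pi * n / 40)
    for n in range(400)
]
period = estimate_pitch_period(signal, min_lag=20, max_lag=100)
frames = adaptive_frames(signal, period)
features = [real_cepstrum(f, 12) for f in frames]
```

Each entry of `features` is one observation vector of the kind an HMM classifier would consume. A constant-time baseline would call `adaptive_frames` with a fixed frame size regardless of the detected period, which is the comparison the abstract describes.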
