Development of Isolated Word Speech Recognition System

The isolated word speech recognition system based on dynamic time warping (DTW) has been developed. Speaker adaptation is performed using speaker recognition techniques. Vector quantization is used to create reference templates for speaker recognition. Linear predictive coding (LPC) parameters are used as features for recognition. Performance is evaluated using 12 words of Lithuanian language pronounced ten times by ten speakers.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  古井 貞煕,et al.  Digital speech processing, synthesis, and recognition , 1989 .

[3]  Sadaoki Furui,et al.  Digital Speech Processing, Synthesis, and Recognition , 1989 .

[4]  Antanas Lipeika,et al.  Speaker identification methods based on pseudostationary segments of voiced sounds , 1996 .

[5]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Jean-Claude Junqua,et al.  Robustness in Automatic Speech Recognition: Fundamentals and Applications , 1995 .

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[9]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[10]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[11]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[12]  Antanas Lipeika,et al.  The use of pseudostationary segments for speaker identification , 1993, EUROSPEECH.

[13]  Kuldip K. Paliwal,et al.  Automatic Speech and Speaker Recognition , 1996 .

[14]  A. Gray,et al.  Distortion performance of vector quantization for LPC voice coding , 1982 .

[15]  C. D. Forgie,et al.  Automatic Recognition of Spoken Digits , 1958 .

[16]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[17]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[18]  Aaron E. Rosenberg,et al.  Speaker independent recognition of isolated words using clustering techniques , 1979, ICASSP.