Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition

This paper applied speech recognition and RFID technologies to develop an omni-directional mobile robot into a robot with voice control and guide introduction functions. For speech recognition, the speech signals were captured by short-time processing. The speaker first recorded the isolated words for the robot to create speech database of specific speakers. After the speech pre-processing of this speech database, the feature parameters of cepstrum and delta-cepstrum were obtained using linear predictive coefficient (LPC). Then, the Hidden Markov Model (HMM) was used for model training of the speech database, and the Viterbi algorithm was used to find an optimal state sequence as the reference sample for speech recognition. The trained reference model was put into the industrial computer on the robot platform, and the user entered the isolated words to be tested. After processing by the same reference model and comparing with previous reference model, the path of the maximum total probability in various models found using the Viterbi algorithm in the recognition was the recognition result. Finally, the speech recognition and RFID systems were achieved in an actual environment to prove its feasibility and stability, and implemented into the omni-directional mobile robot.

[1]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[2]  Chia-Feng Juang,et al.  Hierarchical Singleton-Type Recurrent Neural Fuzzy Networks for Noisy Speech Recognition , 2007, IEEE Transactions on Neural Networks.

[3]  Chanwoo Kim,et al.  Robust DTW-based recognition algorithm for hand-held consumer devices , 2005, IEEE Transactions on Consumer Electronics.

[4]  Naoya Wada,et al.  Scalable architecture for word HMM-based speech recognition and VLSI implementation in complete system , 2006, IEEE Transactions on Circuits and Systems I: Regular Papers.

[5]  Yuan Yujin,et al.  Research of speaker recognition based on combination of LPCC and MFCC , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[6]  Soo-Young Lee,et al.  Unified Training of Feature Extractor and HMM Classifier for Speech Recognition , 2012, IEEE Signal Processing Letters.

[7]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[8]  Jialong He,et al.  On the use of orthogonal GMM in speaker recognition , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[9]  Yunghsiang Sam Han,et al.  Robust Decoding for Convolutionally Coded Systems Impaired by Memoryless Impulsive Noise , 2013, IEEE Trans. Commun..