Paper information - Mel-frequency cepstral coefficient analysis in speech recognition

Speech recognition is a major topic in speech signal processing. It is considered one of the most popular and reliable biometric technologies used in automatic personal identification systems. Speech recognition systems are used in a variety of applications, such as multimedia browsing tools, access centres, security, and finance. They also allow people working in active environments to use a computer. Reliable, high-accuracy speech recognition requires simple and efficient representation methods. In this paper, zero-crossing extraction and energy-level detection are applied to the recorded speech signal to detect voiced and unvoiced regions. The detected voiced signals are then segmented into windows, and the MFCC method is applied to each segmented window. The extracted MFCC data are used as inputs for neural network training.
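The abstract names the processing steps but gives no parameters, so the following is only a minimal sketch of how zero-crossing and energy checks might gate MFCC extraction on a single window. All function names, the thresholds (`energy_min`, `zcr_max`), the Hamming window, the 20 mel filters, and the 12 coefficients are assumptions made for illustration and are not taken from the paper. The neural network training stage is not covered.

```python
import math

def frames(signal, size, hop):
    """Split a sequence into fixed-size frames; a trailing partial frame is dropped."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)

def is_voiced(frame, energy_min=0.01, zcr_max=0.25):
    # Thresholds are illustrative assumptions, not values from the paper.
    return short_time_energy(frame) > energy_min and zero_crossing_rate(frame) < zcr_max

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Hamming-window the frame, then return |DFT|^2 / N for bins 0..N/2 (naive DFT)."""
    n = len(frame)
    w = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
         for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(w[i] * math.cos(2 * math.pi * k * i / n) for i in range(n))
        im = -sum(w[i] * math.sin(2 * math.pi * k * i / n) for i in range(n))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    edges_hz = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * f / sample_rate) for f in edges_hz]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):   # rising slope
            filt[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):   # falling slope, peak of 1.0 at bin `mid`
            filt[k] = (hi - k) / (hi - mid)
        bank.append(filt)
    return bank

def mfcc(frame, sample_rate, n_filters=20, n_coeffs=12):
    """Log mel-filterbank energies followed by an unnormalised DCT-II."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # A small constant avoids log(0) for empty or silent bands.
    log_e = [math.log(sum(f * s for f, s in zip(filt, spec)) + 1e-10) for filt in bank]
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_e))
            for c in range(n_coeffs)]

# Usage: a synthetic 200 Hz tone stands in for one voiced window.
sample_rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 200 * t / sample_rate) for t in range(256)]
coeffs = mfcc(tone, sample_rate) if is_voiced(tone) else []
```

In practice an FFT routine would replace the naive DFT, and the thresholds would be tuned on recorded speech. The per-window coefficient vectors returned by `mfcc` are what would be passed on as network inputs.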
