MATLAB Based Feature Extraction Using Mel Frequency Cepstrum Coefficients for Automatic Speech Recognition
ISSN: 2278 – 7798. All Rights Reserved © 2014 IJSETR.
Abstract: A speech interface to computers is the next major step that technology needs to take for general users. Automatic speech recognition (ASR) will play an important role in bringing technology to the people. Speech recognition has numerous applications, including direct voice input in aircraft, data entry, speech-to-text processing, and voice user interfaces such as voice dialing. An ASR system can be divided into two parts: feature extraction and feature recognition. In this paper we present MATLAB-based feature extraction using Mel Frequency Cepstrum Coefficients (MFCC) for ASR. The MFCC algorithm makes use of a Mel-frequency filter bank along with several other signal processing operations. The matrix of MFCC features obtained from our implementation of the MFCC algorithm has as many rows as there are input frames, and it is used in the feature recognition stage.
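The pipeline summarized in the abstract can be illustrated with a minimal MATLAB sketch. This is not the authors' implementation: it follows a commonly used MFCC chain (pre-emphasis, framing, Hamming window, power spectrum, triangular Mel filter bank, logarithm, DCT), and every parameter value (sampling rate, frame length, hop, FFT size, number of filters and coefficients) as well as the synthetic test tone is an assumption chosen for illustration. It uses only base MATLAB functions, so the window and the DCT are written out by hand.

```matlab
% Minimal MFCC sketch in base MATLAB (no toolboxes).
% All parameter values below are illustrative assumptions, not from the paper.
fs       = 8000;                          % sampling rate in Hz (assumed)
x        = sin(2*pi*440*(0:fs-1)'/fs);    % 1 s synthetic tone standing in for speech
frameLen = 200;                           % 25 ms frame (assumed)
hop      = 80;                            % 10 ms frame shift (assumed)
nfft     = 256;                           % FFT length (assumed)
nFilt    = 20;                            % number of Mel filters (assumed)
nCeps    = 12;                            % coefficients kept per frame (assumed)

% 1. Pre-emphasis (a common optional step; coefficient 0.97 is assumed)
x = [x(1); x(2:end) - 0.97*x(1:end-1)];

% 2. Split into overlapping frames and apply a Hamming window
nFrames = 1 + floor((length(x) - frameLen)/hop);
n       = (0:frameLen-1)';
win     = 0.54 - 0.46*cos(2*pi*n/(frameLen-1));
frames  = zeros(frameLen, nFrames);
for k = 1:nFrames
    idx = (k-1)*hop + (1:frameLen);
    frames(:,k) = x(idx) .* win;
end

% 3. Power spectrum of each frame (fft works column-wise and zero-pads to nfft)
spec = abs(fft(frames, nfft)).^2;
spec = spec(1:nfft/2+1, :);               % keep non-negative frequencies

% 4. Triangular filter bank with edges equally spaced on the Mel scale
hz2mel = @(f) 2595*log10(1 + f/700);
mel2hz = @(m) 700*(10.^(m/2595) - 1);
edges  = mel2hz(linspace(0, hz2mel(fs/2), nFilt+2));   % edge frequencies in Hz
binHz  = (0:nfft/2) * fs/nfft;                         % frequency of each FFT bin
fbank  = zeros(nFilt, nfft/2+1);
for m = 1:nFilt
    up   = (binHz - edges(m))   / (edges(m+1) - edges(m));     % rising slope
    down = (edges(m+2) - binHz) / (edges(m+2) - edges(m+1));   % falling slope
    fbank(m,:) = max(0, min(up, down));
end

% 5. Log filter-bank energies (eps avoids log(0))
logE = log(fbank*spec + eps);             % nFilt x nFrames

% 6. DCT-II over the filter index; coefficients 1..nCeps (c0 is skipped)
[kk, mm] = ndgrid(1:nCeps, 1:nFilt);
dctMat   = cos(pi * kk .* (mm - 0.5) / nFilt);
mfcc     = (dctMat * logE)';              % one row per frame, one column per coefficient
```

With these assumed settings the result `mfcc` has one row per input frame, matching the matrix layout described in the abstract, and `size(mfcc)` gives the frame count and the number of coefficients kept.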