Paper information - ISOLATED SPEECH RECOGNITION USING MFCC AND DTW

ISOLATED SPEECH RECOGNITION USING MFCC AND DTW

This paper describes an approach to isolated speech recognition using Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). An experimental database of five speakers, each speaking 10 digits, was collected in an acoustically controlled room. MFCC features are extracted from the speech signal of each spoken word. To cope with different speaking speeds, DTW is used: it is an algorithm for measuring the similarity between two sequences that may vary in time or speed.
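The role of DTW described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `dtw_distance` and `recognize`, the Euclidean local distance, and the unconstrained three-way step pattern are all assumptions made for illustration, and the feature vectors here are toy values rather than real MFCC frames.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    a, b: lists of equal-dimension vectors (for example, one MFCC vector
    per frame). Euclidean local distance is an assumption of this sketch.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            # Allowed steps: stretch a, stretch b, or advance both.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def recognize(utterance, templates):
    """Return the label of the stored template closest to the utterance.

    templates: dict mapping a label to its reference feature sequence
    (a hypothetical structure, chosen for this example).
    """
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))
```

In this sketch, the same word spoken slowly (each frame repeated) aligns with its faster version at zero cost, which is the property that makes DTW suitable for matching a spoken digit against stored templates despite differences in speaking speed.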

Shivanker Dev Dhingra | Geeta Nijhawan | Poonam Pandit

[1] Tomi Kinnunen, et al. Real-time speaker identification, 2004, INTERSPEECH.

[2] D. A. van Leeuwen, et al. Speech and Audio Signal Processing, 2011.

[3] Simon King, et al. Speech and Audio Signal Processing, 2011.