Speaker Independent Speech Recognition using MFCC with Cubic-Log Compression and VQ Analysis

Speech processing has emerged as one of the important application areas of digital signal processing. Research fields within speech processing include speech recognition, speaker identification, speech synthesis, and speech coding. The objective of speaker-independent speech recognition is to extract, characterize, and recognize the information in a speech signal, with the aim of building a recognition system that does not depend on the speaker. The extracted information can then be used to control and operate electronic devices and machinery efficiently by voice. Feature extraction is the first step in speech recognition, and researchers have proposed many algorithms for it. In this work, cubic-log compression within Mel-Frequency Cepstral Coefficient (MFCC) feature extraction is used to extract features from the speech signal in order to design a speaker-independent speech recognition system. The extracted features are used to train and test the system with a Vector Quantization (VQ) approach.
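
The Vector Quantization stage mentioned above can be illustrated with a minimal sketch. It assumes one k-means codebook per vocabulary word and recognition by minimum average distortion, which is a common VQ formulation and not necessarily the exact procedure of this work. The function names, the codebook size, and the synthetic 12-dimensional vectors (standing in for MFCC features) are all hypothetical, and the cubic-log compression step is not reproduced here.

```python
import numpy as np

def pairwise_sq_dist(features, codebook):
    # Squared Euclidean distance from every feature vector to every codeword.
    diff = features[:, None, :] - codebook[None, :, :]
    return (diff ** 2).sum(axis=2)

def train_codebook(features, size, iters=20, seed=0):
    # Plain k-means: an assumed stand-in for the codebook training step.
    rng = np.random.default_rng(seed)
    start = rng.choice(len(features), size, replace=False)
    codebook = features[start].copy()
    for _ in range(iters):
        nearest = pairwise_sq_dist(features, codebook).argmin(axis=1)
        for k in range(size):
            members = features[nearest == k]
            if len(members):  # leave an empty cell's codeword unchanged
                codebook[k] = members.mean(axis=0)
    return codebook

def distortion(features, codebook):
    # Average distance of each vector to its nearest codeword.
    return pairwise_sq_dist(features, codebook).min(axis=1).mean()

def recognize(features, codebooks):
    # Pick the word whose codebook quantizes the utterance with least distortion.
    return min(codebooks, key=lambda word: distortion(features, codebooks[word]))

# Synthetic demo: two "words" whose feature vectors cluster around different means.
rng = np.random.default_rng(1)
train_yes = rng.normal(0.0, 1.0, size=(200, 12))
train_no = rng.normal(5.0, 1.0, size=(200, 12))
codebooks = {
    "yes": train_codebook(train_yes, size=8),
    "no": train_codebook(train_no, size=8),
}
test_utterance = rng.normal(5.0, 1.0, size=(50, 12))
predicted = recognize(test_utterance, codebooks)
```

Here `predicted` holds the label of the best-matching codebook. In a real system the synthetic arrays would be replaced by per-frame feature vectors pooled over several speakers, which is what makes the codebooks speaker independent.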
