Speaker recognition and verification using artificial neural network

Speaker recognition is a biometrie technique which uses individual voice samples for recognition purpose. Speaker recognition is mainly divided into speaker identification and speaker verification. In this paper, a comparative study is made between various combinations of features for speaker identification. Mel frequency Cepstral Coefficient (MFCC) features are combined with spectral centroid and spectral subtraction and tested for improvement in efficiency. Feed forward artificial neural network is used as a classifier. System was tested for 30 speakers. For speaker identification, an average identification rate of 65.3% is achieved when MFCC is combined with centroid features and an identification rate of 60% is achieved when MFCC is combined with spectral subtraction. For speaker verification, an average verification rate of 65.7% is achieved when MFCC is combined with spectral subtraction and a verification rate of 75.3% is achieved when MFCC is used along with centroid.

[1]  R Togneri,et al.  An Overview of Speaker Identification: Accuracy and Robustness Issues , 2011, IEEE Circuits and Systems Magazine.

[2]  James R. Glass,et al.  Robust Speaker Recognition in Noisy Conditions , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Zhigang Cao,et al.  Improved MFCC-based feature for robust speaker identification , 2005 .

[4]  Kuldip K. Paliwal,et al.  USE OF VOICING AND PITCH INFORMATION FOR SPEAKER RECOGNITION , 2000 .

[5]  M. S. Sinith,et al.  A novel method for Text-Independent speaker identification using MFCC and GMM , 2010, 2010 International Conference on Audio, Language and Image Processing.

[6]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[7]  R.A. Goubran,et al.  Pitch-based feature extraction for audio classification , 2003, The 2nd IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, 2003. HAVE 2003. Proceedings..

[8]  B. Anuradha,et al.  Speaker identification and Spoken word recognition in noisy background using artificial neural networks , 2016, 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT).

[9]  Shaiful Jahari Hashim,et al.  Feature extraction using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification , 2014, 2014 IEEE Student Conference on Research and Development.

[10]  Mahendra Kumar,et al.  Comparative study of different classifiers based speaker recognition system using modified MFCC for noisy environment , 2015, 2015 International Conference on Green Computing and Internet of Things (ICGCIoT).

[11]  J.M. Naik,et al.  Speaker verification: a tutorial , 1990, IEEE Communications Magazine.

[12]  Héctor M. Pérez Meana,et al.  Speaker recognition using Mel frequency Cepstral Coefficients (MFCC) and Vector quantization (VQ) techniques , 2012, CONIELECOMP 2012, 22nd International Conference on Electrical Communications and Computers.

[13]  Goutam Saha,et al.  Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition , 2012, Speech Commun..

[14]  B. Atal Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.

[15]  Larry P. Heck,et al.  An adaptive speaker verification system with speaker dependent a priori decision thresholds , 2002, INTERSPEECH.