Multilingual speaker recognition using ANFIS

Feature-based recognition systems have long been an area of intense research. Attempts to build a reliable, robust and sufficiently efficient recognition system have drawn on features from several sources, including text and images. Speech has also been used as a source for such systems. However, variations caused by differences in individual speaker characteristics, changes in mood and intermingled noise make such a system very difficult to realize. This paper proposes a recognition system that identifies the speaker, the language and the words spoken, using the Adaptive Neuro-Fuzzy Inference System (ANFIS) paradigm. First, the sampling frequency and the speech features are extracted from the speech database to form speech feature vectors. The features used are linear predictive coding coefficients (LPC), LPC cepstral coefficients (LPCC), reflection coefficients (RC), log area ratios (LAR), line spectral frequencies (LSF) and arcsine coefficients (ARCSIN). The speech database is prepared from 25 speakers, both male and female. Five speaking texts in different languages, all having the same meaning, are used to obtain the best speaker identification accuracy. The languages spoken are English, Hindi, Punjabi, Sanskrit and Telugu. The feature vectors thus prepared are fed to an ANFIS for speaker, language and word recognition. The experimental results indicate that the system performs the recognition tasks involved efficiently and successfully.
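As an illustration of how several of the feature families named in the abstract relate to one another, the following is a minimal sketch, not the authors' implementation. It assumes the autocorrelation method with a Levinson-Durbin recursion on a single Hamming-windowed frame; the function names, the model order of 12 and the sign conventions are assumptions chosen for the example. LSF extraction and the ANFIS classifier are omitted.

```python
import numpy as np

def autocorrelation(frame, order):
    """Autocorrelation lags r[0..order] of one frame (unnormalized)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    return np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])

def levinson_durbin(r, order):
    """Solve the LPC normal equations.

    Returns (a, k, err) where A(z) = 1 + sum_j a[j] z^-j is the prediction
    error filter, k holds the reflection coefficients, and err is the final
    prediction error energy.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    k = np.zeros(order)
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        ki = -acc / err
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + ki * prev[i - j]
        a[i] = ki
        k[i - 1] = ki
        err *= 1.0 - ki * ki
    return a, k, err

def lpc_to_cepstrum(a, n_ceps):
    """LPC cepstral coefficients c[1..n_ceps] of the all-pole model 1/A(z)."""
    p = len(a) - 1
    c = np.zeros(n_ceps + 1)
    for n in range(1, n_ceps + 1):
        acc = sum(m * c[m] * a[n - m] for m in range(max(1, n - p), n))
        c[n] = -(a[n] if n <= p else 0.0) - acc / n
    return c[1:]

def frame_features(frame, order=12):
    """Return LPC, LPCC, RC, LAR and ARCSIN features for one frame."""
    windowed = np.asarray(frame, dtype=float) * np.hamming(len(frame))
    r = autocorrelation(windowed, order)
    a, k, _ = levinson_durbin(r, order)
    return {
        "lpc": a[1:],
        "lpcc": lpc_to_cepstrum(a, order),
        "rc": k,
        # Sign convention for LAR varies between texts; this is one common form.
        "lar": np.log((1.0 - k) / (1.0 + k)),
        "arcsin": np.arcsin(k),
    }

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = frame_features(rng.standard_normal(400), order=12)
    vector = np.concatenate([feats[name] for name in ("lpc", "lpcc", "rc", "lar", "arcsin")])
    print(vector.shape)
```

Concatenating the per-frame features, as in the usage lines at the end, gives one possible form of the feature vector that a classifier could consume; how the paper actually assembles its vectors is not stated in the abstract.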
