Wavelet Formants Speaker Identification Based System via Neural Network

In this paper Discrete wavelet Transform with logarithmic Power Spectrum Density (PSD) are combined for speaker formants extraction, to be used as evident classification features. For classification, Feed Forward Back Propagation Neural Network FFBNN method is proposed. The Discrete Wavelet formants Neural Network DWFNNT system works with excellent capability of features tracking even with 0dB SNR. Text - dependant system is used, so that the system can be applied in password or PINs identification in any security system. The proposed system is compared with K-means algorithm based clustering method. The results show excellent performance with 93.21% Recognition Rate (RR).

[1]  Keikichi Hirose,et al.  Tone nucleus modeling for Chinese lexical tone recognition , 2004, Speech Commun..

[2]  Haizhou Li,et al.  Normalization of the Speech Modulation Spectra for Robust Speech Recognition , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Exhibitor,et al.  International Conference On Acoustics, Speech, And Signal Processing , 1993, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Goutam Saha,et al.  Improved Text-Independent Speaker Identification using Fused MFCC and IMFCC Feature Sets based on Gaussian Filter , 2009 .

[5]  Oscal T.-C. Chen,et al.  A text-independent speaker identification system using PARCOR and AR model , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[6]  Mohamad Adnan Al-Alaoui,et al.  Application of constrained generalized inverse to pattern classification , 1976, Pattern Recognit..

[7]  Prashant Parikh A Theory of Communication , 2010 .

[8]  B. Hofmann-Wellenhof,et al.  Introduction to spectral analysis , 1986 .

[9]  Vijendra Raj Apsingekar,et al.  Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Conrad Sanderson,et al.  Automatic Person Verification Using Speech and Face Information , 2003 .

[11]  Baxter F. Womack,et al.  An Adaptive Pattern Classification System , 1966, IEEE Trans. Syst. Sci. Cybern..

[12]  Marc El-Bèze,et al.  A Clustering Method for Information Retrieval , 1999 .

[13]  Elisabeth Zetterholm PhD Abstract. Voice Imitation. A phonetic study of perceptual illusions and acoustic success , 2003 .

[14]  Ujjwal Maulik,et al.  An evolutionary technique based on K-Means algorithm for optimal clustering in RN , 2002, Inf. Sci..

[15]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[16]  Sadaoki Furui,et al.  Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  A. Grossmann,et al.  Cycle-octave and related transforms in seismic signal analysis , 1984 .

[18]  Mohamad Adnan Al-Alaoui,et al.  A New Weighted Generalized Inverse Algorithm for Pattern Recognition , 1977, IEEE Transactions on Computers.

[19]  Sadaoki Furui,et al.  Concatenated phoneme models for text-variable speaker recognition , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[20]  Tomoko Matsui,et al.  Distance measures for text-independent speaker recognition based on MAR model , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[21]  George R. Doddington,et al.  Speaker verification over long distance telephone lines , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[22]  Driss Aboutajdine,et al.  Organizing Gaussian mixture models into a tree for scaling up speaker retrieval , 2007, Pattern Recognit. Lett..

[23]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[24]  V. Kroupa,et al.  Digital spectral analysis , 1983, Proceedings of the IEEE.

[25]  M. Adrian Al-Alaoui,et al.  Some applications of generalized inverse to pattern recognition (Ph.D. Thesis abstr.) , 1976, IEEE Trans. Inf. Theory.

[26]  William G. Wee,et al.  Generalized Inverse Approach to Adaptive Multiclass Pattern Classification , 1968, IEEE Transactions on Computers.

[28]  Goutam Saha,et al.  Improved Closed Set Text-Independent Speaker Identification by Combining MFCC with Evidence from Flipped Filter Banks , 2008 .

[29]  Dante Augusto Couto Barone,et al.  Fractal dimension applied to speaker identification , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[30]  Khaled Daqrouq,et al.  Discrete Wavelet Transform with Enhancement Filter for ECG Signal , 2010 .

[31]  Larry P. Heck,et al.  A model-based transformational approach to robust speaker recognition , 2000, INTERSPEECH.

[32]  Lawrence G. Bahler,et al.  Speaker verification using randomized phrase prompting , 1991, Digit. Signal Process..

[33]  C. K. Yuen,et al.  Digital spectral analysis , 1979 .