The use of wavelets in speaker feature tracking identification system using neural network

Continuous and Discrete Wavelet Transform (WT) are used to create text-dependent robust to noise speaker recognition system. In this paper we investigate the accuracy of identification the speaker identity in non- stationary signals. Three methods are used to extract the essential speaker features based on Continuous, Discrete Wavelet Transform and Power Spectrum Density (PSD). To have better identification rate, two types of Neural Networks (NNT) are studied: The first is Feed Forward Back Propagation Neural Network (FFBNN) and the second is perceptron. Up to 98.44% identification rate is achieved. The presented system depends on the multi-stage features extracting due to its better accuracy. The multistage features tracking based system shows good capability of features tracking for tested signals with SNR equals to -9 dB using Wavelet Transform, which is suitable for non-stationary signal.

[1]  Adel El-Shahat,et al.  Neural Unit for PM Synchronous Machine Performance Improvement Used for Renewable Energy , 2009 .

[2]  Mohamad Adnan Al-Alaoui,et al.  A New Weighted Generalized Inverse Algorithm for Pattern Recognition , 1977, IEEE Transactions on Computers.

[3]  Oscal T.-C. Chen,et al.  A text-independent speaker identification system using PARCOR and AR model , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[4]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[5]  M. Adrian Al-Alaoui,et al.  Some applications of generalized inverse to pattern recognition (Ph.D. Thesis abstr.) , 1976, IEEE Trans. Inf. Theory.

[6]  William G. Wee,et al.  Generalized Inverse Approach to Adaptive Multiclass Pattern Classification , 1968, IEEE Transactions on Computers.

[8]  Baxter F. Womack,et al.  An Adaptive Pattern Classification System , 1966, IEEE Trans. Syst. Sci. Cybern..

[9]  Prashant Parikh A Theory of Communication , 2010 .

[10]  Tomoko Matsui,et al.  Distance measures for text-independent speaker recognition based on MAR model , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  George R. Doddington,et al.  Speaker verification over long distance telephone lines , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[12]  Mohamad Adnan Al-Alaoui,et al.  Application of constrained generalized inverse to pattern classification , 1976, Pattern Recognit..

[13]  Khaled Daqrouq,et al.  Discrete Wavelet Transform with Enhancement Filter for ECG Signal , 2010 .

[14]  Dante Augusto Couto Barone,et al.  Fractal dimension applied to speaker identification , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[15]  Goutam Saha,et al.  Improved Closed Set Text-Independent Speaker Identification by Combining MFCC with Evidence from Flipped Filter Banks , 2008 .

[16]  Conrad Sanderson,et al.  Automatic Person Verification Using Speech and Face Information , 2003 .

[17]  Larry P. Heck,et al.  A model-based transformational approach to robust speaker recognition , 2000, INTERSPEECH.

[18]  Lawrence G. Bahler,et al.  Speaker verification using randomized phrase prompting , 1991, Digit. Signal Process..

[19]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[20]  Simon Lucey,et al.  Audio-visual Speech Processing , 2002 .

[21]  Elisabeth Zetterholm PhD Abstract. Voice Imitation. A phonetic study of perceptual illusions and acoustic success , 2003 .

[22]  Goutam Saha,et al.  Improved Text-Independent Speaker Identification using Fused MFCC and IMFCC Feature Sets based on Gaussian Filter , 2009 .

[23]  Sadaoki Furui,et al.  Concatenated phoneme models for text-variable speaker recognition , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Sadaoki Furui,et al.  Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  A. Grossmann,et al.  Cycle-octave and related transforms in seismic signal analysis , 1984 .