Wavelet fuzzy LVQ based speaker verification system

This work evaluates the performance of speaker verification system based on Wavelet based Fuzzy Learning Vector Quantization (WLVQ) algorithm. The parameters of Gaussian mixture model (GMM) are designed using this proposed algorithm. Mel Frequency Cepstral Coefficients (MFCC) are extracted from the speech data and vector quantized through Wavelet based FLVQ algorithm. This algorithm develops a multi resolution codebook by updating both winning and nonwinning prototypes through an unsupervised learning process. This codebook is used as mean vector of GMM. The other two parameters, weight and covariance are determined from the clusters formed by the WLVQ algorithm. The multi resolution property of wavelet transform and ability of FLVQ in regulating the competition between prototypes during learning are combined in this algorithm to develop an efficient codebook for GMM. Because of iterative nature of Expectation Maximization (EM) algorithm, the applicability of alternative training algorithms is worth investigation. In this work, the performance of speaker verification system using GMM trained by LVQ, FLVQ and WLVQ algorithms are evaluated and compared with EM algorithm. FLVQ and WLVQ based training algorithms for modeling speakers using GMM yields better performance than EM based GMM.

[1]  Vikram M. Gadre,et al.  Function learning using wavelet neural networks , 2000, Proceedings of IEEE International Conference on Industrial Technology 2000 (IEEE Cat. No.00TH8482).

[2]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[3]  Nicholaos Zervos,et al.  Image compression based on fuzzy algorithms for learning vector quantization and wavelet image decomposition , 1998, IEEE Trans. Image Process..

[4]  Nicolaos B. Karayiannis,et al.  Fuzzy algorithms for learning vector quantization , 1996, IEEE Trans. Neural Networks.

[5]  Günther Palm,et al.  A new codebook training algorithm for VQ-based speaker recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Daniel W. C. Ho,et al.  Fuzzy wavelet networks for function learning , 2001, IEEE Trans. Fuzzy Syst..

[7]  Muzhir Shaban Al-Ani,et al.  Speaker Identification: A Hybrid Approach Using Neural Networks and Wavelet Transform , 2007 .

[8]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[9]  James C. Bezdek,et al.  Fuzzy Kohonen clustering networks , 1994, Pattern Recognit..

[10]  Qinghua Zhang,et al.  Wavelet networks , 1992, IEEE Trans. Neural Networks.

[11]  Saeed Bagheri Shouraki,et al.  A novel fuzzy approach to recognition of online Persian handwriting , 2005, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05).

[12]  Bernard Delyon,et al.  Accuracy analysis for wavelet approximations , 1995, IEEE Trans. Neural Networks.

[13]  Vijendra Raj Apsingekar,et al.  Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  James C. Bezdek,et al.  An integrated approach to fuzzy learning vector quantization and fuzzy c-means clustering , 1997, IEEE Trans. Fuzzy Syst..

[15]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[16]  Jun Zhang,et al.  Wavelet neural networks for function learning , 1995, IEEE Trans. Signal Process..

[17]  H. S. Jayanna,et al.  An experimental comparison of modelling techniques for speaker recognition under limited data condition , 2009 .

[18]  Yagyensh C. Pati,et al.  Analysis and synthesis of feedforward neural networks using discrete affine wavelet transformations , 1993, IEEE Trans. Neural Networks.