Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

IIn this paper, we propose a novel architecture of wavelet network called Multi-input Multi-output Wavelet Network MIMOWN as a generalization of the old architecture of wavelet network. This newel prototype was applied to speech recognition application especially to model acoustic unit of speech. The originality of our work is the proposal of MIMOWN to model acoustic unit of speech. This approach was proposed to overcome limitation of old wavelet network model. The use of the multi-input multi-output architecture will allows training wavelet network on various examples of acoustic units. The development of robust systems for speech recognition is now one of the main issues of language processing. Many systems were based on concepts that were relatively close. However, despite those progresses, the performances of current systems remain much lower than the capacity of the human auditory system. In this respect, the prospects for improvement are strong. Economic issues related to speech recognition make this area a sector in constant evolution and pushes us to constantly imagine new approaches in order to be able to at least match or exceed the performance of our hearing. To solve some problems of modelling and recognition of speech, we suggest a new method based on wavelet networks (21) that exploit the intrinsic properties of the speech signal, while the statistical models are concerned with statistical properties of the speech signal (2). These models are similar to neural network for the structure and the training approach. But, training algorithms for wavelet network require a smaller number of iterations when compared with neural network. Wavelet network model, the single-input single-output wavelet network, was introduced, firstly, by Zhang and Benveniste in 1992 (8). This model was inspired from neural network architecture as a combination of neuronal contraption and wavelets as activation functions (20). Those models have supplied an access to all frequency of signal thanks to the use of wavelets in the hidden layer of each neurone. It has more advantages than common networks such as faster convergence, avoiding local minimum, easy decision and adaptation of structure (4). Despite the contribution of these models in different fields of pattern recognition (4), (5), (7), they remained limited in the field of modelling. These prototypes cannot instil entities at different occurrences. To overcome these limitations, we advanced a new model for the training of several instances of a single entity at the same time. These models are called multi- input multi-output wavelet network MIMOWN (23)(24).

[1]  S. Sitharama Iyengar,et al.  Foundations of Wavelet Networks and Applications , 2002 .

[2]  Jen-Tzung Chien,et al.  Joint acoustic and language modeling for speech recognition , 2010, Speech Commun..

[3]  C. Ben Amar,et al.  Training of the Beta wavelet networks by the frames theory: Application to face recognition , 2008, 2008 First Workshops on Image Processing Theory, Tools and Applications.

[4]  Qinghua Zhang,et al.  Wavelet networks , 1992, IEEE Trans. Neural Networks.

[5]  Chokri Ben Amar,et al.  FBWN: An architecture of fast beta wavelet networks for image classification , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[6]  Chokri Ben Amar,et al.  Wavelet Networks Approach for Image Compression , 2007 .

[7]  Zhiguo Zhang,et al.  Learning algorithm of wavelet network based on sampling theory , 2007, Neurocomputing.

[8]  Juan Manuel Gutiérrez,et al.  WAVELET NEURAL NETWORK AS A MULTIVARIATE CALIBRATION METHOD IN VOLTAMMETRIC ELECTRONIC TONGUES , 2009 .

[9]  Zhiguo Zhang Iterative algorithm of wavelet network learning from nonuniform data , 2009, Neurocomputing.

[10]  Harold H. Szu,et al.  Neural network adaptive wavelets for signal representation and classification , 1992 .

[11]  Chokri Ben Amar,et al.  Pyramidal Hybrid Approach: Wavelet Network with OLS Algorithm-Based Image Classification , 2011, Int. J. Wavelets Multiresolution Inf. Process..

[12]  Yasar Becerikli,et al.  Nonlinear System Modeling Using Wavelet Networks , 2005, ISNN.

[13]  Chokri Ben Amar,et al.  Wavelet network for recognition system of Arabic word , 2010, Int. J. Speech Technol..

[14]  John Daugman,et al.  Demodulation by Complex-Valued Wavelets for Stochastic Pattern Recognition , 2003, Int. J. Wavelets Multiresolution Inf. Process..

[15]  Chokri Ben Amar,et al.  Beta wavelets. Synthesis and application to lossy image compression , 2005, Adv. Eng. Softw..

[16]  Jianping Li,et al.  RESEARCH FOR MULTI-INPUT WAVELET NEURAL NETWORK , 2006 .

[17]  Chokri Ben Amar,et al.  Beta wavelet networks for function approximation , 2005 .

[18]  Chokri Ben Amar,et al.  Beta Wavelet Networks for Face Recognition , 2005, J. Decis. Syst..

[19]  Stephen A. Billings,et al.  A new class of wavelet networks for nonlinear system identification , 2005, IEEE Transactions on Neural Networks.

[20]  Jing Zhao,et al.  Feedforward Wavelet Neural Network and Multi-variable Functional Approximation , 2004, CIS.

[21]  Qinghua Zhang,et al.  Using wavelet network in nonparametric estimation , 1997, IEEE Trans. Neural Networks.

[22]  Mokhtar Sellami,et al.  Combination of vector quantization and hidden Markov models for Arabic speech recognition , 2001, Proceedings ACS/IEEE International Conference on Computer Systems and Applications.

[23]  Chokri Ben Amar,et al.  Intelligent Approach to Train Wavelet Networks for Recognition System of Arabic Words , 2010, KDIR.