Neuro-Inspired Speech Recognition Based on Reservoir Computing

This chapter investigates the potential of recurrent spiking neurons for classification problems and presents a hybrid approach based on the paradigm of Reservoir Computing. Practical applications of recurrent spiking neurons have been limited by the lack of suitable learning algorithms; most previous work in the literature has focused on feedforward networks, whose computation is comparatively easy to analyse. Such networks have been reported in detail in (Haykin, 1999; Pavlidis et al., 2005; Bohte et al., 2000). Recently, a strategy proposed by Maass (Maass et al., 2002) and Jaeger (Jaeger, 2001) has offered a way around the burden of training recurrent networks: instead of training the whole recurrent network, only the output layer (known as the readout neuron) is trained.

This chapter takes recurrent spiking neurons as the basic building blocks of the liquid, or so-called reservoir. These recurrent networks are termed microcircuits and are viewed as the basic computational units of cortical computation (Maass et al., 2002). Microcircuits are arranged in columns that are linked with neighbouring columns in cortical areas; the columns read out information from each other and thus serve as both reservoir and readout. The reservoir is modelled as a dynamical system perturbed by the input stream, and only the readouts are trained to extract information from it. The basic motivation for investigating recurrent neurons is their potential to memorise relevant events over short periods of time (Maass et al., 2002). The use of feedback enables recurrent networks to acquire a state representation, which makes them suitable for temporal applications such as speech recognition. Solving such problems with recurrent networks is nevertheless challenging because of the burden of training; the paradigm of Reservoir Computing, also referred to as liquid computing, relaxes this burden because only the output layer is trained instead of the whole network.

The work presented in this chapter analyses the theoretical framework of Reservoir Computing and demonstrates results, in terms of classification accuracy, on a speech recognition task. The design space for this paradigm is split into three domains: front end, reservoir, and back end. This work contributes to the identification of suitable front-end and back-end processing techniques, along with stable reservoir dynamics, which provides a reliable framework for classification problems. The chapter suggests a simple, efficient and biologically plausible approach based on a hybrid implementation of recurrent spiking neurons and a classical feedforward readout.
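To make the train-only-the-readout idea concrete, the sketch below implements a minimal rate-based echo state network with a ridge-regression readout over the final reservoir state. This is a simplified stand-in rather than the chapter's method: the chapter's reservoir is built from recurrent spiking neurons, and all names here (run_reservoir, train_readout, the 13-dimensional input standing in for cepstral front-end features, the 10 output classes) are hypothetical choices for illustration.

```python
import numpy as np

# Minimal echo-state-style sketch (hypothetical): the input and recurrent
# weights are fixed at random; only the linear readout W_out is trained.
rng = np.random.default_rng(0)

n_in, n_res, n_out = 13, 200, 10   # e.g. 13 cepstral features, 10 word classes

W_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
W_res = rng.uniform(-0.5, 0.5, (n_res, n_res))
# Rescale to spectral radius < 1 so the reservoir dynamics stay stable.
W_res *= 0.9 / np.max(np.abs(np.linalg.eigvals(W_res)))

def run_reservoir(u_seq):
    """Drive the reservoir with an input sequence; return the final state."""
    x = np.zeros(n_res)
    for u in u_seq:
        x = np.tanh(W_in @ u + W_res @ x)   # dynamics perturbed by the input stream
    return x

def train_readout(sequences, labels, ridge=1e-3):
    """Fit the linear readout by ridge regression on collected reservoir states."""
    X = np.stack([run_reservoir(s) for s in sequences])   # (n_samples, n_res)
    Y = np.eye(n_out)[labels]                             # one-hot targets
    return np.linalg.solve(X.T @ X + ridge * np.eye(n_res), X.T @ Y)

def classify(u_seq, W_out):
    return int(np.argmax(run_reservoir(u_seq) @ W_out))
```

The spectral-radius rescaling is one common heuristic for keeping the reservoir in a stable dynamical regime; a spiking reservoir would instead control stability through its connectivity and synaptic parameters, but the readout training step is the same in spirit.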

[1] V.P. Plagianakos et al., Spiking neural network training using evolutionary algorithms, 2005, Proceedings of the 2005 IEEE International Joint Conference on Neural Networks.

[2] L. Abbott et al., Synaptic plasticity: taming the beast, 2000, Nature Neuroscience.

[3] John G. Harris et al., Automatic speech recognition using a predictive echo state network classifier, 2007, Neural Networks.

[4] S. Kosslyn et al., Findings and current opinion in cognitive neuroscience, 1998.

[5] Henry Markram et al., Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations, 2002, Neural Computation.

[6] Harald Haas et al., Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication, 2004, Science.

[7] Soo-Hoon Kim et al., A study on the recognition of the isolated digits using recurrent neural predictive HMM, 1999, Proceedings of the IEEE Region 10 Conference (TENCON 99), 'Multimedia Technology for Asia-Pacific Information Infrastructure'.

[8] T. Harkany et al., Pyramidal cell communication within local networks in layer 2/3 of rat neocortex, 2003, The Journal of Physiology.

[9] Ronald J. Williams et al., A Learning Algorithm for Continually Running Fully Recurrent Neural Networks, 1989, Neural Computation.

[10] E. Callaway et al., Excitatory cortical neurons form fine-scale functional networks, 2005, Nature.

[11] Shyamala C. Sivakumar et al., Isolated digit recognition using a block diagonal recurrent neural network, 2000, Proceedings of the 2000 Canadian Conference on Electrical and Computer Engineering.

[12] Z. Zhao, Connectionist training of non-linear hidden Markov models for speech recognition, 1991, Proceedings of the 1991 IEEE International Joint Conference on Neural Networks.

[13] Herbert Jaeger, The "echo state" approach to analysing and training recurrent neural networks, 2001.

[14] Carl G. Looney, Pattern recognition using neural networks: theory and algorithms for engineers and scientists, 1997.

[15] H. Markram et al., Organizing principles for a diversity of GABAergic interneurons and synapses in the neocortex, 2000, Science.

[16] Thomas M. Cover, Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition, 1965, IEEE Transactions on Electronic Computers.

[17] Wolfgang Maass et al., Movement Generation with Circuits of Spiking Neurons, 2005, Neural Computation.

[18] John G. Harris et al., Spike-Based Feature Extraction for Noise Robust Speech Recognition Using Phase Synchrony Coding, 2007, IEEE International Symposium on Circuits and Systems.

[19] Valentino Braitenberg et al., Cortex: Statistics and Geometry of Neuronal Connectivity, 1998, Springer Berlin Heidelberg.

[20] Wolfgang Maass et al., Spiking neurons and the induction of finite state machines, 2002, Theoretical Computer Science.

[21] Chin Luh Tan et al., Digit Recognition Using Neural Networks, 2004.

[22] Henry Markram et al., Computational models for generic cortical microcircuits, 2004.

[23] Herbert Jaeger et al., Adaptive Nonlinear System Identification with Echo State Networks, 2002, NIPS.

[24] J.J. Hopfield et al., What is a moment? Transient synchrony as a collective mechanism for spatiotemporal integration, 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25] Ivan Soltesz et al., Structure of cortical microcircuit theory, 2005, The Journal of Physiology.

[26] T. Martin McGinnity et al., Neuro-inspired Speech Recognition with Recurrent Spiking Neurons, 2008, ICANN.

[27] B. Schrauwen et al., Isolated word recognition with the Liquid State Machine: a case study, 2005, Information Processing Letters.

[28] G.R. Doddington et al., Computers: Speech recognition: Turning theory to practice, 1981, IEEE Spectrum.

[29] J.J. Steil, Backpropagation-decorrelation: online recurrent learning with O(N) complexity, 2004, Proceedings of the 2004 IEEE International Joint Conference on Neural Networks.

[30] T.M. Mayhew, Anatomy of the Cortex: Statistics and Geometry, 1991.

[31] Benjamin Schrauwen et al., An experimental unification of reservoir computing methods, 2007, Neural Networks.

[32] Masaaki Honda et al., LPC speech coding based on variable-length segment quantization, 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[33] Amir F. Atiya et al., New results on recurrent network training: unifying the algorithms and accelerating convergence, 2000, IEEE Transactions on Neural Networks.

[34] S. Haykin, Neural Networks: A Comprehensive Foundation, 1994.

[35] Barak A. Pearlmutter, Gradient calculations for dynamic recurrent neural networks: a survey, 1995, IEEE Transactions on Neural Networks.

[36] J.J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, 1982, Proceedings of the National Academy of Sciences of the United States of America.

[37] Yoshua Bengio, Neural networks for speech and sequence recognition, 1996.

[38] Jeffrey L. Elman, Finding Structure in Time, 1990, Cognitive Science.