论文信息 - On the application of reservoir computing networks for noisy image recognition

On the application of reservoir computing networks for noisy image recognition

Abstract Reservoir Computing Networks (RCNs) are a special type of single layer recurrent neural networks, in which the input and the recurrent connections are randomly generated and only the output weights are trained. Besides the ability to process temporal information, the key points of RCN are easy training and robustness against noise. Recently, we introduced a simple strategy to tune the parameters of RCNs. Evaluation in the domain of noise robust speech recognition proved that this method was effective. The aim of this work is to extend that study to the field of image processing, by showing that the proposed parameter tuning procedure is equally valid in the field of image processing and conforming that RCNs are apt at temporal modeling and are robust with respect to noise. In particular, we investigate the potential of RCNs in achieving competitive performance on the well-known MNIST dataset by following the aforementioned parameter optimizing strategy. Moreover, we achieve good noise robust recognition by utilizing such a network to denoise images and supplying them to a recognizer that is solely trained on clean images. The experiments demonstrate that the proposed RCN-based handwritten digit recognizer achieves an error rate of 0.81 percent on the clean test data of the MNIST benchmark and that the proposed RCN-based denoiser can effectively reduce the error rate on the various types of noise.

[1] David Zhang,et al. LSDT: Latent Sparse Domain Transfer Learning for Visual Adaptation , 2016, IEEE Transactions on Image Processing.

[2] Jürgen Schmidhuber,et al. Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Luca Maria Gambardella,et al. Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Flexible, High Performance Convolutional Neural Networks for Image Classification , 2022 .

[4] Honglak Lee,et al. An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[5] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[6] Luca Maria Gambardella,et al. Handwritten Digit Recognition with a Committee of Deep Neural Nets on GPUs , 2011, ArXiv.

[7] Marc'Aurelio Ranzato,et al. Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[8] Narendra Ahuja,et al. Cresceptron: a self-organizing neural network which grows adaptively , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[9] Yoshua Bengio,et al. ReNet: A Recurrent Neural Network Based Alternative to Convolutional Networks , 2015, ArXiv.

[10] Jean-Pierre Martens,et al. Feature enhancement with a Reservoir-based Denoising Auto Encoder , 2013, IEEE International Symposium on Signal Processing and Information Technology.

[11] Guang-Bin Huang,et al. Extreme Learning Machine for Multilayer Perceptron , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[12] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.

[13] Herbert Jaeger,et al. The''echo state''approach to analysing and training recurrent neural networks , 2001 .

[14] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.

[15] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[16] Cheng Wu,et al. Semi-Supervised and Unsupervised Extreme Learning Machines , 2014, IEEE Transactions on Cybernetics.

[17] Wesley De Neve,et al. Towards using Reservoir Computing Networks for noise-robust image recognition , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[18] Thomas Brox,et al. Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[19] Alex Graves,et al. Grid Long Short-Term Memory , 2015, ICLR.

[20] Jean-Pierre Martens,et al. Robust continuous digit recognition using Reservoir Computing , 2015, Comput. Speech Lang..

[21] David Zhang,et al. Visual Understanding via Multi-Feature Shared Learning With Global Consistency , 2015, IEEE Transactions on Multimedia.

[22] Yoshua Bengio,et al. Unitary Evolution Recurrent Neural Networks , 2015, ICML.

[23] Nitish Srivastava,et al. Improving Neural Networks with Dropout , 2013 .

[24] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[26] Enhong Chen,et al. Image Denoising and Inpainting with Deep Neural Networks , 2012, NIPS.

[27] David Zhang,et al. Domain Adaptation Extreme Learning Machines for Drift Compensation in E-Nose Systems , 2015, IEEE Transactions on Instrumentation and Measurement.

[28] Honglak Lee,et al. Adaptive Multi-Column Deep Neural Networks with Application to Robust Image Denoising , 2013, NIPS.

[29] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[30] Chris Eliasmith,et al. Deep networks for robust visual recognition , 2010, ICML.

[31] Chee Kheong Siew,et al. Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[32] Hongming Zhou,et al. Optimization method based extreme learning machine for classification , 2010, Neurocomputing.

[33] Dong Yu,et al. Deep Convex Net: A Scalable Architecture for Speech Pattern Classification , 2011, INTERSPEECH.

[34] Jürgen Schmidhuber,et al. Deep learning in neural networks: An overview , 2014, Neural Networks.

[35] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..