Similarity Learning and Generalization with Limited Data: A Reservoir Computing Approach

We investigate how a machine learning architecture known as Reservoir Computing learns concepts such as “similar” and “different” and other relationships between image pairs, and generalizes these concepts to previously unseen classes of data. We present two Reservoir Computing architectures, which loosely resemble neural dynamics, and show that a Reservoir Computer (RC) trained to identify relationships between image pairs drawn from a subset of training classes generalizes the learned relationships to substantially different classes unseen during training. We demonstrate our results on the simple MNIST handwritten digit database as well as a database of depth maps of visual scenes in videos taken from a moving camera. We consider image pair relationships such as images from the same class; images from the same class with one image superposed with noise, rotated 90°, blurred, or scaled; and images from different classes. We observe that the reservoir acts as a nonlinear filter projecting the input into a higher-dimensional space in which the relationships are separable; i.e., the reservoir system state trajectories display different dynamical patterns that reflect the corresponding input pair relationships. Thus, as opposed to training in the entire high-dimensional reservoir space, the RC only needs to learn characteristic features of these dynamical patterns, allowing it to perform well with very few training examples compared with conventional feed-forward machine learning techniques such as deep learning. In generalization tasks, we observe that RCs perform significantly better than state-of-the-art, feed-forward, pair-based architectures such as convolutional and deep Siamese Neural Networks (SNNs). We also show that RCs can not only generalize relationships, but also generalize combinations of relationships, providing robust and effective image pair classification. Our work helps bridge the gap between explainable machine learning with small datasets and biologically inspired analogy-based learning, pointing to new directions in the investigation of learning processes.
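To make the mechanism concrete, the sketch below illustrates the general echo state network idea described above: a fixed random reservoir nonlinearly projects an image pair into a high-dimensional state space, and only a ridge-regression readout over the resulting trajectory features is trained to label the pair relationship. All names, dimensions, and the pairing scheme here are illustrative assumptions, not the authors' exact architecture or hyperparameters.

```python
# Minimal echo state network (ESN) sketch for image-pair relationship classification.
# Assumed setup: 28x28 image pairs (e.g., MNIST), a 500-node reservoir, and a
# linear readout trained by ridge regression on the final reservoir state.
import numpy as np

rng = np.random.default_rng(0)

N_RESERVOIR = 500          # reservoir size (assumed)
N_INPUT = 2 * 28 * 28      # a concatenated pair of 28x28 images (assumed)
SPECTRAL_RADIUS = 0.9      # scale recurrent weights to preserve the echo state property
LEAK = 0.3                 # leaky-integrator rate
RIDGE = 1e-4               # readout regularization strength

# Fixed random input and recurrent weights; only the readout W_out is trained.
W_in = rng.uniform(-0.5, 0.5, size=(N_RESERVOIR, N_INPUT))
W = rng.normal(size=(N_RESERVOIR, N_RESERVOIR))
W *= SPECTRAL_RADIUS / max(abs(np.linalg.eigvals(W)))

def reservoir_features(pair, n_steps=20):
    """Drive the reservoir with a flattened image pair; return the final state."""
    u = pair.reshape(-1)
    x = np.zeros(N_RESERVOIR)
    for _ in range(n_steps):
        x = (1 - LEAK) * x + LEAK * np.tanh(W_in @ u + W @ x)
    return x

def train_readout(pairs, labels):
    """Fit a ridge-regression readout mapping reservoir states to relationship labels."""
    X = np.stack([reservoir_features(p) for p in pairs])   # (n_samples, N_RESERVOIR)
    Y = np.eye(int(labels.max()) + 1)[labels]              # one-hot relationship targets
    return np.linalg.solve(X.T @ X + RIDGE * np.eye(N_RESERVOIR), X.T @ Y)

def classify(pair, W_out):
    """Predict the relationship label (e.g., same class, rotated, different class)."""
    return int(np.argmax(reservoir_features(pair) @ W_out))

# Hypothetical usage: `pairs` has shape (n, 2, 28, 28) and `labels` encodes
# relationships such as "same class" or "rotated copy".
# W_out = train_readout(pairs, labels); pred = classify(pairs[0], W_out)
```

Because the reservoir weights stay fixed, the only learned parameters are those of the linear readout, which is why such a classifier can be trained from very few labeled pairs compared with a fully trained feed-forward network.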
