Non-uniform Initialization of Inputs Groupings in Contextual Neural Networks

Contextual neural networks which are using neurons with conditional aggregation functions were found to be efficient and useful generalizations of classical multilayer perceptrons. They allow to generate neural classification models with good generalization and low activity of connections between neurons in hidden layers. The key factor to build such solutions is achieving self-consistency between continuous values of weights of neurons’ connections and their mutually related non-continuous aggregation priorities. This allows to optimize neuron inputs aggregation priorities by simultaneous gradient-based optimization of connections’ weights with generalized BP algorithm. But such method additionally needs initial setting of connections groupings (scan-paths) to define priorities of signals during first ω epochs of training. In earlier studies all connections were initially assigned to a single group to give neurons access to all input signals at the beginning of training. We found out that such uniform solution not always is the best one. Thus within this text we compare efficiency of training of contextual neural networks with uniform and non-uniform, random initialization of connections groupings. On this basis we also discuss the properties of analyzed training algorithm which are related to characteristics of used scan-paths initialization methods.

[1]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[2]  Maciej Huk Measuring the Effectiveness of Hidden Context Usage by Machine Learning Methods under Conditions of Increased Entropy of Noise , 2017, 2017 3rd IEEE International Conference on Cybernetics (CYBCON).

[3]  Michal Szczepanik,et al.  Data Management for Fingerprint Recognition Algorithm Based on Characteristic Points' Groups , 2012, ADBIS Workshops.

[4]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[5]  Maciej Huk,et al.  Backpropagation generalized delta rule for the selective attention Sigma-if artificial neural network , 2012, Int. J. Appl. Math. Comput. Sci..

[6]  Maciej Huk,et al.  Notes on the generalized backpropagation algorithm for contextual neural networks with conditional aggregation functions , 2017, J. Intell. Fuzzy Syst..

[7]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[8]  Maciej Huk,et al.  Learning Distributed Selective Attention Strategies with the Sigma-if Neural Network , 2009 .

[9]  Andrew Canning,et al.  Thomas-Fermi charge mixing for obtaining self-consistency in density functional calculations , 2001 .

[10]  Claudio M. Privitera,et al.  Locating regions-of-interest for the Mars Rover expedition , 2000 .