A fast algorithm is proposed for optimal supervised learning in multiple-layer neural networks. The proposed algorithm is based on random optimization methods with dynamic annealing. The algorithm does not require the computation of error-function gradients and guarantees convergence to global minima. When applied to multiple-layer neural networks, the proposed algorithm updates, in batch mode, all neuron weights by Gaussian-distributed increments in a direction that reduces the total decision error. The variance of the Gaussian distribution is automatically controlled so that the random search step is concentrated in potential minimum-energy/error regions. Also demonstrated is a hybrid method that combines a gradient-descent phase with a subsequent phase of dynamically annealed random search, suitable for difficult learning tasks such as parity. Extensive simulations show a substantial convergence speedup of the proposed learning method compared with gradient-search methods such as backpropagation. The proposed algorithm is also shown to be simple to implement, computationally efficient, and able to reach global minima over wide ranges of parameter settings.
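To make the annealed random-search idea concrete, below is a minimal sketch in Python/NumPy under stated assumptions: the variance-control rule (widen the Gaussian step after an improvement, shrink it otherwise), the network size, the stopping threshold, and the names annealed_random_search and total_error are illustrative choices, not the paper's exact procedure.

```python
# Hedged sketch: dynamically annealed random search for a small two-layer network.
# The sigma grow/shrink schedule and all constants below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

def forward(params, X):
    W1, b1, W2, b2 = params
    h = np.tanh(X @ W1 + b1)                       # hidden layer
    return 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))    # sigmoid output

def total_error(params, X, T):
    # Batch-mode sum-of-squares error over all training patterns.
    return float(np.sum((forward(params, X) - T) ** 2))

def annealed_random_search(X, T, n_hidden=4, iters=20000,
                           sigma=1.0, grow=1.5, shrink=0.9,
                           sigma_min=1e-4, sigma_max=4.0):
    n_in, n_out = X.shape[1], T.shape[1]
    params = [rng.normal(0, 0.5, (n_in, n_hidden)),
              np.zeros(n_hidden),
              rng.normal(0, 0.5, (n_hidden, n_out)),
              np.zeros(n_out)]
    best_err = total_error(params, X, T)
    for _ in range(iters):
        # Perturb every weight by a Gaussian-distributed increment.
        trial = [p + rng.normal(0, sigma, p.shape) for p in params]
        err = total_error(trial, X, T)
        if err < best_err:
            # Accept the step and widen the search around the new point.
            params, best_err = trial, err
            sigma = min(sigma * grow, sigma_max)
        else:
            # Reject the step and concentrate the search (anneal the variance).
            sigma = max(sigma * shrink, sigma_min)
        if best_err < 1e-3:
            break
    return params, best_err

if __name__ == "__main__":
    # XOR, i.e. 2-bit parity, one of the difficult tasks mentioned in the abstract.
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    T = np.array([[0], [1], [1], [0]], dtype=float)
    _, err = annealed_random_search(X, T)
    print("final total error:", err)
```

In this sketch the only quantity that is annealed is the standard deviation of the weight increments, which shrinks when trial steps fail and widens when they succeed, so the search concentrates around promising low-error regions; the hybrid variant described in the abstract would run a gradient-descent phase first and then switch to this loop.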