Assuming that the reader is already familiar with the general concept of Artificial Neural Networks and with the Perceptron learning rule, this paper introduces the Delta learning rule as a basis for the Backpropagation learning rule. After discussing why multi-layer Artificial Neural Networks are needed to solve non-linearly separable problems, the paper describes all the mathematical steps that lead from the simple gradient descent formulation to the Backpropagation algorithm, which remains one of the most widely used methods for training feed-forward multi-layer Artificial Neural Networks. The paper concludes by discussing issues related to overfitting in feed-forward multi-layer Artificial Neural Networks and by presenting some heuristics and ideas for appropriate parameter settings.