On Virtually Binary Nature of Probabilistic Neural Networks

A sequential design of multilayer probabilistic neural networks is considered in the framework of statistical decision-making. Parameters and interconnection structure are optimized layer-by-layer by estimating unknown probability distributions on input space in the form of finite distribution mixtures. The components of mixtures correspond to neurons which perform an information preserving transform between consecutive layers. Simultaneously the entropy of the transformed distribution is minimized. It is argued that in multidimensional spaces and particularly at higher levels of multilayer feedforward neural networks, the output variables of probabilistic neurons tend to be binary. It is shown that the information loss caused by the binary approximation of neurons can be suppressed by increasing the approximation accuracy.

[1]  D. F. Specht,et al.  Probabilistic neural networks for classification, mapping, or associative memory , 1988, IEEE 1988 International Conference on Neural Networks.

[2]  Roy L. Streit,et al.  Maximum likelihood training of probabilistic neural networks , 1994, IEEE Trans. Neural Networks.

[3]  Hans Christian Palm,et al.  A new method for generating statistical classifiers assuming linear mixtures of Gaussian densities , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[4]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[5]  Jirí Grim Maximum-likelihood design of layered neural networks , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[6]  Jirí Grim,et al.  Multivariate statistical pattern recognition with nonreduced dimensionality , 1986, Kybernetika.

[7]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[8]  Jirí Grim,et al.  On numerical evaluation of maximum-likelihood estimates for finite mixtures of distributions , 1982, Kybernetika.

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .