Using A Priori Knowledge to Prestructure ANNs

The objective of the present work is to develop a constructive method that uses certain a priori information about a problem domain to design the starting structure of an artificial neural network (ANN). The method explored is based on a general systems theory methodology (here called GSM) that calculates a kind of structural information of the problem domain via analyzing I/O pairs from that domain. A modularized ANN structure is developed based on the GSM information provided. The notion of performance subset (PS) of an ANN structure is described, and extensive experiments on 3-input, 1-output Boolean mappings indicate that the resulting modularized-ANN design is 'conservative' in the sense that the PS of the modularized ANN contains at least all the mappings included in the GSM category used to design the ANN. Partial experiments on 5-input, 1-output Boolean functions indicate further success. The extended experimental results also suggest the possibility of using a measure of the learning curve of specified ANNs on a series of (in this case Boolean) functions to serve as a proxy measure for the complexity of those functions. This proxy measure seems to correlate well with a measure known as Boolean Length. Determining a function's Boolean Length is a non-trivial undertaking; perhaps it will turn out that training an ANN on the function and measuring its learning experience will be a useful measure of function complexity, and easier to determine than the function's Boolean Length.

[1]  W. Ashby,et al.  Constraint analysis of many-dimensional relations. , 1965, Progress in biocybernetics.

[2]  George G. Lendaris,et al.  Structure and constraint in discrete adaptive networks , 1965 .

[3]  G. Klir IDENTIFICATION OF GENERATIVE STRUCTURES IN EMPIRICAL DATA , 1976 .

[4]  John N. Warfield,et al.  SOCIETAL SYSTEMS Planning, Policy and Complexity , 1978 .

[5]  Klaus Krippendorff,et al.  On the Identification of Structures in Multivariate Data by the Spectral Analysis of Relations , 1979 .

[6]  George G. Lendaris,et al.  Structural Modeling a Tutorial Guide , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Roger C. Conant Structural modelling using a simple information measure , 1980 .

[8]  Bush Jones,et al.  Architecture of systems problem solving , 1986, Journal of the American Society for Information Science.

[9]  Klaus Krippendorff,et al.  Information Theory: Structural Models for Qualitative Data. , 1988 .

[10]  Roger C. Conant,et al.  EXTENDED DEPENDENCY ANALYSIS OF LARGE SYSTEMS , 1988 .

[11]  George G. Lendaris A proposal for indicating quality of generalization when evaluating ANNs , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[12]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[13]  Tsu-Chang Lee,et al.  Structure level adaptation for artificial neural networks , 1991 .

[14]  David E. Rumelhart,et al.  BACK-PROPAGATION, WEIGHT-ELIMINATION AND TIME SERIES PREDICTION , 1991 .

[15]  Charles E. Taylor,et al.  Artificial Life II , 1991 .

[16]  George G. Lendaris,et al.  Use of a structured problem domain to explore development of modularized neural networks , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[17]  Russell Reed,et al.  Pruning algorithms-a survey , 1993, IEEE Trans. Neural Networks.

[18]  Dušan Petrovački,et al.  Evolutional development of a multilevel neural network , 1993, Neural Networks.