Stacking Diverse Models to Achieve Reliable Error Response Distributions

Artificial Neural Networks (ANNs) can be useful for modeling real-world processes such as weather, financial, or chaotic time series. The generalization and robustness of these models can be improved, and estimates of the modeling error distributions can be made, using a technique called Stacked Generalization (SG). SG uses a number of diverse models, each of which is trained and queried on independent cross-validation subsets of the process data. The models are then combined in the stacking process to provide error estimates and improved accuracy. These improvements depend on the response diversity among the individual networks. Modified Series Association (MSA), an extension to SG, presents the various models with different input subspaces drawn from the raw data as a catalyst for increased diversity. Model diversity is formulated, and an alternative model combination approach, called Diversified Committee Machines (DCM), is derived from it. A framework for quantifying error estimation reliability is presented and discussed. Using this framework, the predictive accuracy of SG and DCM is compared in terms of both the modeled target function and the model's confidence interval about it. This is achieved through a new measure called the confidence coefficient. A benchmark problem is also introduced as a generic data set for future comparison between inductive learning machines.
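To make the SG scheme concrete, the following is a minimal sketch of stacked generalization on synthetic data, assuming scikit-learn base learners as stand-ins for the paper's diverse networks; the MSA and DCM variants and the confidence coefficient are not reproduced here.

```python
import numpy as np
from sklearn.model_selection import cross_val_predict
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor
from sklearn.tree import DecisionTreeRegressor
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in for a real-world process (e.g. a noisy time series).
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(500, 4))
y = np.sin(3 * X[:, 0]) + 0.5 * X[:, 1] ** 2 + 0.1 * rng.normal(size=500)

# Level-0: diverse base models; response diversity is the key ingredient of SG.
base_models = [
    MLPRegressor(hidden_layer_sizes=(16,), max_iter=2000, random_state=0),
    DecisionTreeRegressor(max_depth=5, random_state=0),
    KNeighborsRegressor(n_neighbors=7),
]

# Out-of-fold predictions: each model is queried only on data held out from
# its training folds, giving honest inputs for the level-1 combiner.
meta_features = np.column_stack(
    [cross_val_predict(m, X, y, cv=5) for m in base_models]
)

# Level-1 (stacking) model combines the base responses.
stacker = LinearRegression().fit(meta_features, y)

# Residuals of the stacked prediction give a crude error distribution,
# from which a confidence interval about the target can be estimated.
residuals = y - stacker.predict(meta_features)
print("stacked RMSE:", np.sqrt(np.mean(residuals ** 2)))
print("95% empirical error band:", np.percentile(np.abs(residuals), 95))
```

In this sketch the linear combiner plays the role of the stacking stage; swapping in base learners trained on different input subspaces would approximate the MSA idea of forcing diversity through the data presentation rather than the model class.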