论文信息 - A Hybrid Connectionist for Multiple Regression

A Hybrid Connectionist for Multiple Regression

We present a hybrid system for generating multiple linear regression lines for the solution of a function approximation problem. The system consists of three components: (1) a neural network which is trained to fit the data samples; (2) a simple algorithm for splitting the input space of the data into subregions and (3) the traditional multiple regression technique for finding the coefficients of the regression lines. While neural networks work particularly well for nonlinear function approximation, their outputs are difficult to explain to a human user. Our proposed system predicts the network outputs as linear functions of the input attributes. In order to maintain the predictive accuracy of the networks, the system divides the space of the input data into several subspaces. In each of these subspaces, a linear equation is generated for predicting the target values for all samples that belong to the subspace. We illustrate the effectiveness of the system using two data sets, one is an artificial data set, while the other is a real world data set for predicting the fuel consumption of automobiles.

Jacek M. Zurada | Rudy Setiono | R. Setiono | J. Zurada

[1] John E. Dennis,et al. Numerical methods for unconstrained optimization and nonlinear equations , 1983, Prentice Hall series in computational mathematics.

[2] Gilles Pagès,et al. Approximations of Functions by a Multilayer Perceptron: a New Approach , 1997, Neural Networks.

[3] Kurt Hornik,et al. Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[4] Wee Kheng Leow,et al. Pruned Neural Networks for Regression , 2000, PRICAI.

[5] Luís Torgo,et al. Search-Based Class Discretization , 1997, ECML.

[6] Gerhard Widmer,et al. Relative Unsupervised Discretization for Regresseion Problems , 2000, ECML.

[7] Alberto Maria Segre,et al. Programs for Machine Learning , 1994 .

[8] Jacek M. Zurada,et al. Extraction of rules from artificial neural networks for nonlinear regression , 2002, IEEE Trans. Neural Networks.

[9] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[10] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[11] D. Kilpatrick,et al. Numeric Prediction Using Instance-Based Learning with Encoding Length Selection , 1997, ICONIP.

[12] Dayanand N. Naik,et al. Applied Multivariate Statistics with SAS Software , 1997 .

[13] Ron Kohavi,et al. Irrelevant Features and the Subset Selection Problem , 1994, ICML.