Inductive regression: overcoming OLS limitations with the general regression neural network

Predicting and modeling relationships between spatially varying phenomena is central to understanding the geography of disease, politics, economics, and land use, for example. The purpose of this article is to introduce the General Regression Neural Network (GRNN) to a general geography audience as a powerful tool to use in place of OLS for function approximation with geographic data. Specifically, GRNN is a simple to use, robust predictor that is capable of modeling non-linear relationships without a pre-determined functional form and avoids four OLS assumptions that are frequently violated when using spatially referenced datasets. In OLS, assumptions about data characteristics are encoded into the predictive model and when the assumptions are violated the regression coefficients are biased. Separating model parameters from assumptions of data structure and distribution allows the GRNN to be partially molded by the characteristics of the data themselves, an inductive prediction. The four frequently violated OLS assumptions that do not affect GRNN are (1) linear functional relationship, (2) data distribution, (3) resilience to outliers, and (4) independence of observations. The reasons why these assumptions cause problems for OLS are reviewed and then reexamined in GRNN context. GRNN operation is described in three parts: probability density function estimation, network structure, and interpretation of results. Finally, the case of maize production in the US Great Plains is presented as an example of the practical use of GRNN in function approximation.

[1]  Edward J. Rzempoluck,et al.  Neural Network Data Analysis Using Simulnet™ , 1997, Springer New York.

[2]  Huan Liu,et al.  Book review: Machine Learning, Neural and Statistical Classification Edited by D. Michie, D.J. Spiegelhalter and C.C. Taylor (Ellis Horwood Limited, 1994) , 1996, SGAR.

[3]  Halbert White,et al.  Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.

[4]  Glyn M. Rimmington,et al.  Mathematical descriptions of plant growth and development. , 1987 .

[5]  Michael Tiefelsdorf,et al.  Modelling Spatial Processes , 2000 .

[6]  S. Fotheringham,et al.  Geographically weighted regression : modelling spatial non-stationarity , 1998 .

[7]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[8]  Murray Aitkin,et al.  A general maximum likelihood analysis of overdispersion in generalized linear models , 1996, Stat. Comput..

[9]  L. Hamilton 217-249 inRegression with Graphics: A Second Course in Applied Statistics , 1991 .

[10]  S. Fotheringham,et al.  Geographically Weighted Regression , 1998 .

[11]  E. Casetti,et al.  Applications of the Expansion Method , 1991 .

[12]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[13]  Daniel L. Civco,et al.  Artificial Neural Networks for Land-Cover Classification and Mapping , 1993, Int. J. Geogr. Inf. Sci..

[14]  Mark Gahegan,et al.  Neural network architectures for the classification of temporal image sequences , 1996 .

[15]  H. Goldstein,et al.  Multilevel Models in Educational and Social Research. , 1989 .

[16]  Timothy Masters,et al.  Advanced algorithms for neural networks: a C++ sourcebook , 1995 .

[17]  Leo Breiman,et al.  [Neural Networks: A Review from Statistical Perspective]: Comment , 1994 .

[18]  Robert Tibshirani,et al.  [Neural Networks: A Review from Statistical Perspective]: Comment , 1994 .

[19]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[20]  Martin Charlton,et al.  The Geography of Parameter Space: An Investigation of Spatial Non-Stationarity , 1996, Int. J. Geogr. Inf. Sci..

[21]  A. Stewart Fotheringham,et al.  Trends in quantitative methods I: stressing the local , 1997 .

[22]  Marc P. Armstrong,et al.  Geography and Computational Science , 2000 .

[23]  N. Rosenberg,et al.  Processes for identifying regional influences of and responses to increasing atmospheric CO{sub 2} and climate change - the MINK project: An overview , 1991 .

[24]  Manfred M. Fischer,et al.  ARTIFICIAL NEURAL NETWORKS: A NEW APPROACH TO MODELING INTERREGIONAL TELECOMMUNICATION FLOWS* , 1994 .

[25]  George L. Benwell,et al.  The integration of ecological, neural and spatial modelling for monitoring and prediction for semi-arid landscapes , 1996 .

[26]  K. Wisiol Choosing a basis for yield forecasts and estimates. , 1987 .

[27]  D. M. Titterington,et al.  Neural Networks: A Review from a Statistical Perspective , 1994 .

[28]  P. Gould Is Statistix Inferens the Geographical Name for A Wild Goose , 1970 .

[29]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .