Information Discovery in Ecological Systems by Artificial Neural Networks: Algal Blooms at Gippsland Lakes

This paper aims to discuss two aspects of working with large ecological data sets; analysis and modelling of ecological data sets, and subdivision of data into smaller subsets for the purpose of analysis and modelling. Different approaches to the information discovery in ecological systems based on Artificial Neural Networks (ANNs) are considered ANNs are powerful modelling tools. Their strength is derived from their ability to model complex, non-linear relationships. However, a drawback of ANNs is that they cannot distinguish between noise and actual data in a system. Ecological systems are prone to greater noise than many other systems. The solution therefore lies in applying ANNs to ecological problems more creatively. In algal blooms and similar ecological problems the use of ANNs has been primarily limited to the predictive modelling and sensitivity analysis. This paper proposes a multi-stage analysis comprising of predictive function modelling, sensitivity analysis, principal component analysis (PCA) and non-linear principal component analysis (NLPCA). The most common method of data subdivision for training, validating and testing is a method of random or redundant subsets. This method of data subdivision is not always appropriate because ecological systems represent open sets with complex relationships. Ecological data are often incomplete and contaminated by noise. This paper proposes a systematic approach to subdivision of data into training, validating and testing datasets.

[1]  J. Sprott Chaos and time-series analysis , 2001 .

[2]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[3]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[4]  A. J. Conway,et al.  Time series, neural networks and the future of the Sun , 1998 .

[5]  Sovan Lek,et al.  Artificial neural networks as a tool in ecological modelling, an introduction , 1999 .

[6]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[7]  L. R. Mur,et al.  Cyanobacteria in the environment , 1999 .

[8]  D. Roelke,et al.  Health and Ecological Impacts of Harmful Algal Blooms: Risk Assessment Needs , 2001 .

[9]  S. Golden,et al.  Adaptive significance of circadian programs in cyanobacteria. , 1998, Trends in microbiology.

[10]  J. Bartram,et al.  DETERMINATION OF CYANOBACTERIA IN THE LABORATORY , 1999 .

[11]  H. R. Maier,et al.  Modelling Cyanbacteria (blue-green algae) in the River Murray using artificial neural networks , 1997 .

[12]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[13]  Holger R. Maier,et al.  Understanding the behaviour and optimising the performance of back-propagation neural networks: an empirical study , 1998 .

[14]  Holger R. Maier,et al.  Neural networks for the prediction and forecasting of water resource variables: a review of modelling issues and applications , 2000, Environ. Model. Softw..

[15]  Adam H. Monahan,et al.  Nonlinear Principal Component Analysis by Neural Networks: Theory and Application to the Lorenz System , 2000 .