Chapter 2 - Toward a new generation of ecological modelling techniques: Review and bibliometrics

Modelling techniques have long been routinely employed in understanding complex ecological problems over the last several decades. It is therefore necessary to outline both the development history and research trends of ecological modelling. This chapter contributes to a global view of the development and research trend of modelling techniques in ecological studies. First of all, the history of five generations of ecological modelling were determined and reviewed over the last several decades. Thereafter, a bibliometric analysis method was performed to systematically reveal the research trends of ecological modelling applications during the period 1991–2013, from the following perspectives: publication output and language, subject categories, country distribution and international cooperation networks, and author keyword analysis. Last, based on the quantitative results, some frequently used and fast-developing models and algorithms are briefly reviewed to provide a primer for ecological model users and contributors.

[1]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[2]  R. M. Durand,et al.  Redundancy analysis: An alternative to canonical correlation and multivariate multiple regression in exploring interset associations. , 1988 .

[3]  Sovan Lek,et al.  Artificial Neuronal Networks: Application To Ecology And Evolution , 2012 .

[4]  A. Luczak,et al.  Estimating neuronal variable importance with Random Forest , 2003, 2003 IEEE 29th Annual Proceedings of Bioengineering Conference.

[5]  Thomas A. Davidson,et al.  The simultaneous inference of zooplanktivorous fish and macrophyte density from sub-fossil cladoceran assemblages: a multivariate regression tree approach , 2010 .

[6]  R. Death,et al.  Predictive modelling and spatial mapping of freshwater fish and decapod assemblages using GIS and neural networks , 2004 .

[7]  S. Lek,et al.  Applications of artificial neural networks for patterning and predicting aquatic insect species richness in running waters , 2003 .

[8]  G. De’ath MULTIVARIATE REGRESSION TREES: A NEW TECHNIQUE FOR MODELING SPECIES–ENVIRONMENT RELATIONSHIPS , 2002 .

[9]  Robert I. McKay Variants of genetic programming for species distribution modelling — fitness sharing, partial functions, population evaluation , 2001 .

[10]  Lael Parrott,et al.  Hybrid modelling of complex ecological systems for decision support: Recent successes and future perspectives , 2011, Ecol. Informatics.

[11]  Young-Seuk Park,et al.  Hazard rating of coastal pine forests for a black pine bast scale using self-organizing map (SOM) and random forest approaches , 2015, Ecol. Informatics.

[12]  E. M. Anderson,et al.  The dynamics of faulting : and dyke formation with applications to Britain , 1953, Geological Magazine.

[13]  Yuh-Shan Ho,et al.  Bibliometric analysis of tsunami research , 2007, Scientometrics.

[14]  D. DeAngelis,et al.  New Computer Models Unify Ecological TheoryComputer simulations show that many ecological patterns can be explained by interactions among individual organisms , 1988 .

[15]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[16]  Ding-Geng Chen,et al.  A fuzzy logic model with genetic algorithm for analyzing fish stock-recruitment relationships , 2000 .

[17]  Nitin Muttil,et al.  Genetic programming for analysis and real-time prediction of coastal algal blooms , 2005 .

[18]  Mathieu Marmion,et al.  The performance of state-of-the-art modelling techniques depends on geographical distribution of species. , 2009 .

[19]  Paresh Chandra Deka,et al.  Support vector machine applications in the field of hydrology: A review , 2014, Appl. Soft Comput..

[20]  Sandhya Samarasinghe,et al.  Mixed-method integration and advances in fuzzy cognitive maps for computational policy simulations for natural hazard mitigation , 2013, Environ. Model. Softw..

[21]  W. Thuiller Patterns and uncertainties of species' range shifts under climate change , 2004 .

[22]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[23]  A. Pritchard,et al.  Statistical bibliography or bibliometrics , 1969 .

[24]  Matthew E Falagas,et al.  A bibliometric analysis of global trends of research productivity in tropical medicine. , 2006, Acta tropica.

[25]  G. De’ath Boosted trees for ecological modeling and prediction. , 2007, Ecology.

[26]  Sorin Draghici,et al.  Machine Learning and Its Applications to Biology , 2007, PLoS Comput. Biol..

[27]  Hone-Jay Chu,et al.  Modeling waterbird diversity in irrigation ponds of Taoyuan, Taiwan using an artificial neural network approach , 2009, Paddy and Water Environment.

[28]  Jayson L. Lusk,et al.  ALTERNATIVE CALIBRATION AND AUCTION INSTITUTIONS FOR PREDICTING CONSUMER WILLINGESS TO PAY FOR NONGENETICALLY MODIFIED CORN CHIPS , 2001 .

[29]  Tae-Soo Chon,et al.  Biologically-inspired machine learning implemented to ecological informatics ☆ , 2007 .

[30]  L. Comte,et al.  Do stream fish track climate change? Assessing distribution shifts in recent decades , 2013 .

[31]  Sven Erik Jørgensen,et al.  The Application of Structurally Dynamic Models in Ecology and Ecotoxicology , 2009 .

[32]  Tae-Soo Chon,et al.  Self-Organizing Maps applied to ecological sciences , 2011, Ecol. Informatics.

[33]  J. Peters,et al.  Random forests as a tool for ecohydrological distribution modelling , 2007 .

[34]  Young-Seuk Park,et al.  Predicting potential impacts of climate change on freshwater fish in Korea , 2015, Ecol. Informatics.

[35]  Young-Seuk Park,et al.  Patterning and predicting aquatic macroinvertebrate diversities using artificial neural network. , 2003, Water research.

[36]  F. Recknagel,et al.  Artificial neural network approach for modelling and prediction of algal blooms , 1997 .

[37]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[38]  Florian Jeltsch,et al.  Grazing response patterns indicate isolation of semi‐natural European grasslands , 2014 .

[39]  S. Tarkowski Environmental health research in Europe: bibliometric analysis. , 2007, European journal of public health.

[40]  S. Jørgensen Integration of Ecosystem Theories: A Pattern , 1994, Ecology & Environment.

[41]  William Stafford Noble,et al.  Support vector machine , 2013 .

[42]  Sovan Lek,et al.  Stochastic models that predict trout population density or biomass on a mesohabitat scale , 1996, Hydrobiologia.

[43]  Michael J. Friedel,et al.  Data-driven modeling of surface temperature anomaly and solar activity trends , 2012, Environ. Model. Softw..

[44]  Jing Zhang,et al.  Assessment of world aerosol research trends by bibliometric analysis , 2008, Scientometrics.

[45]  V. Volterra Fluctuations in the Abundance of a Species considered Mathematically , 1926 .

[46]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[47]  Young-Seuk Park,et al.  Effects of meteorological factors and global warming on rice insect pests in Korea , 2012 .

[48]  László Orlóci,et al.  Applying Metric and Nonmetric Multidimensional Scaling to Ecological Studies: Some New Results , 1986 .

[49]  Sovan Lek,et al.  Utilisation of non-supervised neural networks and principal component analysis to study fish assemblages , 2001 .

[50]  Calyampudi R. Rao The use and interpretation of principal component analysis in applied research , 1964 .

[51]  Young-Seuk Park,et al.  Patternizing communities by using an artificial neural network , 1996 .

[52]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[53]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[54]  Young-Seuk Park,et al.  Hazard rating of pine trees from a forest insect pest using artificial neural networks , 2006 .

[55]  Pierre Legendre,et al.  Numerical Ecology with R , 2011 .

[56]  Fred E. Smeins,et al.  Predicting grassland community changes with an artificial neural network model , 1996 .

[57]  Vivek Srivastava,et al.  Modeling potential invasion range of alien invasive species, Hyptis suaveolens (L.) Poit. in India: Comparison of MaxEnt and GARP , 2014, Ecol. Informatics.

[58]  J. R. Wallis,et al.  Some ecological consequences of a computer model of forest growth , 1972 .

[59]  Georg H. Engelhard,et al.  Maturity changes in Norwegian spring-spawning herring before, during, and after a major population collapse , 2004 .

[60]  M. Tipayamongkholgul,et al.  Socio-geographical factors in vulnerability to dengue in Thai villages: a spatial regression analysis. , 2011, Geospatial health.

[61]  Maggi Kelly,et al.  Support vector machines for predicting distribution of Sudden Oak Death in California , 2005 .

[62]  Thomas C. Kelly,et al.  The response of the woodpigeon (Columba palumbus) to relaxation of intraspecific competition: A hybrid modelling approach , 2012 .

[63]  Young-Seuk Park,et al.  Determining temporal pattern of community dynamics by using unsupervised learning algorithms , 2000 .

[64]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[65]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[66]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[67]  A. M. Kalteh,et al.  Review of the self-organizing map (SOM) approach in water resources: Analysis, modelling and application , 2008, Environ. Model. Softw..

[68]  Y-S Park,et al.  Typology of diatom communities and the influence of hydro-ecoregions: a study on the French hydrosystem scale. , 2005, Water research.

[69]  Bernhard H. Schmid,et al.  Artificial Neural Network Modeling of Dissolved Oxygen in a Wetland Pond: The Case of Hovi, Finland , 2006 .

[70]  I. C. Prentice,et al.  NON-METRIC ORDINATION METHODS IN ECOLOGY , 1977 .

[71]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[72]  Hartmut Bossel,et al.  Real-structure process description as the basis of understanding ecosystems and their development , 1992 .

[73]  I. Dimopoulos,et al.  Role of some environmental variables in trout abundance models using neural networks , 1996 .

[74]  E. Lederer,et al.  Occurrence of D-Phenylalanine, D-Allothreonine and other D-Amino-Acids in Peptido-Lipids of Bacterial Origin , 1960, Nature.

[75]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[76]  Malka Gorfine,et al.  Sensitivity analysis for complex ecological models - A new approach , 2011, Environ. Model. Softw..

[77]  Chi Leung Patrick Hui,et al.  Artificial Neural Networks - Application , 2011 .

[78]  D. DeAngelis,et al.  Trait contributions to fish community assembly emerge from trophic interactions in an individual-based model , 2013 .

[79]  Julian D Olden,et al.  Machine Learning Methods Without Tears: A Primer for Ecologists , 2008, The Quarterly Review of Biology.

[80]  Teruhisa Komatsu,et al.  Prediction of the Catch of Japanese Sardine Larvae in Sagami Bay Using a Neural Network , 1994 .

[81]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[82]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[83]  J Uchmański,et al.  Individual-based modelling in ecology: what makes the difference? , 1996, Trends in ecology & evolution.

[84]  Martin P. Girardin,et al.  Response of tree growth to a changing climate in boreal central Canada: A comparison of empirical, process-based, and hybrid modelling approaches , 2008 .

[85]  R. Naiman,et al.  Freshwater biodiversity: importance, threats, status and conservation challenges , 2006, Biological reviews of the Cambridge Philosophical Society.

[86]  Gabriela Espinosa,et al.  Modeling airborne benzene in space and time with self-organizing maps and Bayesian techniques , 2013, Environ. Model. Softw..

[87]  Kazuaki Hiramatsu,et al.  Assessment of spatial habitat heterogeneity by coupling data-driven habitat suitability models with a 2D hydrodynamic model in small-scale streams , 2015, Ecol. Informatics.

[88]  D. DeAngelis,et al.  Individual-based models in ecology after four decades , 2014, F1000prime reports.

[89]  Andreas Hamann,et al.  Developing seed zones and transfer guidelines with multivariate regression trees , 2011, Tree Genetics & Genomes.

[90]  A. J. Lotka Elements of mathematical biology , 1956 .

[91]  Sovan Lek,et al.  Genetic population structure of Chinese sturgeon (Acipenser sinensis) in the Yangtze River revealed by artificial neural network , 2006 .

[92]  Milan Straškraba Natural control mechanisms in models of aquatic ecosystems , 2001 .

[93]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[94]  Young-Seuk Park,et al.  Modelling Community Structure in Freshwater Ecosystems , 2014 .

[95]  Estelle Russek-Cohen,et al.  Use of Maryland Biological Stream Survey Data to Determine Effects of Agricultural Riparian Buffers on Measures of Biological Stream Health , 2006, Environmental monitoring and assessment.

[96]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[97]  Ans Mouton,et al.  Ecological relevance of' performance criteria for species distribution models , 2010 .

[98]  Bernhard Schölkopf,et al.  Training Invariant Support Vector Machines , 2002, Machine Learning.

[99]  Ulf Dieckmann,et al.  Age at maturation predicted from routine scale measurements in Norwegian spring-spawning herring (Clupea harengus) using discriminant and neural network analyses , 2003 .

[100]  David R. B. Stockwell,et al.  The GARP modelling system: problems and solutions to automated spatial prediction , 1999, Int. J. Geogr. Inf. Sci..

[101]  A. Peterson,et al.  Use of niche models in invasive species risk assessments , 2011, Biological Invasions.

[102]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[103]  D. DeAngelis,et al.  Cannibalism and size dispersal in young-of-the-year largemouth bass: Experiment and model , 1980 .

[104]  G. Booth,et al.  Modelling food web complexity: The consequences of individual-based, spatially explicit behavioural ecology on trophic interactions , 1997, Evolutionary Ecology.

[105]  S. E. Jorgensen Ecological Modelling: An Introduction , 2009 .

[106]  G. De’ath,et al.  Species richness and community structure of reef-building corals on the nearshore Great Barrier Reef , 2006, Coral Reefs.

[107]  A. L. V. D. Wollenberg Redundancy analysis an alternative for canonical correlation analysis , 1977 .

[108]  Young-Seuk Park,et al.  Hazard ratings of pine forests to a pine wilt disease at two spatial scales (individual trees and stands) using self-organizing map and random forest , 2013, Ecol. Informatics.

[109]  Nello Cristianini,et al.  Support Vector Machines and Kernel Methods: The New Generation of Learning Machines , 2002, AI Mag..

[110]  M. Fasham,et al.  A Comparison of Nonmetric Multidimensional Scaling, Principal Components and Reciprocal Averaging for the Ordination of Simulated Coenoclines, and Coenoplanes , 1977 .

[111]  I. Dimopoulos,et al.  Application of neural networks to modelling nonlinear relationships in ecology , 1996 .

[112]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[113]  Stanley V. Gregory,et al.  Ecological uses for genetic algorithms: predicting fish distributions in complex physical habitats , 1995 .

[114]  Sven Erik Jørgensen,et al.  Structural dynamic model , 1986 .

[115]  Brent N. Holben,et al.  Extracting forest age in a Pacific Northwest Forest from thematic mapper and topographic data , 1996 .

[116]  M. Termansen,et al.  The use of genetic algorithms and Bayesian classification to model species distributions , 2006 .

[117]  Paul H. Williams,et al.  Biodiversity conservation planning tools , 2006 .

[118]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[119]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[120]  S. Lek,et al.  Uncertainty in ensemble forecasting of species distribution , 2010 .

[121]  George R. Parker,et al.  Ecosystem classification in a flat, highly fragmented region of Indiana, USA , 2005 .

[122]  Walter J. Rawls,et al.  Use of the Nonparametric Nearest Neighbor Approach to Estimate Soil Hydraulic Properties , 2006 .

[123]  A. Peterson,et al.  Global patterns of connectivity and isolation of populations of forest bird species in the late Pleistocene , 2013 .

[124]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[125]  Yuh-Shan Ho,et al.  Bibliometric analysis on global Parkinson's disease research trends during 1991–2006 , 2008, Neuroscience Letters.

[126]  A. Budden,et al.  Big data and the future of ecology , 2013 .

[127]  Shang-Lien Lo,et al.  Diagnosing reservoir water quality using self-organizing maps and fuzzy theory. , 2002, Water research.

[128]  E. Prepas,et al.  The application of artificial neural networks to flow and phosphorus dynamics in small streams on the Boreal Plain, with emphasis on the role of wetlands , 2006 .

[129]  E. Schrödinger What Is Life , 1946 .

[130]  Sovan Lek,et al.  Predicting assemblages and species richness of endemic fish in the upper Yangtze River. , 2010, The Science of the total environment.

[131]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[132]  Matthew B Jones,et al.  Ecoinformatics: supporting ecology as a data-intensive science. , 2012, Trends in ecology & evolution.

[133]  Sven Erik Jørgensen,et al.  Application of eco-exergy for assessment of ecosystem health and development of structurally dynamic models , 2010 .

[134]  Young-Seuk Park,et al.  Characterization of Ecological Exergy Based on Benthic Macroinvertebrates in Lotic Ecosystems , 2013, Entropy.

[135]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[136]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[137]  Young-Seuk Park,et al.  Implementation of artificial neural networks in patterning and prediction of exergy in response to temporal dynamics of benthic macroinvertebrate communities in streams , 2001 .

[138]  Sovan Lek,et al.  Improved estimation, using neural networks, of the food consumption of fish populations , 1995 .

[139]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[140]  M. J. Hatcher,et al.  Modeling Biological Systems: Principles and Applications , 1997 .

[141]  Xiao‐Hai Yan,et al.  A Neural Network Model for Estimating Sea Surface Chlorophyll and Sediments from Thematic Mapper Imagery , 1998 .

[142]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[143]  Young-Seuk Park,et al.  Water quality assessment using diatom assemblages and advanced modelling techniques , 2004 .

[144]  Marco Gamba,et al.  Support vector machines to model presence/absence of Alburnus alburnus alborella (Teleostea, Cyprinidae) in North-Western Italy: comparison with other machine learning techniques. , 2012, Comptes rendus biologies.