Ecological state evaluation of lake ecosystems revisited: Latent variables with kSVM algorithm approach for assessment automatization and data comprehension

Abstract Automated and reproducible methodology for assessing the ecological condition of lakes is essential for effective monitoring and facilitating the decision-making process aimed at achieving the stated environmental goals. At the same time, multidimensional measurement datasets are often an obstacle to drawing insightful conclusions, thus becoming an incentive for overly simplified analyzes. In this article, a set of measurements and ecological status assessment results for a collection of 499 lakes in Poland was used. Expert assessment process was recreated using the supervised kernel Support Vector Machine algorithm on dataset with reduced dimensionality, thus a model that automates the ecological assessment process was obtained. The use of the explanatory skill of latent variables made it possible to present the assessed objects along with their position in individual classes. The visualization of the results in reduced dimensionality increased, without interfering with the size of the classes, the informative evaluation potential, which should be considered as an acompanying assessment parameter in the future. The primary target of this paper is the ecological expert coping with automatization of assessment process and obtaining latent information for sense-making visual comprehension during consultations regarding ecosystem-oriented ecological decision making.

[1]  Erik Jeppesen,et al.  Water Framework Directive: ecological classification of Danish lakes , 2005 .

[2]  Sebastian Birk,et al.  Intercalibration of aquatic ecological assessment methods in the European Union: Lessons learned and way forward , 2014 .

[3]  Martin A Nowak,et al.  Evolutionary dynamics in set structured populations , 2009, Proceedings of the National Academy of Sciences.

[4]  Lennart Olsson,et al.  Categorising tools for sustainability assessment , 2007 .

[5]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[6]  Robert Tibshirani,et al.  The Entire Regularization Path for the Support Vector Machine , 2004, J. Mach. Learn. Res..

[7]  J. Josse,et al.  missMDA: A Package for Handling Missing Values in Multivariate Data Analysis , 2016 .

[8]  Permani C Weerasekara,et al.  The United Nations World Water Development Report 2017 Wastewater: The Untapped Resource , 2017 .

[9]  D. M. Allen Mean Square Error of Prediction as a Criterion for Selecting Variables , 1971 .

[10]  Daniel P. Loucks,et al.  Water Resources Planning and Management: An Overview , 2017 .

[11]  R. Couture,et al.  Climate change, cyanobacteria blooms and ecological status of lakes: A Bayesian network approach , 2016 .

[12]  Zoubin Ghahramani,et al.  Unifying linear dimensionality reduction , 2014, 1406.0873.

[13]  Nathalie Niquil,et al.  Using ecological models to assess ecosystem status in support of the European Marine Strategy Framework Directive , 2015 .

[14]  W. Christopher Lenhardt,et al.  The Tao of open science for ecology , 2015 .

[15]  Jui-Sheng Chou,et al.  Determining quality of water in reservoir using machine learning , 2018, Ecol. Informatics.

[16]  J. Schaumburg,et al.  Macrophytes and phytobenthos as indicators of ecological status in German lakes — a contribution to the implementation of the water framework directive , 2004 .

[17]  Giles M. Foody,et al.  Feature Selection for Classification of Hyperspectral Data by SVM , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Lael Parrott,et al.  Complexity and the limits of ecological engineering , 2002 .

[19]  H. V. D. Klis,et al.  Uncertainty analysis of a spatial habitat suitability model and implications for ecological management of water bodies , 2006, Landscape Ecology.

[20]  M. D. Nelson,et al.  Comparison of statistical and theoretical habitat models for conservation planning: the benefit of ensemble prediction. , 2011, Ecological applications : a publication of the Ecological Society of America.

[21]  M. Zwaan An introduction to hilbert space , 1990 .

[22]  Jieping Ye,et al.  Least squares linear discriminant analysis , 2007, ICML '07.

[23]  Toshihisa Tanaka,et al.  Robust Kernel Principal Component Analysis With ℓ2,1-Regularized Loss Minimization , 2020, IEEE Access.

[24]  K. Farnsworth,et al.  How many dimensions of biodiversity do we need , 2012 .

[25]  Francisco Herrera,et al.  Learning from Imbalanced Data Sets , 2018, Springer International Publishing.

[26]  M. Acreman,et al.  Environmental flows and the European Water Framework Directive. , 2010 .

[27]  Saygin Abdikan,et al.  COMPARISON OF CROP CLASSIFICATION METHODS FOR THE SUSTAINABLE AGRICULTURE MANAGEMENT , 2016 .

[28]  B. Martín‐López,et al.  Trade-offs across value-domains in ecosystem services assessment. , 2014 .

[29]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[30]  Marko Järvinen,et al.  Defining the ecological status of small forest lakes using multiple biological quality elements and palaeolimnological analysis. , 2009 .

[31]  Eric Hervet,et al.  Applications for deep learning in ecology , 2019, Methods in Ecology and Evolution.

[32]  Dimitris Bertsimas,et al.  From Predictive Methods to Missing Data Imputation: An Optimization Approach , 2017, J. Mach. Learn. Res..

[33]  Steven Hamblin,et al.  On the practical usage of genetic algorithms in ecology and evolution , 2013 .

[34]  Richard K. Johnson,et al.  Assessing temporal scales and patterns in time series: Comparing methods based on redundancy analysis , 2015 .

[35]  M. Morrissey In search of the best methods for multivariate selection analysis , 2014 .

[36]  Ana Cristina Cardoso,et al.  Assessing water ecosystem services for water resource management , 2016 .

[37]  Nitin Muttil,et al.  Machine-learning paradigms for selecting ecologically significant input variables , 2007, Eng. Appl. Artif. Intell..

[38]  Chabane Djeraba,et al.  Sets, Relations, and Functions , 2014 .

[39]  P. Tryjanowski,et al.  The dark side of the “redundancy hypothesis” and ecosystem assessment , 2016 .

[40]  Y. P. Li,et al.  Integrated ecosystem health assessment of a macrophyte-dominated lake , 2013 .

[41]  O. Weyl,et al.  Lake Malawi: fishes, fisheries, biodiversity, health and habitat , 2010 .

[42]  J. Padisák,et al.  Use of Phytoplankton Assemblages for Monitoring Ecological Status of Lakes within the Water Framework Directive: The Assemblage Index , 2005, Hydrobiologia.

[43]  P. Verburg,et al.  Mapping ecosystem services demand: A review of current research and future perspectives , 2015 .

[44]  R. Cortes,et al.  Evaluation of the ecological status of an impaired watershed by using a multi-index approach , 2011, Environmental monitoring and assessment.

[45]  Dunja Mladenic,et al.  Feature Selection for Dimensionality Reduction , 2005, SLSFS.

[46]  Ioannis N. Athanasiadis,et al.  Machine learning for ecosystem services , 2018, Ecosystem Services.

[47]  Laura Uusitalo,et al.  An overview of methods to evaluate uncertainty of deterministic models in decision support , 2015, Environ. Model. Softw..

[48]  G. Stewart,et al.  An Algorithm for Generalized Matrix Eigenvalue Problems. , 1973 .

[49]  Birgitta König-Ries,et al.  Towards an ecological trait‐data standard , 2019, Methods in Ecology and Evolution.

[50]  W. Jetz,et al.  Downscaling the environmental associations and spatial patterns of species richness. , 2014, Ecological applications : a publication of the Ecological Society of America.

[51]  O. C. Zienkiewicz,et al.  Discrete element methods , 2005 .

[52]  E. García‐Berthou,et al.  Ecological classification of a set of Mediterranean reservoirs applying the EU Water Framework Directive: A reasonable compromise between science and management , 2009 .

[53]  U. Dieckmann,et al.  Complexity and stability of ecological networks: a review of the theory , 2018, Population Ecology.

[54]  Hannu Toivonen,et al.  BAYESIAN ANALYSIS OF METAPOPULATION DATA , 2002 .

[55]  S. Džeroski,et al.  Using classification trees to analyze the impact of exotic species on the ecological assessment of polder lakes in Flanders, Belgium , 2011 .

[56]  Alan L. Flint,et al.  Downscaling future climate scenarios to fine scales for hydrologic and ecological modeling and analysis , 2012, Ecological Processes.

[57]  Gerald J. Niemi,et al.  Application of Ecological Indicators , 2004 .

[58]  A Michelle Lawing,et al.  Environmental filtering improves ecological niche models across multiple scales , 2019, Methods in Ecology and Evolution.

[59]  Donald A. Jackson,et al.  GIVING MEANINGFUL INTERPRETATION TO ORDINATION AXES: ASSESSING LOADING SIGNIFICANCE IN PRINCIPAL COMPONENT ANALYSIS , 2003 .

[60]  D. Pierson,et al.  A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins , 2018, Scientific data.

[61]  R. Manne,et al.  Missing values in principal component analysis , 1998 .

[62]  Lijuan Cao,et al.  A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine , 2003, Neurocomputing.

[63]  T. Berman,et al.  Water Quality Assessment , 2020, Modern Trends in Diatom Identification.

[64]  F. Chapin,et al.  EFFECTS OF BIODIVERSITY ON ECOSYSTEM FUNCTIONING: A CONSENSUS OF CURRENT KNOWLEDGE , 2005 .

[65]  Stephen R. Carpenter,et al.  Assessing Future Ecosystem Services: a Case Study of the Northern Highlands Lake District, Wisconsin , 2003 .

[66]  S Birk,et al.  Intercalibrating classifications of ecological status: Europe's quest for common management objectives for aquatic ecosystems. , 2013, The Science of the total environment.

[67]  F. Kelly,et al.  Development and application of an ecological classification tool for fish in lakes in Ireland , 2012 .

[68]  Katarzyna Chrobak,et al.  The Use of Common Knowledge in Fuzzy Logic Approach for Vineyard Site Selection , 2020, Remote. Sens..

[69]  N. Willby,et al.  Using aquatic macrophyte community indices to define the ecological status of European lakes , 2008, Aquatic Ecology.

[70]  Samina Khalid,et al.  A survey of feature selection and feature extraction techniques in machine learning , 2014, 2014 Science and Information Conference.

[71]  N. Willby,et al.  Redundancy in the ecological assessment of lakes: Are phytoplankton, macrophytes and phytobenthos all necessary? , 2016, The Science of the total environment.

[72]  Alex Smola,et al.  Kernel methods in machine learning , 2007, math/0701907.

[73]  Kevin Leyton-Brown,et al.  An Efficient Approach for Assessing Hyperparameter Importance , 2014, ICML.

[74]  A. Dahl Achievements and gaps in indicators for sustainability , 2012 .

[75]  Yoshua Bengio,et al.  GMNN: Graph Markov Neural Networks , 2019, ICML.

[76]  Tao Li,et al.  Using discriminant analysis for multi-class classification: an experimental investigation , 2006, Knowledge and Information Systems.

[77]  M. Rask,et al.  Fish‐based assessment of ecological status of Finnish lakes loaded by diffuse nutrient pollution from agriculture , 2010 .

[78]  Jian Dong,et al.  Contextualizing Object Detection and Classification , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[79]  Alex S. Mayer,et al.  Classification of watersheds into integrated social and biophysical indicators with clustering analysis , 2014 .

[80]  Jiawei Han,et al.  Linear Discriminant Dimensionality Reduction , 2011, ECML/PKDD.

[81]  K. Wallace Classification of ecosystem services: Problems and solutions , 2007 .

[82]  Christian Igel,et al.  Evolutionary tuning of multiple SVM parameters , 2005, ESANN.

[83]  Philippe Desjardins-Proulx,et al.  Artificial Intelligence for Ecological and Evolutionary Synthesis , 2019, Front. Ecol. Evol..

[84]  Stephanie E Hampton,et al.  Open science, reproducibility, and transparency in ecology. , 2018, Ecological applications : a publication of the Ecological Society of America.

[85]  David J. Crisp,et al.  A Geometric Interpretation of ?-SVM Classifiers , 1999, NIPS 2000.

[86]  Sidneyf Elder,et al.  ELEMENTS OF SET THEORY , 1995 .

[87]  José Antonio Lozano,et al.  Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[88]  M. White,et al.  Habitat Condition Assessment System: a new way to assess the condition of natural habitats for terrestrial biodiversity across whole regions using remote sensing data , 2016 .

[89]  Jun Wang,et al.  Evaluating four downscaling methods for assessment of climate change impact on ecological indicators , 2017, Environ. Model. Softw..

[90]  S. Larsen,et al.  Ecological classification of lakes: Uncertainty and the influence of year-to-year variability , 2016 .

[91]  Eulalia Szmidt Similarity Measures between Intuitionistic Fuzzy Sets , 2014 .

[92]  Jacinto Benhadi-Marín,et al.  A conceptual framework to deal with outliers in ecology , 2018, Biodiversity and Conservation.

[93]  Julie Josse,et al.  Principal component analysis with missing values: a comparative survey of methods , 2015, Plant Ecology.

[94]  Ans Mouton,et al.  Ecological relevance of' performance criteria for species distribution models , 2010 .

[95]  N. Oppelt,et al.  Remote sensing for lake research and monitoring – Recent advances , 2016 .

[96]  Song A. An,et al.  Conceptualizing and organizing content for teaching and learning in selected Chinese, Japanese and US mathematics textbooks: the case of fraction division , 2009 .

[97]  Max Kuhn,et al.  The caret Package , 2007 .

[98]  Annukka Lehikoinen,et al.  How to value biodiversity in environmental management , 2015 .

[99]  Ioannis Manakos,et al.  Integration of satellite remote sensing data in ecosystem modelling at local scales: Practices and trends , 2018, Methods in Ecology and Evolution.

[100]  S. Larsen,et al.  Using chlorophyll a and cyanobacteria in the ecological classification of lakes , 2011 .

[101]  Aboul Ella Hassanien,et al.  Linear discriminant analysis: A detailed tutorial , 2017, AI Commun..

[102]  S. Juggins,et al.  Assessment of ecological status in UK lakes using benthic diatoms , 2014, Freshwater Science.

[103]  R W Dawson,et al.  Lake ecosystem health assessment: indicators and methods. , 2001, Water research.

[104]  G. De’ath,et al.  CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS , 2000 .

[105]  Susan Holmes,et al.  Ten quick tips for effective dimensionality reduction , 2019, PLoS Comput. Biol..

[106]  J. Romero,et al.  Ecological status of seagrass ecosystems: An uncertainty analysis of the meadow classification based on the Posidonia oceanica multivariate index (POMI). , 2011, Marine pollution bulletin.

[107]  Arturas Kaklauskas,et al.  Intelligent Decision Support Systems , 2015 .

[108]  Sami Domisch,et al.  How to make ecological models useful for environmental management , 2019, Ecological Modelling.

[109]  Shu Tao,et al.  An ecosystem health index methodology (EHIM) for lake ecosystem health assessment , 2005 .

[110]  Hiroshi Yajima,et al.  Application of the Random Forest model for chlorophyll-a forecasts in fresh and brackish water bodies in Japan, using multivariate long-term databases , 2018 .

[111]  Dennis L. Murray,et al.  Using multiple imputation to estimate missing data in meta‐regression , 2015 .

[112]  Erik Jeppesen,et al.  Submerged macrophytes as indicators of the ecological quality of lakes , 2010 .

[113]  Panayotis Panayotidis,et al.  An insight to the ecological evaluation index (EEI) , 2003 .

[114]  A. Forslund Securing water for ecosystems and human well-being: the importance of environmental flows. , 2009 .

[115]  José J. Lahoz-Monfort,et al.  Revealing beliefs: using ensemble ecosystem modelling to extrapolate expert beliefs to novel ecological scenarios , 2017 .