Knowledge-Based Dynamic Systems Modeling: A Case Study on Modeling River Water Quality

Modeling real-world phenomena is a focus of many science and engineering efforts, from ecological modeling to financial forecasting. Building an accurate model for complex and dynamic systems improves understanding of underlying processes and leads to resource efficiency. Knowledge-driven modeling builds a model based on human expertise, yet is often suboptimal. At the opposite extreme, data-driven modeling learns a model directly from data, requiring extensive data and potentially generating overfitting. We focus on an intermediate approach, model revision, in which prior knowledge and data are combined to achieve the best of both worlds. We propose a genetic model revision framework based on tree-adjoining grammar (TAG) guided genetic programming (GP), using the TAG formalism and GP operators in an effective mechanism making data-driven revisions while incorporating prior knowledge. Our framework is designed to address the high computational cost of evolutionary modeling of complex systems. Via a case study on the challenging problem of river water quality modeling, we show that the framework efficiently learns an interpretable model, with higher modeling accuracy than existing methods.

[1]  Christos Faloutsos,et al.  Estimating Node Importance in Knowledge Graphs Using Graph Neural Networks , 2019, KDD.

[2]  Aravind K. Joshi,et al.  Tree-Adjoining Grammars , 1997, Handbook of Formal Languages.

[3]  Yun-Geun Lee,et al.  Improvement of complex and refractory ecological models: Riverine water quality modelling using evolutionary computation , 2014 .

[4]  Pei Hongping,et al.  Study on the algal dynamic model for West Lake, Hangzhou , 2002 .

[5]  Nagiza F. Samatova,et al.  Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[6]  Kangil Kim,et al.  Stable Forecasting of Environmental Time Series via Long Short Term Memory Recurrent Neural Network , 2018, IEEE Access.

[7]  Renu Vyas,et al.  Genetic Programming Applications in Chemical Sciences and Engineering , 2015, Handbook of Genetic Programming Applications.

[8]  Nguyen Xuan Hoai,et al.  Flexible representation for genetic programming : lessons from natural language processing , 2004 .

[9]  David J. Montana,et al.  Strongly Typed Genetic Programming , 1995, Evolutionary Computation.

[10]  Anuj Karpatne,et al.  Physics-guided Neural Networks (PGNN): An Application in Lake Temperature Modeling , 2017, ArXiv.

[11]  A. Malik,et al.  Artificial neural network modeling of the river water quality—A case study , 2009 .

[12]  Xin Luna Dong,et al.  MultiImport: Inferring Node Importance in a Knowledge Graph from Multiple Input Signals , 2020, KDD.

[13]  Peter A. Whigham,et al.  Grammatical bias for evolutionary learning , 1996 .

[14]  Saso Dzeroski,et al.  Computational Discovery of Scientific Knowledge , 2007, Computational Discovery of Scientific Knowledge.

[15]  Soroosh Sorooshian,et al.  Optimal use of the SCE-UA global optimization method for calibrating watershed models , 1994 .

[16]  Nguyen Xuan Hoai,et al.  Evolving the best known approximation to the Q function , 2012, GECCO '12.

[17]  U Kang,et al.  Predicting acute kidney injury in cancer patients using heterogeneous and irregular data , 2018, PloS one.

[18]  Nguyen Xuan Hoai,et al.  Ecological application of evolutionary computation: Improving water quality forecasts for the Nakdong River, Korea , 2010, IEEE Congress on Evolutionary Computation.

[19]  Geoffrey I. Webb,et al.  # 2001 Kluwer Academic Publishers. Printed in the Netherlands. Machine Learning for User Modeling , 1999 .

[20]  Chih-Chou Chiu,et al.  Financial time series forecasting using independent component analysis and support vector regression , 2009, Decis. Support Syst..

[21]  Conor Ryan,et al.  Grammatical evolution , 2007, GECCO '07.

[22]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[23]  Stephan M. Winkler,et al.  Genetic Algorithms and Genetic Programming - Modern Concepts and Practical Applications , 2009 .