Constructing a model hierarchy with background knowledge for structural risk minimization: application to biological treatment of wastewater

This article introduces a novel approach to the issue of learning from empirical data coming from complex systems that are continuous, dynamic, highly nonlinear, and stochastic. The main feature of this approach is that it attempts to integrate the powerful statistical learning theoretic methods and the valuable background knowledge that one possesses about the system under study. The learning machines that have been used, up to now, for the implementation of Vapnik's inductive principle of structural risk minimization (IPSRM) are of the "black-box" type, such as artificial neural networks, ARMA models, or polynomial functions. These are generic models that contain absolutely no knowledge about the problem at hand. They are used to approximate the behavior of any system and are prodigal in their requirements of training data. In addition, the conditions that underlie the theory of statistical learning would not hold true when these "black-box" models are used to describe highly complex systems. In this paper, it is argued that the use of a learning machine whose structure is developed on the basis of the physical mechanisms of the system under study is more advantageous. Such a machine will indeed be specific to the problem at hand and will require many less data points for training than their black-box counterparts. Furthermore, because this machine contains background knowledge about the system, it will provide better approximations of the various dynamic modes of this system and will, therefore, satisfy some of the prerequisites that are needed for meeting the conditions of statistical learning theory (SLT). This paper shows how to develop such a mechanistically based learning machine (i.e., a machine that contains background knowledge) for the case of biological wastewater treatment systems. Fuzzy logic concepts, combined with the results of the research in the area of wastewater engineering, will be utilized to construct such a machine. This machine has a hierarchical property and can, therefore, be used to implement the IPSRM.

[1]  Guillaume Patry,et al.  Using Statistical Learning Theory to Rationalize System Model Identification and Validation Part I: Mathematical Foundations , 2003, Complex Syst..

[2]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[3]  花田 茂久,et al.  Activated Sludge Model No.3 (ASM3) へのリン除去モジュールの導入とそのキャリブレーション方法の開発 , 2004 .

[4]  R. Fisher,et al.  Contributions to Mathematical Statistics , 1951 .

[5]  Mogens Henze,et al.  The Activated Sludge Model No. 2: Biological Phosphorus Removal , 1995 .

[6]  G. Marais,et al.  Evaluation of the General Activated Sludge Model Proposed by the IAWPRC Task Group , 1986 .

[7]  Thorsten Neuerer,et al.  Biological wastewater treatment plants of BIOGEST ® , 2022 .

[8]  G. A. Ekama,et al.  A GENERAL MODEL FOR THE ACTIVATED SLUDGE PROCESS , 1981 .

[9]  Aziz Guergachi Connecting traditional sciences with the OLAP and data mining paradigms , 2003, SPIE Defense + Commercial Sensing.

[10]  윤석표 Activated Sludge Model No.2를 이용한 하수의 생물학적 질소·인 제거 공정의 처리성 평가 , 1999 .

[11]  Guillaume Patry,et al.  Statistical learning theory, model identification and system information content , 2002, Int. J. Gen. Syst..

[12]  Abdelaziz. Guergachi Uncertainty management in the activated sludge process: Innovative applications of computational learning theory. , 2000 .

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[15]  Stefano Marsili-Libelli,et al.  Modelling, Identification and Control of the Activated Sludge Process , 1989, Lignocellulosic Materials.

[16]  W. Gujer,et al.  A general model for single-sludge wastewater treatment systems , 1987 .

[17]  Ulf Jeppsson,et al.  Modelling Aspects of Wastewater Treatment Processes , 1996 .