Application of unsupervised learning and process simulation for energy optimization of a WWTP under various weather conditions.

This paper outlines a hybrid modeling approach to facilitate weather-based operation and energy optimization for the largest Italian wastewater treatment plant (WWTP). Two clustering methods, K-means algorithm and Gaussian mixture model (GMM) based on the expectation-maximization (EM) algorithm, were applied to an extensive dataset of historical and meteorological records. This study addresses the problem of determining the intrinsic structure of clustered data when no information other than the observed values is available. Two quantitative indexes, namely the Bayesian information criterion (BIC) and the Silhouette coefficient using Euclidean distance, as well as two general criteria, were implemented to assess the clustering quality. Furthermore, seven weather-based influent scenarios were introduced to the process simulation model, and sets of aeration strategies are proposed. The results indicate that incorporating weather-based aeration strategies in the operation of the WWTP improves plant energy efficiency.

[1]  Ahmet Altin,et al.  Flow-rate and pollution characteristics of domestic wastewater , 2003 .

[2]  WeiXuan Hu The application of artificial neural network in wastewater treatment , 2011, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[3]  W. M. Alley,et al.  Distributed routing rainfall-runoff model; version II , 1982 .

[4]  G. A. Ekama,et al.  A GENERAL MODEL FOR THE ACTIVATED SLUDGE PROCESS , 1981 .

[5]  Douglas A. Reynolds Gaussian Mixture Models , 2009, Encyclopedia of Biometrics.

[6]  Maria Chiara Zanetti,et al.  Assessment of weather-based influent scenarios for a WWTP: Application of a pattern recognition technique. , 2019, Journal of environmental management.

[7]  Diego R. Amancio,et al.  Authorship attribution based on Life-Like Network Automata , 2016, PloS one.

[8]  J. B. Copp,et al.  Benchmark Simulation Model No 2: finalisation of plant layout and default control strategy. , 2010, Water science and technology : a journal of the International Association on Water Pollution Research.

[9]  J. A. Mueller,et al.  Aeration: principles and practice. , 2002 .

[10]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[11]  Ingmar Nopens,et al.  Data scarcity in modelling and simulation of a large-scale WWTP: Stop sign or a challenge , 2019, Journal of Water Process Engineering.

[12]  Ingmar Nopens,et al.  Data Mining Application in Assessment of Weather-Based Influent Scenarios for a WWTP: Getting the Most Out of Plant Historical Data , 2018, Water, Air, & Soil Pollution.

[13]  David Hsu,et al.  Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data , 2015 .

[14]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[15]  Lulu Kang,et al.  Predicting influent biochemical oxygen demand: Balancing energy demand and risk management. , 2018, Water research.

[16]  Paul R. Anderson,et al.  Defining Influent Scenarios: Application of Cluster Analysis to a Water Reclamation Plant , 2015 .

[17]  Barbara Ruffino,et al.  MODELLING AND CALIBRATION OF THE FULL SCALE WWTP WITH DATA SCARCITY , 2016 .

[18]  David G. Stork,et al.  Pattern Classification , 1973 .

[19]  E. Catmull,et al.  A CLASS OF LOCAL INTERPOLATING SPLINES , 1974 .

[20]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[21]  D. Muschalla,et al.  Potential and limitations of modern equipment for real time control of urban wastewater systems , 2013 .

[22]  Silvia Fiore,et al.  Evaluation of the energy efficiency of a large wastewater treatment plant in Italy , 2016 .

[23]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[24]  Abbas Yeganeh-Bakhtiary,et al.  Wave runup prediction using M5′ model tree algorithm , 2016 .

[25]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[26]  I. Takács A dynamic model of the clarification-thickening process , 1991 .

[27]  Dulakshi S. K. Karunasinghe,et al.  Chaotic time series prediction with a global model: Artificial neural network , 2006 .

[28]  Huu Hao Ngo,et al.  A mini-review on the impacts of climate change on wastewater reclamation and reuse. , 2014, The Science of the total environment.

[29]  Richard O. Mines,et al.  The Impact of Rainfall on Flows and Loadings at Georgia’s Wastewater Treatment Plants , 2007 .

[30]  J. Monod The Growth of Bacterial Cultures , 1949 .

[31]  Adrian J. Saul,et al.  Specific relationships for the first flush load in combined sewer flows , 1996 .

[32]  Peter A. Vanrolleghem,et al.  Analysing, completing, and generating influent data for WWTP modelling: A critical review , 2014, Environ. Model. Softw..

[33]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[34]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[35]  Mogens Henze,et al.  Activated sludge models ASM1, ASM2, ASM2d and ASM3 , 2015 .

[36]  Douglas A. Reynolds,et al.  Gaussian Mixture Models , 2018, Encyclopedia of Biometrics.