Combining multiway principal component analysis (MPCA) and clustering for efficient data mining of historical data sets of SBR processes.

A methodology based on Principal Component Analysis (PCA) and clustering is evaluated for process monitoring and process analysis of a pilot-scale SBR removing nitrogen and phosphorus. The first step of this method is to build a multi-way PCA (MPCA) model using the historical process data. In the second step, the principal scores and the Q-statistics resulting from the MPCA model are fed to the LAMDA clustering algorithm. This procedure is iterated twice. The first iteration provides an efficient and effective discrimination between normal and abnormal operational conditions. The second iteration of the procedure allowed a clear-cut discrimination of applied operational changes in the SBR history. Important to add is that this procedure helped identifying some changes in the process behaviour, which would not have been possible, had we only relied on visually inspecting this online data set of the SBR (which is traditionally the case in practice). Hence the PCA based clustering methodology is a promising tool to efficiently interpret and analyse the SBR process behaviour using large historical online data sets.

[1]  M Martínez,et al.  Case-based reasoning, a promising tool to face solids separation problems in the activated sludge process. , 2006, Water science and technology : a journal of the International Association on Water Pollution Research.

[2]  Peter A Vanrolleghem,et al.  Monitoring of a sequencing batch reactor using adaptive multiblock principal component analysis. , 2003, Biotechnology and bioengineering.

[3]  Gustaf Olsson,et al.  Disturbance detection in wastewater treatment plants , 1998 .

[4]  Dale E. Seborg,et al.  Pattern Matching in Multivariate Time Series Databases Using a Moving-Window Approach , 2002 .

[5]  Gürkan Sin,et al.  A calibration methodology and model-based systems analysis for SBRs removing nutrients under limited aeration conditions , 2006 .

[6]  P A Vanrolleghem,et al.  Evaluation of the impacts of model-based operation of SBRs on activated sludge microbial community. , 2006, Water science and technology : a journal of the International Association on Water Pollution Research.

[7]  Zhiguo Yuan,et al.  Sludge population optimisation: a new dimension for the control of biological wastewater treatment systems. , 2002, Water research.

[8]  C. Yoo,et al.  Nonlinear process monitoring using kernel principal component analysis , 2004 .

[9]  S. Joe Qin,et al.  Joint diagnosis of process and sensor faults using principal component analysis , 1998 .

[10]  J. Macgregor,et al.  Monitoring batch processes using multiway principal component analysis , 1994 .

[11]  P A Vanrolleghem,et al.  Optimal but robust N and P removal in Sbrs: a model-based systematic study of operation scenarios. , 2004, Water science and technology : a journal of the International Association on Water Pollution Research.

[12]  C Rosen,et al.  Multivariate and multiscale monitoring of wastewater treatment operation. , 2001, Water research.

[13]  Peter A Vanrolleghem,et al.  Adaptive multiscale principal component analysis for on-line monitoring of a sequencing batch reactor. , 2005, Journal of biotechnology.