Bayesian Belief Networks for predicting drinking water distribution system pipe breaks

Abstract In this paper, we use Bayesian Belief Networks (BBNs) to construct a knowledge model for pipe breaks in a water zone. To the authors’ knowledge, this is the first attempt to model drinking water distribution system pipe breaks using BBNs. Development of expert systems such as BBNs for analyzing drinking water distribution system data is not only important for pipe break prediction, but is also a first step in preventing water loss and water quality deterioration through the application of machine learning techniques to facilitate data-based distribution system monitoring and asset management. Due to the difficulties in collecting, preparing, and managing drinking water distribution system data, most pipe break models can be classified as “statistical–physical” or “hypothesis-generating.” We develop the BBN with the hope of contributing to the “hypothesis-generating” class of models, while demonstrating the possibility that BBNs might also be used as “statistical–physical” models. Our model is learned from pipe breaks and covariate data from a mid-Atlantic United States (U.S.) drinking water distribution system network. BBN models are learned using a constraint-based method, a score-based method, and a hybrid method. Model evaluation is based on log-likelihood scoring. Sensitivity analysis using mutual information criterion is also reported. While our results indicate general agreement with prior results reported in pipe break modeling studies, they also suggest that it may be difficult to select among model alternatives. This model uncertainty may mean that more research is needed for understanding whether additional pipe break risk factors beyond age, break history, pipe material, and pipe diameter might be important for asset management planning.

[1]  S. Spencer,et al.  Virus contamination from operation and maintenance events in small drinking water distribution systems. , 2011, Journal of water and health.

[2]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[3]  S. Spencer,et al.  Risk of viral acute gastrointestinal illness from nondisinfected drinking water distribution systems. , 2012, Environmental science & technology.

[4]  A. Saltelli,et al.  Reliability Engineering and System Safety , 2008 .

[5]  Roger M. Cooke,et al.  Mining and visualising ordinal data with non-parametric continuous BBNs , 2010, Comput. Stat. Data Anal..

[6]  Gal Elidan,et al.  Copula Bayesian Networks , 2010, NIPS.

[7]  David Maxwell Chickering,et al.  Learning Bayesian Networks is , 1994 .

[8]  James H. Garrett,et al.  A density-based spatial clustering approach for defining local indicators of drinking water distribution pipe breakage , 2011, Adv. Eng. Informatics.

[9]  Robert M. Clark,et al.  Condition assessment modeling for distribution systems using shared frailty analysis , 2010 .

[10]  Renaud Escudié,et al.  Control of start-up and operation of anaerobic biofilm reactors: an overview of 15 years of research. , 2011, Water research.

[11]  Luigi Berardi,et al.  Development of pipe deterioration models for water distribution systems using EPR , 2008 .

[12]  Marco Scutari,et al.  Structure Variability in Bayesian Networks. , 2009, 0909.1685.

[13]  D. Margaritis Learning Bayesian Network Model Structure from Data , 2003 .

[14]  Luigi Portinale,et al.  Bayesian networks in reliability , 2007, Reliab. Eng. Syst. Saf..

[15]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[16]  Y. Kleiner,et al.  I-WARP: Individual Water mAin Renewal Planner , 2010 .

[17]  Jose Emmanuel Ramirez-Marquez,et al.  An automated method for estimating reliability of grid systems using Bayesian networks , 2012, Reliab. Eng. Syst. Saf..

[18]  Melinda Friedman,et al.  The potential for health risks from intrusion of contaminants into the distribution system from pressure transients. , 2003, Journal of water and health.

[19]  Julie E Shortridge,et al.  Public health and pipe breaks in water distribution systems: analysis with internet search volume as a proxy. , 2014, Water research.

[20]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[21]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[22]  Roger M. Cooke,et al.  Hybrid Method for Quantifying and Analyzing Bayesian Belief Nets , 2006, Qual. Reliab. Eng. Int..

[23]  Nir Friedman,et al.  Discretizing Continuous Attributes While Learning Bayesian Networks , 1996, ICML.

[24]  Marco Scutari,et al.  Learning Bayesian Networks with the bnlearn R Package , 2009, 0908.3817.

[25]  Norman E. Fenton,et al.  Improved reliability modeling using Bayesian networks and dynamic discretization , 2010, Reliab. Eng. Syst. Saf..

[26]  Robin Gregory,et al.  Structured Decision Making: A Practical Guide to Environmental Management Choices , 2012 .

[27]  Constantin F. Aliferis,et al.  Time and sample efficient discovery of Markov blankets and direct causal relations , 2003, KDD '03.

[28]  Lars Barregard,et al.  The association of drinking water treatment and distribution network disturbances with Health Call Centre contacts for gastrointestinal illness symptoms. , 2013, Water research.

[29]  Judea Pearl,et al.  A Theory of Inferred Causation , 1991, KR.

[30]  Michèle Prévost,et al.  Assessing the public health risk of microbial intrusion events in distribution systems: conceptual model, available data, and challenges. , 2011, Water research.

[31]  Constantin F. Aliferis,et al.  Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[32]  Ross D. Shachter Advances in Decision Analysis: Model Building with Belief Networks and Influence Diagrams , 2007 .

[33]  Detlof von Winterfeldt,et al.  Advances in decision analysis : from foundations to applications , 2007 .

[34]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[35]  J. N. R. Jeffers,et al.  Graphical Models in Applied Multivariate Statistics. , 1990 .

[36]  Ana Debón,et al.  Comparing risk of failure models in water supply networks using ROC curves , 2010, Reliab. Eng. Syst. Saf..

[37]  Barry J. Adams,et al.  Water Distribution Network Renewal Planning , 2001 .

[38]  Luigi Portinale,et al.  Improving the analysis of dependable systems by mapping fault trees into Bayesian networks , 2001, Reliab. Eng. Syst. Saf..

[39]  J. Villeneuve,et al.  Optimal replacement of water pipes , 2003 .

[40]  Jose Emmanuel Ramirez-Marquez,et al.  A generic method for estimating system reliability using Bayesian networks , 2009, Reliab. Eng. Syst. Saf..

[41]  A. Vanrenterghem-Raven Risk Factors of Structural Degradation of an Urban Water Distribution System , 2007 .

[42]  Seth D. Guikema,et al.  Statistical models for the analysis of water distribution system pipe break data , 2009, Reliab. Eng. Syst. Saf..

[43]  Gal Elidan,et al.  Speedy Model Selection (SMS) for Copula Models , 2013, UAI.

[44]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[45]  Preben Aavitsland,et al.  Breaks and maintenance work in the water distribution systems and gastrointestinal illness: a cohort study. , 2007, International journal of epidemiology.

[46]  D.,et al.  Regression Models and Life-Tables , 2022 .

[47]  James H. Garrett,et al.  Detection of Patterns in Water Distribution Pipe Breakage Using Spatial Scan Statistics for Point Events in a Physical Network , 2011, J. Comput. Civ. Eng..

[48]  Marc Parizeau,et al.  Multiobjective Approach for Pipe Replacement Based on Bayesian Inference of Break Model Parameters , 2009 .

[49]  D. Kurowicka,et al.  Mixed Non-Parametric Continuous and Discrete Bayesian Belief Nets , 2007 .

[50]  Adnan Darwiche,et al.  Modeling and Reasoning with Bayesian Networks , 2009 .

[51]  S. Rahman Reliability Engineering and System Safety , 2011 .

[52]  Ben J. M. Ale,et al.  Analysis of the Schiphol Cell Complex fire using a Bayesian belief net based model , 2012, Reliab. Eng. Syst. Saf..

[53]  Kristian G. Olesen,et al.  HUGIN - A Shell for Building Bayesian Belief Universes for Expert Systems , 1989, IJCAI.

[54]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[55]  大西 仁,et al.  Pearl, J. (1988, second printing 1991). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan-Kaufmann. , 1994 .

[56]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[57]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.