Prediction of water main failures with the spatial clustering of breaks

Abstract Due to limited budgets and an aging system, infrastructure managers have increasingly sought cost-effective means to evaluate asset condition. This is a particular challenge for water distribution systems due to the vast amount of buried and unseen pipelines. A spatial clustering of pipe breaks fits well into a wider asset management framework with the aim of identifying regions with abnormally high failure rates. The information about spatial clusters identified using historical breaks, if and where they exist, can potentially improve predictions on the location of future breaks. In this research, we present three algorithms (poisson based, density based, and locally weighted density based) for scanning and clustering pipe break data and demonstrate their application on a real pipeline network. We also explore whether the use of spatial clusters as an explanatory variable can improve the accuracy of pipe break machine learning models. Empirical findings show that the locally weighted density scan provides the greatest precision for finding high breakage zones. The application of these clusters generally improves the performance of predictive models by helping them prioritize high risk pipes with greater accuracy.

[1]  Lina Sela,et al.  Identification of Spatial Patterns in Water Distribution Pipe Failure Data Using Spatial Autocorrelation Analysis , 2019 .

[2]  A. Salman,et al.  Performance Evaluation of Water Distribution Systems and Asset Management , 2018, Journal of Infrastructure Systems.

[3]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[4]  Andrew Wood,et al.  Using Water Main Break Data to Improve Asset Management for Small and Medium Utilities: District of Maple Ridge, B.C. , 2009 .

[5]  Gregory M. Baird A game plan for aging water infrastructure , 2010 .

[6]  Ahmad Habibian,et al.  Effect of Temperature Changes on Water‐Main Breaks , 1994 .

[7]  Hans-Peter Kriegel,et al.  Density‐based clustering , 2011, WIREs Data Mining Knowl. Discov..

[8]  Wenzhong Shi,et al.  Spatial analysis of water mains failure clusters and factors: a Hong Kong case study , 2013, Ann. GIS.

[9]  M. Kulldorff,et al.  A Space–Time Permutation Scan Statistic for Disease Outbreak Detection , 2005, PLoS medicine.

[10]  Lei Shi,et al.  Anomalous window discovery through scan statistics for linear intersecting paths (SSLIP) , 2009, KDD.

[11]  John C. Matthews,et al.  Current and Emerging Water Main Renewal Technologies , 2013 .

[12]  Lan Huang,et al.  Evaluating spatial methods for investigating global clustering and cluster detection of cancer cases , 2008, Statistics in medicine.

[13]  James H. Garrett,et al.  Detection of Patterns in Water Distribution Pipe Breakage Using Spatial Scan Statistics for Point Events in a Physical Network , 2011, J. Comput. Civ. Eng..

[14]  Juyeong Choi,et al.  Systematic approach for asset management of urban water pipeline infrastructure systems , 2017 .

[15]  Upmanu Lall,et al.  National trends in drinking water quality violations , 2018, Proceedings of the National Academy of Sciences.

[16]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[17]  Balvant Rajani,et al.  A methodology to estimate remaining service life of grey cast iron water mains , 2000 .

[18]  I. C. Goulter,et al.  Spatial and temporal groupings of water main pipe breakage in Winnipeg , 1988 .

[19]  Man Lung Yiu,et al.  Clustering objects on a spatial network , 2004, SIGMOD '04.

[20]  Seth D. Guikema,et al.  Bayesian Belief Networks for predicting drinking water distribution system pipe breaks , 2014, Reliab. Eng. Syst. Saf..

[21]  Seth D. Guikema,et al.  Optimal pipe inspection paths considering inspection tool limitations , 2019, Reliab. Eng. Syst. Saf..

[22]  Sanjiv Gokhale,et al.  Comparison of Pipeline Failure Prediction Models for Water Distribution Networks with Uncertain and Limited Data , 2015 .

[23]  James H. Garrett,et al.  A density-based spatial clustering approach for defining local indicators of drinking water distribution pipe breakage , 2011, Adv. Eng. Informatics.

[24]  Vasilis Kanakoudis,et al.  Pipe Networks Risk Assessment Based on Survival Analysis , 2011 .

[25]  M. Kulldorff A spatial scan statistic , 1997 .

[26]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[27]  Statistical Modeling in Absence of System Specific Data: Exploratory Empirical Analysis for Prediction of Water Main Breaks , 2019, Journal of Infrastructure Systems.

[28]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[29]  S. Guikema,et al.  Extreme Precipitation, Public Health Emergencies, and Safe Drinking Water in the USA , 2018, Current Environmental Health Reports.

[30]  Seth D. Guikema,et al.  Drinking Water Distribution Systems Asset Management: Statistical Modelling of Pipe Breaks , 2017 .

[31]  Balvant Rajani,et al.  Forecasting Variations and Trends in Water-Main Breaks , 2002 .

[32]  Julie E Shortridge,et al.  Public health and pipe breaks in water distribution systems: analysis with internet search volume as a proxy. , 2014, Water research.

[33]  Kourosh Behzadian,et al.  Pipeline failure prediction in water distribution networks using evolutionary polynomial regression combined with K-means clustering , 2017 .

[34]  Anthony N. Tafuri,et al.  Rehabilitation of Aging Water Infrastructure Systems: Key Challenges and Issues , 2012 .

[35]  G. Pelletier,et al.  Modeling the evolution of the structural state of water pipe networks with brief recorded pipe break histories: Methodology and application , 2000 .

[36]  Balvant Rajani,et al.  Comprehensive review of structural deterioration of water mains: physically based models , 2001 .

[37]  M Kulldorff,et al.  Spatial disease clusters: detection and inference. , 1995, Statistics in medicine.