Predictive analytics for water main breaks using spatiotemporal data

ABSTRACT Water main breaks are a common recurring problem in water distribution networks, resulting in cascading effects in the whole system and the interconnected infrastructures such as transportation. Having integrated the physical features of pipes such as diameter and environmental factors like precipitation, we propose predictive models based on spatiotemporal data and machine learning methods. In this study, the dataset is the main breaks recorded from 2015 to 2020 in the city of Tampa, Florida. First, a spatial clustering is conducted to identify vulnerable areas to breaks. A time series analysis is also carried out for the temporal data. The result of these analyses informed the machine learning algorithms as independent variables. We then compared the predictive models based on information-based and rank-based criteria. Obtained results indicated that Boosted Regression Tree (BRT) model was superior to the others. Finally, we present predicted normalized failure rates for the water distribution network to inform rehabilitation and fortification decisions at the municipality level.

[1]  Seth D. Guikema,et al.  Prediction of water main failures with the spatial clustering of breaks , 2020, Reliab. Eng. Syst. Saf..

[2]  Andrew Wood,et al.  Using Water Main Break Data to Improve Asset Management for Small and Medium Utilities: District of Maple Ridge, B.C. , 2009 .

[3]  Balvant Rajani,et al.  Comprehensive review of structural deterioration of water mains: statistical models , 2001 .

[4]  Steven Renzetti,et al.  Buried Treasure: The Economics of Leak Detection and Water Loss Prevention in Ontario , 2013 .

[5]  Kourosh Behzadian,et al.  Pipeline failure prediction in water distribution networks using evolutionary polynomial regression combined with K-means clustering , 2017 .

[6]  Bahram Gharabaghi,et al.  Extreme learning machine model for water network management , 2017, Neural Computing and Applications.

[7]  Balvant Rajani,et al.  Comprehensive review of structural deterioration of water mains: physically based models , 2001 .

[8]  Edward A. McBean,et al.  Watermain breaks and data: the intricate relationship between data availability and accuracy of predictions , 2020 .

[9]  Edward A. McBean,et al.  IMPROVING TIME-TO-FAILURE PREDICTIONS FOR WATER DISTRIBUTION SYSTEMS USING GRADIENT BOOSTING ALGORITHM , 2018 .

[10]  H. Bozdogan,et al.  A new data adaptive elastic net predictive model using hybridized smoothed covariance estimators with information complexity , 2019, Journal of Statistical Computation and Simulation.

[11]  Mark V. E. Santana The Effect of Urbanization on the Embodied Energy of Drinking Water in Tampa, Florida , 2015 .

[12]  Seth D. Guikema,et al.  Bayesian Belief Networks for predicting drinking water distribution system pipe breaks , 2014, Reliab. Eng. Syst. Saf..

[13]  Raymond Y. K. Lau,et al.  Temporal event searches based on event maps and relationships☆ , 2019, Applied Soft Computing.

[14]  Jong Min Lee,et al.  A prioritization method for replacement of water mains using rank aggregation , 2017, Korean Journal of Chemical Engineering.

[15]  S. Hallett,et al.  Improving pipe failure predictions: Factors effecting pipe failure in drinking water networks. , 2019, Water research.

[16]  Sophie Duchesne,et al.  Validation and Comparison of Different Statistical Models for the Prediction of Water Main Pipe Breaks in a Municipal Network in Québec, Canada , 2016 .

[17]  Isam Shahrour,et al.  Application of Artificial Neural Networks (ANN) to model the failure of urban water mains , 2010, Math. Comput. Model..

[18]  Terje Aven,et al.  Risk assessment and risk management: Review of recent advances on their foundation , 2016, Eur. J. Oper. Res..

[19]  Amots Hetzroni,et al.  Getis-Ord's hot- and cold-spot statistics as a basis for multivariate spatial clustering of orchard tree data , 2015, Comput. Electron. Agric..

[20]  Solomon Tesfamariam,et al.  Prediction of Pipe Failure by Considering Time-Dependent Factors: Dynamic Bayesian Belief Network Model , 2017 .

[21]  Seth D. Guikema,et al.  Drinking Water Distribution Systems Asset Management: Statistical Modelling of Pipe Breaks , 2017 .

[22]  Hao Li,et al.  Normalization and integration of large-scale metabolomics data using support vector regression , 2016, Metabolomics.

[23]  Pablo Cortés,et al.  Prediction of pipe failures in water supply networks using logistic regression and support vector classification , 2020, Reliab. Eng. Syst. Saf..

[24]  Dragan Savic,et al.  Assessing pipe failure rate and mechanical reliability of water distribution networks using data-driven modeling , 2009 .

[25]  Hongwei Zhang,et al.  An assessment model of water pipe condition using Bayesian inference , 2010 .

[26]  Edward A. McBean,et al.  Improving Urban Water Security through Pipe-Break Prediction Models: Machine Learning or Survival Analysis , 2020 .

[27]  Seth D. Guikema,et al.  Statistical models for the analysis of water distribution system pipe break data , 2009, Reliab. Eng. Syst. Saf..

[28]  Rayid Ghani,et al.  Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks , 2018, KDD.

[29]  Guangji Hu,et al.  Managing water main breaks in distribution networks--A risk-based decision making , 2019, Reliab. Eng. Syst. Saf..

[30]  Statistical Modeling in Absence of System Specific Data: Exploratory Empirical Analysis for Prediction of Water Main Breaks , 2019, Journal of Infrastructure Systems.

[31]  Solomon Tesfamariam,et al.  Evaluating risk of water mains failure using a Bayesian belief network model , 2015, Eur. J. Oper. Res..

[32]  Yves Filion,et al.  State-of-the-art review of water pipe failure prediction models and applicability to large-diameter mains , 2017 .

[33]  Balvant Rajani,et al.  Considering Time-dependent Factors in the Statistical Prediction of Water Main Breaks , 2000 .

[34]  Mir Jafar Sadegh Safari,et al.  Pipe failure rate prediction in water distribution networks using multivariate adaptive regression splines and random forest techniques , 2019, Urban Water Journal.

[35]  S. Folkman Water Main Break Rates in the USA and Canada: A Comprehensive Study, April 2012 , 2012 .

[36]  Julie E Shortridge,et al.  Public health and pipe breaks in water distribution systems: analysis with internet search volume as a proxy. , 2014, Water research.

[37]  J. Ord,et al.  Local Spatial Autocorrelation Statistics: Distributional Issues and an Application , 2010 .

[38]  Dalwinder Singh,et al.  Investigating the impact of data normalization on classification performance , 2020, Appl. Soft Comput..

[39]  Massoud Tabesh,et al.  A comparison between performance of support vector regression and artificial neural network in prediction of pipe burst rate in water distribution networks , 2014, KSCE Journal of Civil Engineering.

[40]  Á. Martínez-Codina,et al.  Pressure as a predictor of occurrence of pipe breaks in water distribution networks , 2016 .

[41]  A. Vanrenterghem-Raven Risk Factors of Structural Degradation of an Urban Water Distribution System , 2007 .