Big Data Analytics and Knowledge Discovery Applied to Automatic Meter Readers

The volume of data collected by a water utility is constantly growing. In this new era, data are important because guarantees the success of decisions based on the relevant values and underlying information extracted from noisy data. For instance, automatic meter reading (AMR) systems offer households and businesses the chance to understand and reduce their energy and water usage in much greater detail than previously possible, when meter readings were taken once a quarter, or even annually. Moreover, AMR could help utility firms to improve the accuracy of billing and cut visits to properties to read meters. However, with AMR, there is an exponential growth of data: an AMR produces 17500 readings per year, with a single reading every half hour. These data should be first processed in a real-time streaming, in order to be validated before being stored and translated into a metadata model which may be usable in multiple further applications. Thus, utilities have found scaling smart meter management systems difficult to handle. This motivates the use of Big Data technologies in this application domain. On the other hand, applying data analytics and knowledge discovery tools to AMR data combined with other streams of information (data coming from the billing system, call centre service and meteorological information) could help with fraud detection, maintenance requirements prediction, water/energy user consumption patterns determination and response generation to variations in the demand. This chapter presents novel algorithms and methodologies to carry out real-time streaming data processing, data analytics, data quality assessment and improvement, as well as prediction and visualization tasks, at extremely large scale and with diverse structured and unstructured data from multiple sources such as water, power, telecommunication and other utilities, as well as from social media. The algorithms and methodologies will be illustrated using real data coming from several water utilities.

[1]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[2]  Joseba Jokin Quevedo Casín,et al.  Water demand estimation and outlier detection from smart meter data using classification and Big Data methods , 2015 .

[3]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[4]  Ulf Hashagen,et al.  The first computers: history and architectures , 2000 .

[5]  Vicenç Puig,et al.  Automatic Validation of Flowmeter Data in Transport Water Networks: Application to the ATLLc Water Network , 2014, IDEAL.

[6]  Kia Aksela,et al.  Demand Estimation with Automated Meter Reading in a Distribution Network , 2011 .

[7]  A Armon,et al.  Algorithmic network monitoring for a modern water utility: a case study in Jerusalem. , 2011, Water science and technology : a journal of the International Association on Water Pollution Research.

[8]  Ming-Hui Chen,et al.  A Survey of Statistical Methods and Computing for Big Data , 2015 .

[9]  Vicenç Puig,et al.  A methodology and a software tool for sensor data validation/reconstruction : application to the Catalonia regional water network , 2016 .

[10]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[11]  Jessica Lin,et al.  Pattern Recognition in Time Series , 2012 .

[12]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[13]  Sean Andrew McKenna,et al.  Water Demand Pattern Classification from Smart Meter Data , 2014 .