Automatic socio-economic classification of households using electricity consumption data

Interest in analyzing electricity consumption data of private households has grown steadily in the last years. Several authors have for instance focused on identifying groups of households with similar consumption patterns or on providing feedback to consumers in order to motivate them to reduce their energy consumption. In this paper, we propose to use electricity consumption data to classify households according to pre-defined "properties" of interest. Examples of these properties include the floor area of a household or the number of its occupants. Energy providers can leverage knowledge of such household properties to shape premium services (e.g., energy consulting) for their customers. We present a classification system - called CLASS - that takes as input electricity consumption data of a private household and provides as output the estimated values of its properties. We describe the design and implementation of CLASS and evaluate its performance. To this end, we rely on electricity consumption traces from 3,488 private households, collected at a 30-minute granularity and for a period of more than 1.5 years. Our evaluation shows that CLASS - relying on electricity consumption data only - can estimate the majority of the considered household properties with more than 70% accuracy. For some of the properties, CLASS's accuracy exceeds 80%. Furthermore, we show that for selected properties the use of a priori information can increase classification accuracy by up to 11%.

[1]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Ian Beausoleil-Morrison,et al.  Disaggregating categories of electrical energy end-use from whole-house hourly data , 2012 .

[3]  Herbert Burkert,et al.  Some Preliminary Comments on the DIRECTIVE 95/46/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 24 October 1995 on the protection of individuals with regard to the processing of personal data and on the free movement of such data. , 1996 .

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  Ignacio Benitez Sanchez,et al.  Clients segmentation according to their domestic energy consumption by the use of self-organizing maps , 2009, 2009 6th International Conference on the European Energy Market.

[6]  Michael Conlon,et al.  Characterising domestic electricity consumption patterns by dwelling and occupant socio-economic variables: An Irish case study , 2012 .

[7]  Daswin De Silva,et al.  A Data Mining Framework for Electricity Consumption Analysis From Meter Data , 2011, IEEE Transactions on Industrial Informatics.

[8]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[9]  Michael Zeifman,et al.  Nonintrusive appliance load monitoring: Review and outlook , 2011, IEEE Transactions on Consumer Electronics.

[10]  Stéphane Rodrigues,et al.  Third “Energy Package”: The European Parliament and the Council adopt the third “Energy Package” which includes two directives repealing and replacing the 2003 directives concerning common rules for both electricity and natural gas internal markets , 2009 .

[11]  J. Zico Kolter,et al.  A Large-Scale Study on Predicting and Contextualizing Building Energy Usage , 2011, AAAI.

[12]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[13]  Ralf Steinmetz,et al.  On the accuracy of appliance identification based on distributed load metering data , 2012, 2012 Sustainable Internet and ICT for Sustainability (SustainIT).

[14]  Ian Witten,et al.  Data Mining , 2000 .

[15]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[16]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[17]  Manish Marwah,et al.  Unsupervised Disaggregation of Low Frequency Power Measurements , 2011, SDM.

[18]  Raquel Fandos ADAC system design and its application to mine hunting using SAS imagery , 2012 .

[19]  Silvia Santini,et al.  Towards automatic classification of private households using electricity consumption data , 2012, BuildSys@SenSys.

[20]  H. P Gassmann,et al.  OECD guidelines governing the protection of privacy and transborder flows of personal data , 1981 .

[21]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[22]  G. W. Hart,et al.  Nonintrusive appliance load monitoring , 1992, Proc. IEEE.

[23]  Mikko Kolehmainen,et al.  Reducing energy consumption by using self-organizing maps to create more personalized electricity use information , 2008 .

[24]  William J. Kirsch,et al.  The protection of privacy and transborder flows of personal data: the work of the Council of Europe, the Organization for Economic Co-operation and Development and the European Economic Community , 1982, Legal Issues of Economic Integration.

[25]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[26]  Francisco C. Pereira,et al.  Using pattern recognition to identify habitual behavior in residential electricity consumption , 2012 .

[27]  P. Postolache,et al.  Customer Characterization Options for Improving the Tariff Offer , 2002, IEEE Power Engineering Review.

[28]  Vasconcelos,et al.  Survey of Regulatory and Technological Developments Concerning Smart Metering in the European Union Electricity Market , 2008 .

[29]  Z. Vale,et al.  An electric energy consumer characterization framework based on data mining techniques , 2005, IEEE Transactions on Power Systems.

[30]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[31]  Friedemann Mattern,et al.  ICT for Green – How Computers Can Help Us to Conserve Energy , 2010 .

[32]  Jian Liang,et al.  Load Signature Study—Part I: Basic Concept, Structure, and Methodology , 2010, IEEE Transactions on Power Delivery.

[33]  J. Contreras,et al.  Forecasting Next-Day Electricity Prices by Time Series Models , 2002, IEEE Power Engineering Review.

[34]  Corinna Fischer Feedback on household electricity consumption: a tool for saving energy? , 2008 .

[35]  C. Senabre,et al.  Classification, Filtering, and Identification of Electrical Customer Load Patterns Through the Use of Self-Organizing Maps , 2006, IEEE Transactions on Power Systems.