Pattern recognition in building energy performance over time using energy benchmarking data

Abstract In recent years, many cities have adopted energy disclosure policies to better understand how energy is consumed in the urban built environment and how energy use and carbon emissions can be reduced. The diffusion of such policies has generated large-scale streams of building energy data, creating new opportunities to develop the fundamental science of urban energy dynamics. Nevertheless, there is limited research that rigorously analyzes building energy performance patterns over time. This paper provides a comprehensive framework to analyze building energy time series data and identify buildings with similar temporal energy performance patterns. We use data from approximately 15,000 properties in New York City, covering a six-year reporting period from 2011 to 2016. After pre-processing and merging the data for each constituent year, we use an unsupervised learning algorithm to optimally cluster the energy time series and statistical tests and supervised learning methods to infer how building characteristics vary between clusters. Our results show that energy reductions in New York City are mainly driven by its commercial building stock, with larger, newer, and higher-value buildings demonstrating the largest improvements in energy intensity over the study period. Moreover, voluntary energy conservation schemes are found to be more effective in boosting energy performance of commercial properties, compared to residential buildings. Our results suggest two distinct temporal patterns of energy performance for commercial and residential buildings, characterized by energy use reductions and increases. This finding highlights the differential response to energy reporting and disclosure, and presents a more complex picture of energy use dynamics over time when compared to previous studies. In order to realize significant energy use improvements over time and reach energy and carbon reduction goals, cities need to design and implement comprehensive energy policy frameworks, bringing together information transparency and reporting with targeted mandates and incentives.

[1]  David Hsu,et al.  Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data , 2015 .

[2]  Robert M. Haralick,et al.  Feature normalization and likelihood-based similarity measures for image retrieval , 2001, Pattern Recognit. Lett..

[3]  Christopher Tull,et al.  A data-driven predictive model of city-scale energy use in buildings , 2017 .

[4]  G. Mihalakakou,et al.  Using principal component and cluster analysis in the heating evaluation of the school building sector , 2010 .

[5]  Constantine Kontokosta,et al.  Low hanging fruit? Regulations and energy efficiency in subsidized multifamily housing , 2017 .

[6]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[7]  Frédéric Magoulès,et al.  A review on the prediction of building energy consumption , 2012 .

[8]  M. N. Assimakopoulos,et al.  Using intelligent clustering techniques to classify the energy performance of school buildings , 2007 .

[9]  Rishee K. Jain,et al.  Modeling the determinants of large-scale building water use: Implications for data-driven urban sustainability policy , 2015 .

[10]  Benjamin C. M. Fung,et al.  A systematic procedure to study the influence of occupant behavior on building energy consumption , 2011 .

[11]  Margaret Walls,et al.  Can Benchmarking and Disclosure Laws Provide Incentives for Energy Efficiency Improvements in Buildings? , 2015 .

[12]  Luis Pérez-Lombard,et al.  A review of benchmarking, rating and labelling concepts within the framework of building energy certification schemes , 2009 .

[13]  Constantine E. Kontokosta Modeling the energy retrofit decision in commercial office buildings , 2016 .

[14]  George Stavrakakis,et al.  Review on methodologies for energy benchmarking, rating and classification of buildings , 2011 .

[15]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[16]  Marilyn A. Brown,et al.  Machine learning approaches for estimating commercial building energy consumption , 2017 .

[17]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[18]  Luis Pérez-Lombard,et al.  A review on buildings energy consumption information , 2008 .

[19]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[20]  David Hsu,et al.  Estimating Energy Savings from Benchmarking Policies in New York City , 2017 .

[21]  Jonah “Cecil” Scheib,et al.  New York City can eliminate the carbon footprint of its buildings by 2050 , 2014 .

[22]  Constantine Kontokosta A Market-Specific Methodology for a Commercial Building Energy Performance Index , 2015 .

[23]  Jack Chin Pang Cheng,et al.  Identifying the influential features on the regional energy use intensity of residential buildings based on Random Forests , 2016 .

[24]  Margaret Walls,et al.  Using information to close the energy efficiency gap: a review of benchmarking and disclosure ordinances , 2017 .

[25]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[26]  Mikko Kolehmainen,et al.  Data-based method for creating electricity use load profiles using large amount of customer-specific hourly measured electricity use data , 2010 .

[27]  Zyad Shaaban,et al.  Data Mining: A Preprocessing Engine , 2006 .

[28]  Gareth J. Janacek,et al.  Clustering Time Series with Clipped Data , 2005, Machine Learning.

[29]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[30]  C. Rosenzweig,et al.  Cities lead the way in climate–change action , 2010, Nature.

[31]  Frank E. Harrell,et al.  Binary Logistic Regression , 2015 .

[32]  Richard Routledge Fisher's Exact Test , 2005 .

[33]  Paul Cooper,et al.  Existing building retrofits: Methodology and state-of-the-art , 2012 .

[34]  N. Nachar The Mann ‐ Whitney U: A Test for Assessing Whether Two Independent Samples Come from the Same Distribution , 2007 .