Examining the feasibility of using open data to benchmark building energy usage in cities: A data science and policy perspective

Buildings are by far the largest source of urban energy consumption. In an effort to reduce energy use, cities are mandating that buildings undergo energy benchmarking—the process of measuring building energy performance in order to identify buildings that are inefficient. In this paper, we examine the feasibility of using city-specific, public open data sources in two benchmarking models and compare the results to the same models when using the Commercial Building Energy Consumption Survey (CBECS) dataset, the basis for Energy Star. The two benchmarking models use datasets containing building characteristics and annual energy use from ten major cities. To examine the difference in performance between linear and non-linear models, we use random forest and lasso regression. Results demonstrate that benchmarking models using open data outperform models based solely on the CBECS dataset. Additionally, our results indicate that building area, property type, conditioned area, and water usage are the most important variables for cities to collect. Having demonstrated the benefits of using open data, we recommend two changes to current benchmarking practices: (1) new guidelines that support a data-driven benchmarking framework relying on open data and a transparent modeling process and (2) supporting policies that publicize benchmarking results and incentivize energy savings.

[1]  Jack Chin Pang Cheng,et al.  Identifying the influential features on the regional energy use intensity of residential buildings based on Random Forests , 2016 .

[2]  Margaret Walls,et al.  Using information to close the energy efficiency gap: a review of benchmarking and disclosure ordinances , 2017 .

[3]  G. Jin,et al.  The Effect of Information on Product Quality: Evidence from Restaurant Hygiene Grade Cards , 2002 .

[4]  Robert Cohen,et al.  Mandating transparency about building energy performance in use , 2015 .

[5]  David Hsu,et al.  Improving energy benchmarking with self-reported data , 2014 .

[6]  Maarten Jennen,et al.  The impact of energy labels and accessibility on office rents , 2012 .

[7]  David Hsu,et al.  How much information disclosure of building energy performance is necessary , 2014 .

[8]  Weimin Wang,et al.  Energy Savings Modeling of Standard Commercial Building Re- tuning Measures: Large Office Buildings , 2012 .

[9]  Zheng Yang,et al.  DUE-B: Data-driven urban energy benchmarking of buildings using recursive partitioning and stochastic frontier analysis , 2018 .

[10]  Ram Rajagopal,et al.  Data-Driven Benchmarking of Building Energy Efficiency Utilizing Statistical Frontier Models , 2014, J. Comput. Civ. Eng..

[11]  M. Filippini US Residential Energy Demand and Energy Efficiency: A Stochastic Demand Frontier Approach , 2012 .

[12]  Steven R. Schiller,et al.  Evaluation of U.S. Building Energy Benchmarking and Transparency Programs: Attributes, Impacts, and Best Practices , 2017 .

[13]  Xuan Luo,et al.  Development of city buildings dataset for urban building energy modeling , 2019, Energy and Buildings.

[14]  Patrik Thollander,et al.  Extending the Energy efficiency gap , 2012 .

[15]  Denise Young,et al.  The potential for energy efficiency gains in the Canadian commercial building sector: A stochastic frontier study , 2007 .

[16]  K. Gillingham,et al.  Bridging the Energy Efficiency Gap: Policy Insights from Economic Theory and Empirical Evidence , 2013, Review of Environmental Economics and Policy.

[17]  Dino Bouchlaghem,et al.  Predicted vs. actual energy performance of non-domestic buildings: Using post-occupancy evaluation data to reduce the performance gap , 2012 .

[18]  Constantine Kontokosta,et al.  Grading buildings on energy performance using city benchmarking data , 2019, Applied Energy.

[19]  John H. Scofield,et al.  Do LEED-certified buildings save energy? Not really… , 2009 .

[20]  A. Gouldson,et al.  The rationale for energy efficiency policy: assessing the recognition of the multiple benefits of energy efficiency retrofit policy , 2017 .

[21]  Ali M. Malkawi,et al.  A new methodology for building energy performance benchmarking: An approach based on intelligent clustering algorithm , 2014 .

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  Melissa R. Wong,et al.  Impact of a letter-grade program on restaurant sanitary conditions and diner behavior in New York City. , 2015, American journal of public health.

[24]  William Chung,et al.  Review of building energy-use performance benchmarking methodologies , 2011 .

[25]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[26]  M. Stone,et al.  The challenge of new marketing issues , 2010 .

[27]  Rishee K. Jain,et al.  Data-Driven, Multi-metric, and Time-Varying (DMT) Building Energy Benchmarking Using Smart Meter Data , 2018, EG-ICE.

[28]  Ram Rajagopal,et al.  Benchmarking building energy efficiency using quantile regression , 2018, Energy.

[29]  David Hsu,et al.  Estimating Energy Savings from Benchmarking Policies in New York City , 2017 .

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  Iain Staffell,et al.  The importance of open data and software: Is energy research lagging behind? , 2017 .

[32]  Anil Markandya,et al.  Electricity generation and health , 2007, The Lancet.

[33]  Parham A. Mirzaei,et al.  Urban heat island effect of a typical valley city in China: Responds to the global warming and rapid urbanization , 2018 .

[34]  Marilyn A. Brown,et al.  Machine learning approaches for estimating commercial building energy consumption , 2017 .

[35]  Charlie Wilson,et al.  The appeal of the green deal: Empirical evidence for the influence of energy efficiency policy on renovating homeowners , 2015 .

[36]  Mary Ann Piette,et al.  Automatic generation and simulation of urban building energy models based on city datasets for city-scale building retrofit analysis , 2017 .

[37]  Melek Yalcintas,et al.  An energy benchmarking model based on artificial neural network method utilizing US Commercial Buildings Energy Consumption Survey (CBECS) database , 2007 .

[38]  Zachary Hart,et al.  The benefits of benchmarking building performance , 2016 .

[39]  L. L. Doove,et al.  Recursive partitioning for missing data imputation in the presence of interaction effects , 2014, Comput. Stat. Data Anal..

[40]  Christopher Tull,et al.  A data-driven predictive model of city-scale energy use in buildings , 2017 .

[41]  F. Creutzig,et al.  Global typology of urban energy use and potentials for an urbanization mitigation wedge , 2015, Proceedings of the National Academy of Sciences.

[42]  Constantine Kontokosta,et al.  Pattern recognition in building energy performance over time using energy benchmarking data , 2018, Applied Energy.

[43]  David Hsu Identifying key variables and interactions in statistical models of building energy consumption using regularization , 2015 .

[44]  Vijay Modi,et al.  Spatial distribution of urban building energy consumption by end use , 2012 .

[45]  Peng Zhou,et al.  A survey of data envelopment analysis in energy and environmental studies , 2008, Eur. J. Oper. Res..