Making sense of building data: New analysis methods for understanding indoor climate

Abstract This work presents a novel approach for exploratory analysis of indoor environment measurements. Building monitoring is a complex task constrained by technological and environmental limits. Monitoring processes can be long, employ multiple sensors and equipment, and may involve multiple buildings across extended periods. Once the data is collected, it's not always straightforward how to make sense of it. When faced with complex heterogeneous amounts of data, early exploratory analysis is helpful to make sense of the broad patterns, highlight data weaknesses and features, and identify directions for more in-depth analysis. Inaccuracy, incompleteness and inconsistence are common characteristics of monitored field data, which then often needs intensive pre-processing to address data quality issues and improve data analysis results. This paper describes techniques from computer science, such as decision tree induction and others, that are not normally applied to field measurements of indoor environments in buildings, and as an example applies them to a complex set of data from climate-responsive spaces in India. The findings illustrate the power of these techniques for using simplified measures as proxies to extract more sophisticated information about building operation, and for quickly identifying patterns in how different design and operational characteristics affect thermal comfort.

[1]  Benjamin C. M. Fung,et al.  Advances and challenges in building engineering and data mining applications for energy-efficient communities , 2016 .

[2]  Ali Uyar,et al.  Measuring firm performance using financial ratios: A decision tree approach , 2013, Expert Syst. Appl..

[3]  Sanyogita Manu,et al.  Field studies of thermal comfort across multiple climate zones for the subcontinent: India Model for Adaptive Comfort (IMAC) , 2016 .

[4]  Hajar Mousannif,et al.  Data quality in internet of things: A state-of-the-art survey , 2016, J. Netw. Comput. Appl..

[5]  Geoffrey Qiping Shen,et al.  Occupancy data analytics and prediction: A case study , 2016 .

[6]  Weiming Shen,et al.  Characterization of a building's operation using automation data: A review and case study , 2017 .

[7]  Yoon Ho Cho,et al.  A personalized recommender system based on web usage mining and decision tree induction , 2002, Expert Syst. Appl..

[8]  Johannes R. Sveinsson,et al.  Random Forests for land cover classification , 2006, Pattern Recognit. Lett..

[9]  Francisco Herrera,et al.  A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning , 2013, IEEE Transactions on Knowledge and Data Engineering.

[10]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[13]  Usama M. Fayyad,et al.  On the Handling of Continuous-Valued Attributes in Decision Tree Generation , 1992, Machine Learning.

[14]  Pei-Chann Chang,et al.  A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification , 2011, Appl. Soft Comput..

[15]  Karsten Menzel,et al.  Mining building performance data for energy-efficient operation , 2011, Adv. Eng. Informatics.

[16]  Kelvin K. W. Yau,et al.  Predicting electricity energy consumption: A comparison of regression analysis, decision tree and neural networks , 2007 .

[17]  Sotiris B. Kotsiantis,et al.  Decision trees: a recent overview , 2011, Artificial Intelligence Review.

[18]  Denis Allard,et al.  CART algorithm for spatial data: Application to environmental and ecological data , 2009, Comput. Stat. Data Anal..

[19]  W. Loh,et al.  SPLIT SELECTION METHODS FOR CLASSIFICATION TREES , 1997 .

[20]  Gail Brager,et al.  Performance evaluation of climate responsive buildings in India - Case studies from cooling dominated climate zones , 2019, Building and Environment.

[21]  G. Sathyadevi,et al.  Application of CART algorithm in hepatitis disease diagnosis , 2011, 2011 International Conference on Recent Trends in Information Technology (ICRTIT).

[22]  Paul M. Mather,et al.  An assessment of the effectiveness of decision tree methods for land cover classification , 2003 .

[23]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[24]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[25]  Benjamin C. M. Fung,et al.  A decision tree method for building energy demand modeling , 2010 .

[26]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[27]  David Biggs,et al.  A method of choosing multiway partitions for classification and decision trees , 1991 .

[28]  Tania Cerquitelli,et al.  Fault Detection Analysis of Building Energy Consumption Using Data Mining Techniques , 2013 .

[29]  Tianzhen Hong,et al.  A data-mining approach to discover patterns of window opening and closing behavior in offices , 2014 .

[30]  Geoff Holmes,et al.  Benchmarking Attribute Selection Techniques for Discrete Class Data Mining , 2003, IEEE Trans. Knowl. Data Eng..

[31]  P. O. Fanger,et al.  Thermal comfort: analysis and applications in environmental engineering, , 1972 .