A time series clustering approach for Building Automation and Control Systems

Abstract Structured data of all sensors and actuators are a requirement for decisions about control strategies and efficiency optimization in Building Automation. In practice, the analysis of data is a challenging and time-consuming task. In previous work, it has been demonstrated that classification algorithms may reach high classification accuracies when applied to building data. However, supervised algorithms require labelled training data sets and a predefined classes, and depend highly on the selection of input features. In this paper, we investigate how unsupervised machine learning techniques can be used to tackle both the problem of classification of time series as well as the problem of feature selection. We present a selection of the most promising algorithms and apply them on data extracted from the E.ON Energy Research Center. We then investigate the use of an unsupervised feature extraction compared to the statistical features used in previous literature by comparing the results of the classification on different data sets. Our investigations show that the unsupervised methods we apply to not find data clusters that represent the pre-defined class labels. They, however, are able to find groups of similar data points, showing that clustering is in general possible and that the time series have distinguishable properties. We also see a more robust performance of the classification algorithms when unsupervised feature extraction is used. The results of this paper show that unsupervised machine learning algorithms cannot generally mitigate the issue of missing training data. However, they can improve supervised classification by providing a more robust set of features compared to manual selection. From the clusters that where found we can derive insights about the properties of the time series, that allow us to make a better assessment which information that can be extracted using data-driven algorithms.

[1]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[2]  Johannes Fütterer,et al.  Mode and storage load based control of a complex building system with a geothermal field , 2016 .

[3]  Adel Mellit,et al.  Multiclass adaptive neuro-fuzzy classifier and feature selection techniques for photovoltaic array fault detection and classification , 2018, Renewable Energy.

[4]  Nan Li,et al.  Linking energy-cyber-physical systems with occupancy prediction and interpretation through WiFi probe-based ensemble classification , 2019, Applied Energy.

[5]  Fu Xiao,et al.  Analytical investigation of autoencoder-based methods for unsupervised anomaly detection in building energy data , 2018 .

[6]  Wenpeng Luan,et al.  Dynamic time warping based non-intrusive load transient identification , 2017 .

[7]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[8]  Yannis Manolopoulos,et al.  Feature-based classification of time-series data , 2001 .

[9]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[10]  Gerhard Zucker,et al.  Sanitation and Analysis of Operation Data in Energy Systems , 2015 .

[11]  Eamonn J. Keogh,et al.  Clustering Time Series Using Unsupervised-Shapelets , 2012, 2012 IEEE 12th International Conference on Data Mining.

[12]  Rayner Alfred The rise of machine learning for big data analytics , 2016 .

[13]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[14]  Peter Grabusts,et al.  Clustering Methodology for Time Series Mining , 2009, Sci. J. Riga Tech. Univ. Ser. Comput. Sci..

[15]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jingkun Gao,et al.  A Data-driven Meta-data Inference Framework for Building Automation Systems , 2015, BuildSys@SenSys.

[17]  Ranveer Chandra,et al.  Fall-curve: A novel primitive for IoT Fault Detection and Isolation , 2018, SenSys.

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[19]  Ranveer Chandra,et al.  Sensor Identification and Fault Detection in IoT Systems , 2018, SenSys.

[20]  Yong Shi,et al.  A review of data-driven approaches for prediction and classification of building energy consumption , 2018 .

[21]  Dirk Müller,et al.  Structuring building monitoring and automation system data , 2018 .

[22]  Ke-Lin Du,et al.  Clustering: A neural network approach , 2010, Neural Networks.

[23]  Jujie Wang,et al.  Multi-step ahead wind speed prediction based on optimal feature extraction, long short term memory neural network and error correction strategy , 2018, Applied Energy.

[24]  Ying Shi,et al.  Adaptive cost dynamic time warping distance in time series analysis for classification , 2017, J. Comput. Appl. Math..

[25]  En Zhu,et al.  Deep Clustering with Convolutional Autoencoders , 2017, ICONIP.

[26]  Kamin Whitehouse,et al.  The Building Adapter: Towards Quickly Applying Building Analytics at Scale , 2015, BuildSys@SenSys.

[27]  Kai Yang,et al.  A data-driven approach to identify households with plug-in electrical vehicles (PEVs) , 2015 .

[28]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[29]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[30]  Clayton Miller,et al.  The Building Data Genome Project: An open, public data set from non-residential building electrical meters , 2017 .

[31]  Changliang Liu,et al.  Scale-varying dynamic time warping based on hesitant fuzzy sets for multivariate time series classification , 2018, Measurement.

[32]  Dirk Müller,et al.  Application of selected supervised learning methods for time series classification in Building Automation and Control Systems , 2017 .

[33]  Jie Zhang,et al.  A data-driven multi-model methodology with deep feature selection for short-term wind forecasting , 2017 .